BIOINFORMATICS Introduction To Bioinformatics Course: B.Sc Biochemistry Subject: Basic of Bioinformatics Unit: I
Jul 14, 2015
BIO
INFO
RM
ATIC
S
Introduction To Bioinformatics
Course: B.Sc BiochemistrySubject: Basic of Bioinformatics
Unit: I
BIO
INFO
RM
ATIC
S
BIOINFORMATICS
• Introduction to bioinformatics• History• Aim scope and importance of bioinformatics• Origin of bioinformatics its application and benefits in life science
industry.• Research areas of bioinformatics.• Fundamentals of Internet: WWW, HTML, URLs Browsers,
Netscape/Opera/Explorer Search Engines: Google, NCBI, PUBMED, entraz
BIO
INFO
RM
ATIC
S
INTRODUCTION TO BIOINFORMATICS
• Definition: Bioinformatics is an interdisciplinary scientific field that develops methods and software tools for storing, retrieving, organizing and analyzing biological data. As an interdisciplinary field, bioinformatics combines computer Science, statistics, mathematics and engineering to study biological data and processes.
BIO
INFO
RM
ATIC
S
P. Paulsharma Chakravarthy
WHAT IS BIOINFORMATICS?
1.
BIO
INFO
RM
ATIC
SBIOINFORMATICS DEALS WITH
• Design and implementation of new algorithms and statistics which assess relationship among members of large data sets.
• Analysis and interpretation of various data types, which includes nucleotide and amino acid sequences and structure of protein.
• To develop computational tools and databases that enables efficient analysis, access and management of biologically significant information.
BIO
INFO
RM
ATIC
SAIMS OF BIOINFORMATICS
• Improve content and utility of databases.• Develop better tools for data generation, capture, and annotation.• Develop and improve tools and databases for comprehensive
functional studies.• Develop and improve tools for representing and analyzing sequence
similarity and variation.• Create mechanisms to support effective approaches for producing
robust, exportable software that can be widely shared.
BIO
INFO
RM
ATIC
SGOALS OF BIOINFORMATICS
• Development and implementation of computer programs that enable efficient access to , use and management of various type of information
• Development of new algorithms and statistical measures with which to assess relationships among members of large data.
• Understanding the biological process• Pattern reorganization• Seq. alignment• Gene finding• Assembly• Drug designing• Protein structure alignment• Gene expression• Genome annotation
BIO
INFO
RM
ATIC
SHISTORY OF BIOINFORMATICS
BIO
INFO
RM
ATIC
S
The IBM 7090 computer, which Margaret Dayhoff used for her early workMargaret Oakley Dayhoff (Ruth Dayhoff's mother), who created the first large scale database of protein sequences, and developed many of the algorithms for analysis of relatedness of proteins, early 1970s
2.
BIO
INFO
RM
ATIC
S
BIO
INFO
RM
ATIC
S
WHAT ARE THE BIOLOGICAL PROBLEMS?
• What is the role of a particular gene?• Does a particular gene help cause a disease?• How does a drug affect a cell?• Can we insert a gene into corn to protect it against diseases or
pests?• Can we design a drug to accomplish a particular purpose?• Can we build a cell that eats pollution?
BIO
INFO
RM
ATIC
SBIOLOGICAL PROBLEMS THAT COMPUTER CAN HELP
BIO
INFO
RM
ATIC
S
BIO
INFO
RM
ATIC
SWhy Is Bioinformatics So Important?
• Global perspective in experimental design the ability to capitalize on the emerging technology of database-mining - the process by which testable hypotheses are generated regarding the function or structure of a gene or protein of interest by identifying similar sequences in better characterized organisms.
• Although a human disease may not be found in exactly the same form in animals, there may be sufficient data for an animal model that allow researchers to make inferences about the process in humans.
BIO
INFO
RM
ATIC
S
3.
BIO
INFO
RM
ATIC
S>100,000 species are represented in GenBank
all species 128,941
viruses 6,137
bacteria 31,262
archaea 2,100
eukaryota 87,147
BIO
INFO
RM
ATIC
S
BIO
INFO
RM
ATIC
SNECESSARY SKILL SETS
• Knowledge in Molecular Biology• Statistics• Mathematics (algorithm development)• Communicate biological problems to computer scientists• Working knowledge in bioinformatics tools• Computer proficiency (windows/command line)• Programming Skills• Data administration
BIO
INFO
RM
ATIC
SMOLECULAR BIOLOGY KNOWELEDGE
4.
BIO
INFO
RM
ATIC
S
BIO
INFO
RM
ATIC
S
BIO
INFO
RM
ATIC
S
BIO
INFO
RM
ATIC
S
BIO
INFO
RM
ATIC
S
BIO
INFO
RM
ATIC
S
BIO
INFO
RM
ATIC
S
5.
BIO
INFO
RM
ATIC
SPROTEIN STRUCUTRE AND FUNCTION
6.
BIO
INFO
RM
ATIC
S
7.
BIO
INFO
RM
ATIC
S
BIO
INFO
RM
ATIC
SAPPLICATIONS OF BIOINFORMATICS
• Knowledge-based drug design, forensic DNA analysis, and agricultural biotechnology.
• Computational studies of protein–ligand interactions• Knowledge of the three-dimensional structures of proteins• This informatics-based approach significantly reduces the time and
cost necessary to develop drugs with higher potency, fewer side effects, and less toxicity than using the traditional trial-and-error approach.
• In forensics, results from molecular phylogenetic analysis have been accepted as evidence
• Genomics and bioinformatics are now poised to revolutionize our healthcare system by developing personalized and customizing
medicine
BIO
INFO
RM
ATIC
S
• Bioinformatics tools are being used in agriculture as well. • Plant genome databases and gene expression profile analyses have
played an important role in the development of new crop varieties that have higher productivity and more resistance to disease.
APPLICATIONS OF BIOINFORMATICS
8.
BIO
INFO
RM
ATIC
SAPPLICATIONS OF BIOINFORMATICS
9.
BIO
INFO
RM
ATIC
S
10.
BIO
INFO
RM
ATIC
S
.
LIMITATION OF BIOINFORMATICS
• Bioinformatics is Bioinformatics and experimental biology are independent, but complementary, activities.
• Bioinformatics depends on experimental science to produce raw data for analysis. It, in turn, provides useful interpretation of experimental data and important leads for further experimentalresearch.
• Quality of bioinformatics predictions depends on the quality of data and the sophistication of the algorithms being used.
• Sequence data from high throughput analysis often contain errors. If the sequences are wrong or annotations incorrect, the results from the downstream analysis are misleading as well.
• They often make incorrect predictions that make no sense when placed in a biological context. Errors in sequence alignment.
BIO
INFO
RM
ATIC
SSPECIALIZED FIELDS
• Computational Biology• Genomics• Proteomics• Bioprogramming• Cheminformatics• Structural Biology• Systems Biology• Pharmacogenomics
BIO
INFO
RM
ATIC
SMAJOR RESEARCH AREA
• Computational evolutionary biology• Sequence analysis• Genome annotation
• Analysis of regulation• In cancer Analysis of mutations
• Comparative genomics
BIO
INFO
RM
ATIC
S
Origin of internet
The management and, more importantly, accessibility of the data is directly attributable to the development of the Internet, particularly the World Wide Web (WWW).Originally developed for military purposes in the 60's and expanded by the National Science Foundation in the 80's, scientific use of the Internet grew dramatically following the release of the WWW by CERN in 1992.
BIO
INFO
RM
ATIC
S
HTML
• The WWW is a graphical interface based on hypertext by which text and graphics can be displayed and highlighted.
• Each highlighted element is a pointer to another document or an element in another document which can reside on any internet host computer. Page display, hypertext links and other features are coded using a simple, cross-platform HyperText Markup Language (HTML) and viewed on UNIX workstations, PCs and Apple Macs as WWW pages using a browser.
BIO
INFO
RM
ATIC
SASSIGNMENT -1--------------------------------10 Marks
1. What is bioinformatics? Why do people consider it as an interdisciplinary subject
2. What are the necessary skills required for bioinformatics study?
3. Write about history of bioinformatics.
4. Describe the application of bioinformatics.
5. What is NCBI?
6. What is Entraz?
7. What is search engine?
8. What is PubMed?
9. Enlist the major research areas of bioinformatics.
10. Give the importance of bioinformatics in biological science.
BIO
INFO
RM
ATIC
STest-1-----------------------------------------------10 Marks 1. What is bioinformatics?-------------------------------------------------1
2. Describe the application of bioinformatics.------------------------2
3. What is NCBI?-------------------------------------------------------------1
4. What is Entraz?-----------------------------------------------------------1
5. What is PubMed?---------------------------------------------------------1
6. Enlist the major research areas of bioinformatics.---------------4
BIO
INFO
RM
ATIC
S
Web and Books References
• Books Name :
1. Introduction To Bioinformatics by T. K. Attwood
2. BioInformatics by Sangita
3. Basic Bioinformatics by S.Ignacimuthu, s.j.
4. Application of bioinformatics by david mount
• http://en.wikipedia.org/wiki/Bioinformatics• http://bioinformaticsweb.net/applications.html• http://bioinformaticsinstitute.ru/sites/default/files/lapidus_1_0.pdf• History:
http://www.bioinformatics.nl/webportal/background/timeline.html• Links: http://www.ncbi.nlm.nih.gov
41
BIO
INFO
RM
ATIC
S
Image References• 1. http://www.esat.kuleuven.be/sista/GGS/images/bioi.jpg• 2. http://www.nature.com/nrg/journal/v1/n3/images/nrg1200-231a-f4.jpg• 3. https://encrypted-tbn2.gstatic.com/images?
q=tbn:ANd9GcSg58MFhqEDknaHb3nHWmkP6mMgUbCjD7ASmqII4befeovn566L9w
• 4. http://www.ncbi.nlm.nih.gov/Class/MLACourse/Modules/MolBioReview/images/central_dogma.gif
• 5. https://encrypted-tbn3.gstatic.com/images?q=tbn:ANd9GcRx8BAquGrTstZrdTzPvvfn2ItAnPVGsjK03JfzMZF6SbpdasCFAw
• 6.https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRc-CJZHa1e-8Xj78HMvWR-FNzw0KF4tEHrmH8clOcCzlOeYT3i
• 7. http://upload.wikimedia.org/wikipedia/commons/e/e6/Spombe_Pop2p_protein_structure_rainbow.png
• 8. https://encrypted-tbn1.gstatic.com/images?q=tbn:ANd9GcSHmWGzX5jd6PvwdiaYdxnGcCKOXjmxBmoz_YiE7L1xLNvzAkQvITmdSrRy
• 9. & 10.Book: Basic Bioinformatics by S.Ignacimuthu, s.j.