Bioinformatics. Lecture 1: Introduction. Dr. Aladdin Hamwieh, Khalid Al-shamaa, Abdulqader Jighly. 2010-2011. Aleppo University, Faculty of Technical Engineering, Department of Biotechnology.

Bioinformatics

Feb 25, 2016


Page 1: Bioinformatics

Bioinformatics

Dr. Aladdin Hamwieh, Khalid Al-shamaa, Abdulqader Jighly

2010-2011

Lecture 1: Introduction

Aleppo University, Faculty of Technical Engineering, Department of Biotechnology

Page 2: Bioinformatics

Main Lines
• Definition
• Bioinformatics areas
• Bioinformatics data
  – Data types
  – Applications for these data
• Next generation sequencing
• Bioinformatics algorithms
• Joint international programming initiatives

Page 3: Bioinformatics

Definition
• Bioinformatics is the field of science in which biology, computer science, and information technology merge into a single discipline.
• Bioinformatics is the science of managing and analyzing biological data using advanced computing techniques.
• Bioinformatics applies principles of information science to make the vast, diverse, and complex life sciences data more understandable and useful.

Page 4: Bioinformatics

Definition
• There are two extremes in bioinformatics work:
  – Tool users (biologists): know the biology and how to press the buttons, but have no clue what happens inside the program
  – Tool shapers (informaticians): know the algorithms and how the tool works, but have no clue about the biology

Page 5: Bioinformatics

Bioinformatics areas
• Molecular sequence analysis
  1. Sequence alignment
  2. Sequence database searching
  3. Motif discovery
  4. Gene and promoter finding
  5. Reconstruction of evolutionary relationships
  6. Genome assembly and comparison

Page 6: Bioinformatics

Bioinformatics areas
• Molecular structural analysis
  1. Protein structure analysis
  2. Nucleic acid structure analysis
  3. Comparison
  4. Classification
  5. Prediction

Page 7: Bioinformatics

Bioinformatics areas
• Molecular functional analysis
  1. Gene expression profiling
  2. Protein–protein interaction prediction
  3. Protein sub-cellular localization prediction
  4. Metabolic pathway reconstruction
  5. Simulation

Page 8: Bioinformatics
Page 9: Bioinformatics

Bioinformatics data

Several different data types are commonly used in bioinformatics.

The same data may be used in different areas.

Page 10: Bioinformatics

Data types
• DNA sequences
• RNA sequences
• Expression (microarray) profile
• Proteome (x-ray, NMR) profile
• Metabolome profile
• Haplotype profile
• Phenotype profile

Page 11: Bioinformatics

1 - DNA sequences
• Simple sequence analysis
  – Database searching
  – Pairwise and multiple analysis
• Regulatory regions
• Gene finding
• Whole genome annotation
• Comparative genomics
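
As a small illustration of what "simple sequence analysis" can mean in practice, the sketch below computes two basic properties of a DNA string: its GC content and its reverse complement. The function names and the example sequence are made up for this illustration and are not taken from the lecture.

```python
# Minimal sketch of two basic DNA sequence computations.
# Function names and the example sequence are illustrative assumptions.

# Translation table mapping each base to its complementary base.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_content(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    dna = "ATGCGCTA"
    print("GC content:", gc_content(dna))
    print("Reverse complement:", reverse_complement(dna))
```

Real data would also need handling for ambiguous bases such as N, which this sketch leaves unchanged in the reverse complement.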

Page 12: Bioinformatics
Page 13: Bioinformatics

2 - RNAs
• Splice variants
• Tissue specific expression
• 2D structure
• 3D structure
• Single gene analysis
• Microarray

Page 14: Bioinformatics

2D and 3D structure of tRNA

Page 15: Bioinformatics

2D and 3D structure of rRNA

Page 16: Bioinformatics

Microarray

• 20,000 to 60,000 short DNA probes of specified sequence are tethered in an ordered grid on a small slide. Each probe corresponds to a particular short section of a gene.

Page 17: Bioinformatics

Microarray

• DNA microarrays measure RNA abundance with either 1 channel (one color) or 2 channels (two colors).

• Stanford microarrays are two-channel: they measure, by competitive hybridization, the expression under a given condition (labeled with the red fluorescent dye Cy5) relative to its control (labeled with the green fluorescent dye Cy3).

• The Affymetrix GeneChip is one-channel: each array is hybridized with a single sample carrying a single fluorescent label.
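
For a two-channel array, the relative expression of a gene is commonly summarized as the base-2 logarithm of the red (Cy5) to green (Cy3) intensity ratio. The sketch below shows that calculation only; the function name and the intensity values are hypothetical, and real pipelines would first apply background correction and normalization, which are omitted here.

```python
# Minimal sketch: log2 ratio for one spot on a two-channel microarray.
# The function name and the intensities are illustrative assumptions.
import math


def log2_ratio(cy5: float, cy3: float) -> float:
    """Return log2(Cy5 / Cy3): condition intensity relative to control.

    Positive values mean higher expression under the condition,
    negative values mean lower expression, and 0 means no change.
    """
    if cy5 <= 0 or cy3 <= 0:
        raise ValueError("intensities must be positive")
    return math.log2(cy5 / cy3)


if __name__ == "__main__":
    # Hypothetical spot intensities: (Cy5 condition, Cy3 control).
    spots = {"gene_a": (800.0, 200.0), "gene_b": (100.0, 100.0), "gene_c": (50.0, 200.0)}
    for name, (red, green) in spots.items():
        print(name, log2_ratio(red, green))
```

The logarithm makes a fourfold increase and a fourfold decrease symmetric around zero (+2 and -2).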

Page 18: Bioinformatics
Page 19: Bioinformatics

3 - Proteins
• Protein sequence analysis
  – Database searching
  – Pairwise and multiple analysis
• 2D structure
• 3D structure
• Classification of protein families
• Protein arrays

Page 20: Bioinformatics

3D structure

Page 21: Bioinformatics

Animation

Page 22: Bioinformatics

4 - Metabolome and molecular biology

• Metabolic pathways
• Regulatory networks

Helps to understand systems biology

Page 23: Bioinformatics
Page 24: Bioinformatics

5 - Haplotype
• Molecular markers
  – RFLP
  – RAPD
  – SSR
  – ISSR
  – AFLP
  – DArT
  – SNP
  – …

Page 25: Bioinformatics

SNP
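
A SNP (single nucleotide polymorphism) is a position where two otherwise matching sequences differ by one base. The sketch below lists such positions for two sequences that are assumed to be already aligned and of equal length; the function name and the example sequences are hypothetical.

```python
# Minimal sketch: find single-base differences between two aligned sequences.
# Assumes equal-length, gap-free input; names and data are illustrative.
from typing import List


def snp_positions(reference: str, sample: str) -> List[int]:
    """Return the 0-based positions where sample differs from reference."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [i for i, (r, s) in enumerate(zip(reference, sample)) if r != s]


if __name__ == "__main__":
    ref = "ACGTACGT"
    alt = "ACGTTCGT"
    for pos in snp_positions(ref, alt):
        print(f"position {pos}: {ref[pos]} -> {alt[pos]}")
```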

Page 26: Bioinformatics

6 - Phenotype
• Morphological data
• Physiological data
• Stress tolerance
• Pathogenic infections
• Disease resistance
• Cancer types
• …

Page 27: Bioinformatics

Haplotype & Phenotype

Page 28: Bioinformatics

Next Generation Sequencing

Sequencing machine   SMRT                 Helicos  AB SOLiD  Illumina Solexa  Roche GS FLX  ABI 3730
Launched             Target release 2010  2008     2007      2006             2004          2000
Read length (bp)     964                  28       25-35     35-70            250-400       800-1100
Reads/run            NA                   85M      170M      120M             400K          96
Throughput per run   NA                   2 GB     6 GB      6 GB             100 MB        0.1 MB
Cost/Mb              NA                   NA       $5.81 k   $5.97 k          $84.39        High cost
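
The throughput figures in the table above are roughly the number of reads per run multiplied by the read length. The sketch below reproduces that arithmetic for three of the platforms. It assumes that read lengths are in bases and that the throughput units count bases; the function name is made up, and the 1000-base value for the ABI 3730 is just a round number inside the listed 800-1100 range.

```python
# Minimal sketch: approximate throughput as reads per run x read length.
# Input values come from the table above; the function name is illustrative.


def approx_throughput(reads_per_run: float, read_length: float) -> float:
    """Rough number of bases per run, ignoring filtering and failed reads."""
    return reads_per_run * read_length


if __name__ == "__main__":
    platforms = {
        "Helicos (28-base reads)": (85e6, 28),
        "Roche GS FLX (250-base reads)": (400e3, 250),
        "ABI 3730 (assumed 1000-base reads)": (96, 1000),
    }
    for name, (reads, length) in platforms.items():
        bases = approx_throughput(reads, length)
        print(f"{name}: about {bases:.3g} bases per run")
```

These estimates (about 2.4 billion, 100 million, and 0.1 million bases) agree in order of magnitude with the listed 2 GB, 100 MB, and 0.1 MB.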

Page 29: Bioinformatics

Short reads assembly problems

Page 30: Bioinformatics

Short reads assembly problems

Page 31: Bioinformatics

Short reads assembly problems
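
One well-known reason short reads are hard to assemble is repeats: when a repeated region is as long as (or longer than) the reads, different genomes can produce exactly the same set of reads. The sketch below illustrates this with two made-up toy sequences that share a 4-base repeat. It treats reads as error-free substrings of fixed length k taken at every position, which is a simplifying assumption; the sequences and names are hypothetical.

```python
# Minimal sketch: two different toy "genomes" with identical short-read content.
# Reads are modeled as all error-free substrings of length k (an assumption).
from collections import Counter


def kmer_spectrum(seq: str, k: int) -> Counter:
    """Count every substring of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


# A 4-base repeat occurs three times; the two genomes differ only in the
# order of the unique segments ("CA" and "GC") between the repeat copies.
REPEAT = "ACGT"
genome_1 = "TT" + REPEAT + "CA" + REPEAT + "GC" + REPEAT + "AA"
genome_2 = "TT" + REPEAT + "GC" + REPEAT + "CA" + REPEAT + "AA"

if __name__ == "__main__":
    for k in (5, 6):
        same = kmer_spectrum(genome_1, k) == kmer_spectrum(genome_2, k)
        print(f"k={k}: identical read content? {same}")
```

With 5-base reads the two sequences cannot be told apart, while 6-base reads span a whole repeat copy plus a base on each side and resolve the ambiguity.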

Page 32: Bioinformatics

Algorithms in bioinformatics

• String algorithms
• Dynamic programming
• Machine learning (NN, k-NN, SVM, GA, ..)
• Markov chain models
• Hidden Markov models
• Markov Chain Monte Carlo (MCMC) algorithms
• Stochastic context free grammars
• EM algorithms
• Gibbs sampling
• Clustering
• Tree algorithms (suffix trees)
• Graph algorithms
• Text analysis
• Hybrid/combinatorial techniques
• …
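
As one concrete instance of dynamic programming from the list above, the sketch below computes a global pairwise alignment score in the style of the Needleman-Wunsch algorithm. The scoring values (match +1, mismatch -1, gap -1) and the function name are assumptions chosen for illustration, and only the score is returned, not the alignment itself.

```python
# Minimal sketch: global alignment score by dynamic programming
# (Needleman-Wunsch style). Scoring parameters are illustrative assumptions.


def global_alignment_score(a: str, b: str, match: int = 1,
                           mismatch: int = -1, gap: int = -1) -> int:
    """Return the best global alignment score of sequences a and b."""
    # prev[j] holds the best score for aligning a[:i-1] with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # aligning a[:i] against an empty prefix of b
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            gap_in_b = prev[j] + gap       # a[i-1] aligned to a gap
            gap_in_a = curr[j - 1] + gap   # b[j-1] aligned to a gap
            curr.append(max(diagonal, gap_in_b, gap_in_a))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(global_alignment_score("ACGT", "ACGT"))
    print(global_alignment_score("ACGT", "AGT"))
```

Keeping only two rows of the table is enough for the score; recovering the alignment would require storing the full table or traceback pointers.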

Page 33: Bioinformatics

Joint international programming initiatives

• Bioperl: http://www.bioperl.org/wiki/Main_Page

• Biopython: http://www.biopython.org/

• BioTcl: http://wiki.tcl.tk/12367

• BioJava: www.biojava.org/wiki/Main_Page

Page 34: Bioinformatics

Thank You