Top Banner
Bioinformatics Ayesha M.Khan 25 th April,2012
14

Bioinformatics

Feb 24, 2016

Download

Documents

Bioinformatics. Ayesha M.Khan 25 th April,2012. Phylogenetic Basics. One central field in biology is to infer the relation between species. Do they possess a common ancestor? When did they separate from each other? - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Bioinformatics

Bioinformatics

Ayesha M.Khan25th April,2012

Page 2: Bioinformatics

Lec-11 2

Phylogenetic BasicsOne central field in biology is to infer the relation between species. Do they possess a common ancestor? When did they separate from each other?

Phylogenetics is the study of evolutionary relationships among and within species.

Phylogenetics is the field of systematics that focuses on evolutionary relationships between organisms or genes/proteins (phylogeny).Systematics: an attempt to understand the interrelationships of living things

Page 3: Bioinformatics

Lec-11 3

Phylogenetic Basics (contd.)

•The actual pattern of evolutionary history is thephylogeny or evolutionary tree which we try to estimate.

• A tree is a mathematical structure which is used to model the actual evolutionary history of a group of sequences or organisms.

Page 4: Bioinformatics

Lec-11 4

Phylogenetic Basics (contd.)•Homologues are similar sequences in two different organisms that have been derived from a common ancestor sequence.

• Orthologues are similar sequences in two different organisms that have arisen due to a speciation event. Orthologs typically retain identical or similar functionality throughout evolution.

• Paralogues are similar sequences within a single organism that have arisen due to a gene duplication event. They tend to have differing functions.

• Xenologues are similar sequences that do not share the same evolutionary origin, but rather have arisen out of horizontal transfer events through symbiosis, viruses, etc.

Page 5: Bioinformatics

5Lecture 4.2

Early globin gene

mouse

ß-chain gene-chain gene

cattle ß human ß mouse ßhuman cattle

Orthologs () Orthologs (ß)Paralogs (cattle)

Homologs

Gene Duplication

Orthologs – diverged after speciation – tend to have similar function

Paralogs – diverged after gene duplication – some functional divergence occurs

Therefore, for linking similar genes between species, or performing “annotation transfer”, identify orthologs

Page 6: Bioinformatics

Lec-11 6

Molecular phylogeneticsWhy focus on molecular phylogenies in contrast to phylogenies which are based on characteristics like wings, feathers, etc, i.e. morphological characters?

With molecular phylogenetics, the differences between organisms are measured on the proteins and RNA coded in the DNA, i.e. on amino acid and nucleotide sequences.

Page 7: Bioinformatics

Lec-11 7

Molecular phylogenetics (contd.)

Also, molecular phylogenetics is more precise than its counterpart based on external features and behavior and can also distinguish small organism like bacteria or even viruses.the DNA must be inherited and connects all speciesthe molecular phylogenetics can be based on mathematical and statistical methods and is even model-based as mutations can be modeled, remote homologies can be detectedthe distance is not only based on one feature but on many genes.

Page 8: Bioinformatics

Lec-11 8

Molecular Phylogeny AnalysisMolecular phylogeny methods allow, from a given set of aligned sequences, the suggestion of phylogenetic trees (inferred trees) which aim at reconstructing the history of successive divergence which took place during the evolution, between the considered sequences and their common ancestor. These trees may not be the same as the true tree.

• Reconstruction of phylogenetic trees is a statistical problem, and a reconstructed tree is an estimate of a true tree with a given topology and given branch length;

• In practice, phylogenetic analyses usually generate phylogenetic trees with accurate parts and imprecise parts.

Page 9: Bioinformatics

Lec-11 9

Key features of molecular phylogenetic trees

Page 10: Bioinformatics

Lec-11 10

Molecular Phylogeny Analysis (contd.)Sequences reflect relationshipsAfter working with sequences for a while, one develops an intuitive understanding that for a given gene, closely related organisms have similar sequences and more distantly related organisms have more dissimilar sequences. These differences can be quantified.Given a set of gene sequences, it should be possible to reconstruct the evolutionary relationships among genes and among organisms.

Page 11: Bioinformatics

Lec-11 11

Example: Pseudomonas aeruginosa- one of the top three causes of opportunistic infections, noted for its antimicrobial resistance and resistance to detergents.

Page 12: Bioinformatics

Lec-11 12

Phylogenetic tree constructionConsider the set of sequences to analyse

Align "properly" these sequences

Apply phylogenetic making tree methods

Evaluate statistically the obtained phylogenetic tree

Page 13: Bioinformatics

Lec-11 13

Phylogenetic tree construction methodsThree categories: of methods exist:distance-based, maximum parsimony, and maximum likelihood.•Distance methods: evolutionary distances are computed for all sequences and build tree where distance between sequences “matches” these distances•Maximum parsimony (MP): choose tree that minimizes number of changes required to explain data•Maximum likelihood (ML): under a model of sequence evolution, find the tree which gives the highest likelihood of the observed data

Page 14: Bioinformatics

Lec-11 14