Top Banner
Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder * , Bernard M.E. Moret * University of Texas at Austin (currently the Program for Evolutionary Dynamics, Harvard University) University of New Mexico
53

Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Dec 20, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Network (Reticulate) Evolution: Biology, Models, and Algorithms

C. Randal Linder*, Bernard M.E. Moret†

*University of Texas at Austin(currently the Program for Evolutionary Dynamics,

Harvard University)

†University of New Mexico

Page 2: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Purpose of Tutorial

• Familiarize you with the nature of reticulation in biology, especially hybrid speciation

• Discuss the implications of reticulation for our understanding of evolution

• Present currently available methods for simulating, detecting and reconstructing reticulation

• Consider deficiencies of the current methods

Page 3: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Overview of Reticulation in Biology

• What happens at the genetic level?

• How does it relate to population genetic processes?

– In particular, what processes can give the appearance of species level reticulation

• How can we detect it?

• How can we reconstruct it?

• What biological tools need to be in place to generate the requisite data?

Page 4: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Idealized Nature

• Wouldn’t it be nice if…

– Sexual creatures would just behave themselves

– Asexual lineages would keep their pseudopods to themselves

Page 5: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Then we could stick with bifurcating graphs (trees) to properly describe the evolutionary history of organismal lineages

A B C D E F G H

Page 6: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Unruly Nature

Whatever is not forbidden will occur.

-- Gerald Myers

(ca 1980)

Page 7: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

In Other Words• Nature does not care about our nice systems

• Rather, the only rule is:– If a set of genes can be brought together in a cell,

survival and reproduction will be determined by the phenotype produced in the environment of the organism.

• If the organism can survive and reproduce as well as or better than its competitors, it “works” no matter the mating/process that produced it

Page 8: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Therefore• Some “species” are able to interbreed or

exchange genes in ways that violate “normal” notions about species and speciation

• Reticulation is violation of the independence of each evolutionary lineage– Instead of bifurcation, lineages can mix and

produce new lineages

Page 9: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

• This leads to the production of networks instead of trees

A B C D E F G H I

Page 10: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Molecular phylogeneticists will have failed to find the “true tree,” not because their methods are inadequate or because they have chosen the wrong genes, but because the history of life cannot properly be represented as a tree.

--Ford Doolittle

Page 11: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.
Page 12: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Before Reticulation• Paradoxically, I’ll begin with non-reticulate evolution• Bifurcating evolution (and sometimes hard

polytomies)– Evolutionary lineages split and evolve independently from

one another

Page 13: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Before Reticulation• Key Evolutionary Insight: Because all evolution is a

product of change from one generation to the next, the information must initially change in some form of bifurcating process.

agct

gcct

agctacct

agctacct agatgcct gact

Page 14: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

With Reticulation

• The end result is admixture of different evolutionary histories

agct

agctacat

acat agct

acat agctacct

Page 15: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Levels of Reticulation

• Life is organized hierarchically and so reticulation can occur at different levels

– Chromosomal (meiotic recombination)

– Population (sexual recombination)

– Species (interspecific hybridization and horizontal gene transfer)

Page 16: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Levels of Reticulation• Chromosomal (meiotic recombination)

Page 17: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Levels of ReticulationPopulation (sexual recombination of haploid genomes)

Page 18: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Levels of Reticulation• Species (hybridization and gene transfer)

Page 19: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Levels Nested within Levels

Page 20: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Areas of Biological Research• Most of the work on reticulation has been

done at the population genetic level– A great deal of work on recombination,

especially meiotic recombination

• Hybrid speciation and lateral gene transfer are less well studied– Intersect with the population genetic

perspective– Will talk about this a bit later and from other

speakers

Page 21: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Types of Hybrid Speciation• Allopolyploidization: each parent of the hybrid

contributes it’s entire nuclear genome (usually uniparental inheritance of the organelles)– Parents needn’t have the same number of chromosomes

Page 22: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Types of Hybrid Speciation• Diploid (Homoploid) Hybridization: each parent

contributes half of its diploid chromosome set, as it would with normal sex.– Parents almost always have the same number of

chromosomes

Page 23: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Types of Hybrid Speciation

• Autopolyploidization: a doubling of the diploid chromosome number in a single species– From a biological and topological perspective, could be

considered a type of bifurcating speciation

Page 24: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Horizontal Gene Transfer

• Hybridization between lineages, but an independent lineage is not produced– Hybrids backcross to one or both

parents allowing introgression of genes between “species”

• Genes are moved between lineages by a third party (vector), e.g., a virus

Page 25: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Horizontal Gene Transfer:Introgressive Hybridization

Page 26: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Horizontal Gene Transfer:Genome Capture

• A complete organellar genome is transferred by hybridization

Page 27: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Horizontal Gene Transfer:Bacterial Sex

• Genetic material is moved by conjugation between compatible bacteria

Page 28: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Bacteria: Promiscuous DNA Sharers• Lawrence, Ochman estimated that 755 of 4,288

ORFs in E. coli were from at least 234 lateral gene transfer events (Proc. Natl Acad. Sci. USA 95, 9413-9417 (1998) )

• General evidence:

Page 29: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Horizontal Gene Transfer:Exchange by a Vector

• Genetic material is moved by a third party such as a virus or a combination of organisms, e.g., mosquito and protozoan.

Page 30: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Neworks Have Incongruent Trees Within Them

Page 31: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Reticulation Events Have Incongruent Trees Within Them

Page 32: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Reticulation Events Have Incongruent Trees Within Them

Page 33: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Fundamental Insight

• At the lowest possible level

(individual DNA nucleotides on a

single DNA strand) all evolution is

ultimately tree-like.

Page 34: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

How Might We Detect Reticulation?

• Fundamentally, reticulation is a mixing of different evolutionary signals. Therefore: – The signal from a genome that has

experienced reticulation will be an “average” of its parents (Median approach)

– Unrecombined stretches of DNA will have a signal that comes from one parent. (Incongruence approach)

• Will see both approaches in methods for detection and reconstruction

Page 35: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Evolutionary Events that Mimic Species-Level Reticulation

• Lineage Sorting (gene tree/species tree problem)

• Reticulation at lower levels, e.g., meiotic recombination

Page 36: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Evolutionary Events that Mimic Species-Level Reticulation

• Lineage Sorting (gene tree/species tree problem)– When reconstructing a species-level

phylogeny using DNA sequence information we are actually reconstructing a gene tree.

– Ancient alleles (alleles arising prior to some monophyletic group) may not be inherited by all species.

– In essence, it is either a sampling problem or an irretrievable information loss problem.

Page 37: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Gene Tree/Species Tree

Page 38: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Gene Tree/Species Tree• All of the versions of a gene from a single common history (everything that is the same color) are referred to as orthologues.

• Versions of a gene from a duplication event or the production of a new allele are paralogues

Page 39: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Gene Tree/Species Tree

Page 40: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Gene Tree/Species Tree

Page 41: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Gene Tree/Species Tree

Page 42: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Gene Tree/Species Tree

Page 43: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Gene Tree/Species Tree

Page 44: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Gene Tree/Species Tree

Page 45: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Gene Tree/Species Tree

• Under a molecular clock, it is possible to detect the difference between incongruence due to hybridization and to a gene tree/species tree sampling problem.

•GT/ST incongruences will occur at different depths.

Page 46: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Evolutionary Events that Mimic Species-Level Reticulation

• Reticulation at lower levels, e.g., meiotic recombination

– Recombination can lead to loss of an allele

for a lineage in a particular region of DNA

essentially giving rise to a lineage sorting

problem.

Page 47: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Recombination Example

Page 48: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Second Key Insight

• Events that masquerade as species-level

reticulate evolution are always the

product of either true data loss or

inadequate sampling.

– Here, we encounter the importance of a

population genetic perspective in

phylogenetics.

Page 49: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Given the problem of

misleading signals, how can

we distinguish true species-

level reticulation from

reticulation at other levels,

simple data loss, and

inadequate sampling?

Page 50: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Possible Solution• Increase the number of individuals

sampled from a species/population and the number of markers.

• Therefore, must take a multiple marker approach to recovering the species-level relationships– Data loss and lower level reticulation

events should almost always act randomly with respect to which phylogeny is favored

– Species-level reticulation will be biased toward a particular interpretation

Page 51: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Practical Concerns

• Practical problems (for biologists):– Cost – Time– Lack of prior knowledge that all of the

orthologues are there to be found

Page 52: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Caveats

• Reticulation events that quickly follow speciation may not be detectable

• Ancient reticulation events may not be recoverable

• The computational requirements to detect and reconstruct reticulation may be considerable

• We may have to rethink our ideas of species (levels/units of speciation)

Page 53: Network (Reticulate) Evolution: Biology, Models, and Algorithms C. Randal Linder *, Bernard M.E. Moret † * University of Texas at Austin (currently the.

Assembling the Network of Assembling the Network of Life: ANOLLife: ANOL