DUF1220 Domains & the Search for the Genes that Made Us Human James M. Sikela, Ph.D. Human Medical Genetics, Neuroscience, & Comparative Genomics Programs, Department of Biochemistry & Molecular Genetics, University of Colorado School of Medicine Genomics Course February 28, 2012
38
Embed
DUF1220 Domains & the Search for the Genes that Made Us Human James M. Sikela, Ph.D. Human Medical Genetics, Neuroscience, & Comparative Genomics Programs,
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
DUF1220 Domains & the Search for the Genes that Made Us Human
DUF1220 Domains & the Search for the Genes that Made Us Human
James M. Sikela, Ph.D.Human Medical Genetics, Neuroscience, &
Comparative Genomics Programs,Department of Biochemistry & Molecular Genetics, University of Colorado School of
Medicine
Genomics CourseFebruary 28, 2012
James M. Sikela, Ph.D.Human Medical Genetics, Neuroscience, &
Comparative Genomics Programs,Department of Biochemistry & Molecular Genetics, University of Colorado School of
Medicine
Genomics CourseFebruary 28, 2012
Key PointsKey Points• First gene-based and first genome-wide study of
lineage-specific gene duplication and loss in human and primate evolution
• Dramatic human-specific increase in copy number of DUF1220 protein domains
• DUF1220 copy number linked to evolution of brain size
• Selection of evolutionarily adaptive genome sequences may be driving disease, e.g. 1q21.1
• First gene-based and first genome-wide study of lineage-specific gene duplication and loss in human and primate evolution
• Dramatic human-specific increase in copy number of DUF1220 protein domains
• DUF1220 copy number linked to evolution of brain size
• Selection of evolutionarily adaptive genome sequences may be driving disease, e.g. 1q21.1
Primate EvolutionPrimate Evolution
New World Monkeys (e.g. squirrel monkey,spider monkey)New World Monkeys (e.g. squirrel monkey,spider monkey)
Old World Monkeys (e.g. baboon, rhesus, etc.)Old World Monkeys (e.g. baboon, rhesus, etc.)
•“There is now ample evidence that gene duplication is the most important mechanism for generating new genes and new biochemical processes that
have facilitated the evolution of complex organisms from primitive
ones.”- W. H. Li in Molecular
Evolution, 1997
•“Exceptional duplicated regions underlie exceptional biology”
- Evan Eichler, Genome Research 11:653-656,
2001
Fig 1. Measuring genomic DNA copy number alteration using cDNA microarrays (array CGH). Fluorescence ratios are depicted in a pseudocolor scale, such that red indicates increased, and green decreased, gene copy number in the test (right) compared to reference sample (left).
• Carry out pairwise cDNA aCGH comparisons between human and other hominoid species
• Use a >39,000 cDNA microarray representing >29,000 human genes
• Hybridize human genomic DNA (reference sequence: cy3/green) and other hominoid genomic DNAs (test sequence: cy5/red) simultaneously to the microarray
• Visualize aCGH signals “gene-by-gene” along each chromosome across five species: human (n=5), bonobo (n=3), chimpanzee (n=4), gorilla (n=3) and orangutan (n=3)
• Carry out pairwise cDNA aCGH comparisons between human and other hominoid species
• Use a >39,000 cDNA microarray representing >29,000 human genes
• Hybridize human genomic DNA (reference sequence: cy3/green) and other hominoid genomic DNAs (test sequence: cy5/red) simultaneously to the microarray
• Visualize aCGH signals “gene-by-gene” along each chromosome across five species: human (n=5), bonobo (n=3), chimpanzee (n=4), gorilla (n=3) and orangutan (n=3)
Whole Genome Caryoscope Image of Interhominoid aCGH DataWhole Genome Caryoscope Image of Interhominoid aCGH Data
Human & Great Ape Genes Showing Lineage-Specific Copy Number Gain/LossHuman & Great Ape Genes Showing Lineage-Specific Copy Number Gain/Loss
Fortna, et al, PLoS Biol. 2004Fortna, et al, PLoS Biol. 2004
Summary of Human/Primate ArrayCGH Results
Summary of Human/Primate ArrayCGH Results
• First genome-wide and first gene-based aCGH comparison of human and nonhuman primate gene copy number variation (Fortna, et al 2004)
• 1,004 (4,159) genes identified that showed lineage-specific changes in copy number
• Time machine of evolutionary copy number change
• Gene candidates to underlie lineage-specific traits• Genes identified represent most of major lineage-
specific gene duplications and losses over the last 60 million years of human and primate evolution (Dumas, et al 2007)
• First genome-wide and first gene-based aCGH comparison of human and nonhuman primate gene copy number variation (Fortna, et al 2004)
• 1,004 (4,159) genes identified that showed lineage-specific changes in copy number
• Time machine of evolutionary copy number change
• Gene candidates to underlie lineage-specific traits• Genes identified represent most of major lineage-
specific gene duplications and losses over the last 60 million years of human and primate evolution (Dumas, et al 2007)
Human & Great Ape Genes Showing Lineage-Specific Copy Number Gain/LossHuman & Great Ape Genes Showing Lineage-Specific Copy Number Gain/Loss
Fortna, et al, PLoS Biol. 2004Fortna, et al, PLoS Biol. 2004
““This (Fortna, et al, 2004) is the first This (Fortna, et al, 2004) is the first time that copy number changes among time that copy number changes among
apes have been assayed for the vast apes have been assayed for the vast majority of human genes, and we can majority of human genes, and we can
expect that the biological consequences expect that the biological consequences of the 140 human-specific copy number of the 140 human-specific copy number changes identified in this study will be changes identified in this study will be heavily investigated over the coming heavily investigated over the coming
*Vandepoule, et al, Mol. Biol. & Evol, 2005*Vandepoule, et al, Mol. Biol. & Evol, 2005
Copy Number of DUF1220 (Q8IX62/17-33) Sequences in Primate Species
0
10
20
30
40
50
60
70
Q-P
CR
Pre
dic
ted
Co
py
Nu
mb
er
70
60
50
40
30
20
100
Q-P
CR
Pre
dic
ted
Co
py
N
um
ber
Copy Number of DUF1220 (Q8IX62/17-33)Sequences in Primate Species
Hu
ma
n
Bo
no
bo
Ch
imp
Go
rill
a
Ora
ng
uta
n
Gib
bo
n
Mac
aqu
e
Bab
oo
n
Summary of aCGH, Q-PCR and BLAT results:
Summary of aCGH, Q-PCR and BLAT results:
• DUF1220 domains are highly amplified in human, reduced in great apes, further reduced in Old & New World monkeys, single or low copy non-primate mammals and absent in non-mammals
• DUF1220 domains are highly amplified in human, reduced in great apes, further reduced in Old & New World monkeys, single or low copy non-primate mammals and absent in non-mammals
Other VertebratesOther VertebratesA total of 40 genomes were A total of 40 genomes were searched, but only the 22 with 4X searched, but only the 22 with 4X coverage or higher are displayed.coverage or higher are displayed.
DUF1220 Copy Number Statistics in hg19 build
DUF1220 Copies
Total in Human Genome 272
Total amplified HLS DUF1220 Triplets 129
Total DUF1220 in Last Common Ancestor of Homo/Pan 102
Total of Newly Added Copies in Human Lineage 167
Total Copies Added via Domain Amplification 146
Total Copies Added via Gene Duplication 21
Average Number Added to Human Lineage every million years 28
This table shows the unprecedented DUF1220 copy number increase in the human lineage. The primary mechanism for this expansion was domain amplification via hyper-amplification of the HLS DUF1220 triplet.
Sequences encoding DUF1220 domains
Sequences encoding DUF1220 domains
• Show a major copy number burst in primates
• Are increasingly amplified generally as a function of a species evolutionary proximity to humans, where the greatest number of copies (270) is found
• Show signs of positive selection• Are highly expressed in brain regions
associated with higher cognitive function• In brain show neuron-specific expression
preferentially in cell bodies and dendrites
• Show a major copy number burst in primates
• Are increasingly amplified generally as a function of a species evolutionary proximity to humans, where the greatest number of copies (270) is found
• Show signs of positive selection• Are highly expressed in brain regions
associated with higher cognitive function• In brain show neuron-specific expression
preferentially in cell bodies and dendritesPopesco, et al, Science 2006Popesco, et al, Science 2006
•Recurrent Reciprocal 1q21.1 Deletions and Duplications Associated with Microcephaly or Macrocephaly and Developmental and Behavioral Abnormalities
Brunetti-Pierri, et al, Nature Genetics 2008•Recurrent Rearrangements of Chromosome 1q21.1 and Variable Pediatric Phenotypes
Mefford, et al, N. Engl. J. Med. 2008
•Recurrent Reciprocal 1q21.1 Deletions and Duplications Associated with Microcephaly or Macrocephaly and Developmental and Behavioral Abnormalities
Brunetti-Pierri, et al, Nature Genetics 2008•Recurrent Rearrangements of Chromosome 1q21.1 and Variable Pediatric Phenotypes
Mefford, et al, N. Engl. J. Med. 2008
1q21.1 Deletions* Linked to Microcephaly
1q21.1 Duplications* Linked to Macrocephaly
1q21.1 Deletions* Linked to Microcephaly
1q21.1 Duplications* Linked to Macrocephaly
We note that these CNVs encompass or are immediately flanked by DUF1220 sequences (Dumas & Sikela, Cold Spring Harbor Symposium Quant. Biol., 2009)
We note that these CNVs encompass or are immediately flanked by DUF1220 sequences (Dumas & Sikela, Cold Spring Harbor Symposium Quant. Biol., 2009)
*Implies human brain size directly related to the dosage of one or more genes in these 1q21.1 CNVs*Implies human brain size directly related to the dosage of one or more genes in these 1q21.1 CNVs
DUF1220 model proposes that: 1) DUF1220 copy number is directly involved
in influencing human brain size, and2) the evolutionary advantage of rapidly
increasing DUF1220 copy number in the human lineage has resulted in favoring retention of the high genomic instability of the 1q21.1 region which, in turn, has precipitated a spectrum of recurrent human brain and developmental disorders
DUF1220 model proposes that: 1) DUF1220 copy number is directly involved
in influencing human brain size, and2) the evolutionary advantage of rapidly
increasing DUF1220 copy number in the human lineage has resulted in favoring retention of the high genomic instability of the 1q21.1 region which, in turn, has precipitated a spectrum of recurrent human brain and developmental disorders
*Dumas & Sikela, Cold Spring Harbor Symposium Quant. Biol., 2009*Dumas & Sikela, Cold Spring Harbor Symposium Quant. Biol., 2009
Concluding ThoughtsConcluding Thoughts• DUF1220 domains shows the largest HLS DUF1220 domains shows the largest HLS
protein coding copy number increase in the protein coding copy number increase in the genome genome – But no one gene made us humanBut no one gene made us human– DUF1220 genotyping challengesDUF1220 genotyping challenges
• We know more about our genome than everWe know more about our genome than ever– But there are vast areas of our genome But there are vast areas of our genome
about which we know virtually nothingabout which we know virtually nothing– No mammalian genome has been No mammalian genome has been
completely sequencedcompletely sequenced
AcknowledgementsAcknowledgements• Sikela Lab• Laura Dumas • Majesta O’Bleness• Maggie Popesco• Erik MacLaren• Andy Fortna • Jan Hopkins• Jonathon Keeney• Jack Davis• Jay Jackson• Megan Sikela• Michael Cox• Kriste Marshall• Matt Brenton• Sonya Burgers• Raquel Hink• Erin Dorning• Park McNair
• Sikela Lab• Laura Dumas • Majesta O’Bleness• Maggie Popesco• Erik MacLaren• Andy Fortna • Jan Hopkins• Jonathon Keeney• Jack Davis• Jay Jackson• Megan Sikela• Michael Cox• Kriste Marshall• Matt Brenton• Sonya Burgers• Raquel Hink• Erin Dorning• Park McNair
• Collaborators• Stanford
– Jon Pollack– Young Kim
• Univ. of Kansas - Gerald Wyckoff
• Univ of Utah– Lynn Jorde
• Baylor College– Pawel Stankiewicz– Sau Wai Cheng
• UCSOM– Epidemiology
• Tasha Fingerlin– Preventive Medicine &
Biometrics• Anis Karimpour-Fard
– Neuroscience Program• Rock Levinson• John Caldwell
• Collaborators• Stanford
– Jon Pollack– Young Kim
• Univ. of Kansas - Gerald Wyckoff
• Univ of Utah– Lynn Jorde
• Baylor College– Pawel Stankiewicz– Sau Wai Cheng
• UCSOM– Epidemiology
• Tasha Fingerlin– Preventive Medicine &
Biometrics• Anis Karimpour-Fard
– Neuroscience Program• Rock Levinson• John Caldwell
A Walk Through Our GenomeA Walk Through Our Genome
--All regions of the genome are not created --All regions of the genome are not created equalequal--All regions of the genome are not created --All regions of the genome are not created equalequal