Comparative Genomics Comparative genomics is the study of the relationship of genome structure and function across different biological species or strains. It is an attempt to take advantage of the information provided by the signatures of selection to understand the function and evolutionary processes that act on genomes. Though it is a very new field, it holds great promise to yield insights into many aspects of the evolution of modern species.It includes the study of analyzing and comparing genetic material from different species to study evolution, gene function, and inherited disease. It exploits both similarities and differences in the proteins, RNA, and regulatory regions of different organisms to determine how selection has acted upon these elements. Those elements that are responsible for similarities between different species should be conserved through time (stabilizing selection), while those elements responsible for differences among species should be divergent (positive selection). Finally, those elements that are unimportant to the evolutionary success of the organism will be unconserved (selection is neutral). It involves the use of computer programs that can line up multiple genomes and look for regions of similarity among them. The following are the comparative genome sizes of humans and other organisms being studied in the recent past: organism estimated size estimated average gene density chromosome gene number number Homo sapiens 2900 million bases ~30,000 1 gene per 100,000 bases 46 (human) Rattus norvegicus 2,750 million bases ~30,000 1 gene per 100,000 bases 42 (rat) Mus musculus 2500 million bases ~30,000 1 gene per 100,000 bases 40 (mouse) Drosophila melanogaster 180 million bases 13,600 1 gene per 9,000 bases 8 (fruit fly) Arabidopsis thaliana 125 million bases 25,500 1 gene per 4000 bases 5 (plant) Caenorhabditis elegans 97 million bases 19,100 1 gene per 5000 bases 6 (roundworm) Saccharomyces cerevisiae 12 million bases 6300 1 gene per 2000 bases 16 (yeast) Escherichia coli 4.7 million bases 3200 1 gene per 1400 bases 1 (bacteria) H. influenzae 1.8 million bases 1700 1 gene per 1000 bases 1 (bacteria)
12
Embed
Comparative Genomicsbioinformatics.iasri.res.in/BAMAST/Book.html/EbookNew/...Comparative Genomics Comparative genomics is the study of the relationship of genome structure and function
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Comparative Genomics
Comparative genomics is the study of the relationship of genome structure and function
across different biological species or strains. It is an attempt to take advantage of the
information provided by the signatures of selection to understand the function and
evolutionary processes that act on genomes. Though it is a very new field, it holds great
promise to yield insights into many aspects of the evolution of modern species.It includes
the study of analyzing and comparing genetic material from different species to study
evolution, gene function, and inherited disease. It exploits both similarities and
differences in the proteins, RNA, and regulatory regions of different organisms to
determine how selection has acted upon these elements. Those elements that are
responsible for similarities between different species should be conserved through time
(stabilizing selection), while those elements responsible for differences among species
should be divergent (positive selection). Finally, those elements that are unimportant to
the evolutionary success of the organism will be unconserved (selection is neutral). It
involves the use of computer programs that can line up multiple genomes and look for
regions of similarity among them.
The following are the comparative genome sizes of humans and other organisms being
studied in the recent past:
organism estimated size
estimated average gene density
chromosome
gene number number
Homo sapiens 2900 million bases ~30,000
1 gene per 100,000 bases 46 (human)
Rattus norvegicus 2,750 million bases ~30,000
1 gene per 100,000 bases 42 (rat)
Mus musculus 2500 million bases ~30,000
1 gene per 100,000 bases 40 (mouse)
Drosophila melanogaster
180 million bases 13,600 1 gene per 9,000 bases 8 (fruit fly)
Arabidopsis thaliana
125 million bases 25,500 1 gene per 4000 bases 5 (plant)
Caenorhabditis elegans
97 million bases 19,100 1 gene per 5000 bases 6 (roundworm)
Saccharomyces cerevisiae
12 million bases 6300 1 gene per 2000 bases 16 (yeast)
Escherichia coli
4.7 million bases 3200 1 gene per 1400 bases 1 (bacteria)
H. influenzae
1.8 million bases 1700 1 gene per 1000 bases 1 (bacteria)