Top Banner
Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin Chen, Mike Daly, Hoa Dinh, Erinn Hama, Robert Hinman, Julio Ng, Michael Sneddon, Hoa Troung, Jerry Wang, Che Fung Yung, Ron Shamir, Yael Mandel
33

Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Sep 21, 2018

Download

Documents

builien
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Introduction to Computational

Genomics

Based on Slides by: Angela Brooks, Raymond Brown, Calvin Chen, Mike Daly, Hoa Dinh, Erinn Hama, Robert Hinman, Julio Ng, Michael Sneddon, Hoa Troung, Jerry Wang, Che Fung Yung, Ron Shamir, Yael Mandel

Page 2: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

How Molecular Biology came about?

• Microscopic biology began in

1665

• Robert Hooke (1635-1703)

discovered organisms are

made up of cells

• Matthias Schleiden (1804-

1881) and Theodor Schwann

(1810-1882) further

expanded the study of cells

in 1830s

• Robert

Hooke

• Theodor

Schwann

• Matthias

Schleiden

Page 3: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

What is Life made of?

Page 4: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Life begins with Cell

• A cell is a smallest structural unit of an

organism that is capable of independent

functioning

• All cells have some common features

Page 5: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Cells

• Chemical composition-by weight

• 70% water

• 7% small molecules • salts

• Lipids

• amino acids

• nucleotides

• 23% macromolecules • Proteins

• Polysaccharides

• Complex Lipids

Page 6: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

All Cells have common Cycles

• Born, eat, replicate, and die

Page 7: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Cell Information and Machinery

• Cells store all information to replicate itself

• Human genome is around 3 billions base pair long

• Almost every cell in human body contains same set of genes

• But not all genes are used or expressed by those cells

• Machinery:

• Collect and manufacture components

• Carry out replication

• Kick-start its new offspring

(A cell is like a car factory)

Page 8: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Major events in the history of Molecular

Biology 1800 - 1870

• 1865 Gregor Mendel

discover the basic rules of

heredity of garden pea.

• An individual organism has

two alternative heredity units

for a given trait (dominant

trait v.s. recessive trait)

• 1869 Johann Friedrich

Miescher discovered DNA

and named it nuclein.

Mendel: The Father of Genetics

Johann Miescher

Page 9: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

DNA: The Code of Life

• The structure and the four genomic letters (nucleic acids = nucleotides) code for all living organisms

• Adenine, Guanine, Thymine, and Cytosine which pair (based on hydrogen bonds) A-T and C-G on complimentary strands.

• DNA = Deoxyribo Nucleic Acid

Page 10: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

DNA, continued • DNA has a double helix

structure which is composed of • sugar molecule

• phosphate group

• and a base (A,C,G,T)

• DNA always reads from 5’ end to 3’ end for transcription replication 5’ ATTTAGGCC 3’

3’ TAAATCCGG 5’

Page 11: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Overview of organizations of life

• Nucleus = library

• Chromosomes = bookshelves

• Genes = books

• Books represent all the information (DNA)

that every cell in the body needs so it can

grow and carry out its various functions.

• Almost every cell in an organism contains the

same libraries and the same sets of books.

Page 12: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Life begins with Cell

• A cell is a smallest structural unit of an

organism that is capable of independent

functioning

• All cells have some common features

Page 13: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

All Life depends on 3 critical molecules

• DNAs

• Hold information on how cell works

• RNAs

• Act to transfer short pieces of information to different parts

of cell

• Provide templates to synthesize into protein

• Proteins

• Form enzymes that send signals to other cells and regulate

gene activity

• Form body’s major components (e.g. hair, skin, etc.)

Page 14: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Proteins: Workhorses of the Cell

• 20 different amino acids • different chemical properties cause the protein chains to fold up

into specific three-dimensional structures that define their particular functions in the cell.

• Proteins do all essential work for the cell

• build cellular structures

• digest nutrients

• execute metabolic functions

• Mediate information flow within a cell and among cellular communities.

• Proteins work together with other proteins or nucleic acids as "molecular machines"

• structures that fit together and function in highly specific, lock-and-key ways.

Page 15: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

> DNA sequence

AATTCATGAAAATCGTATACTGGTCTGGTACCGG

CAACACTGAGAAAATGGCAGAGCTCATCGCTAAA

GGTATCATCGAATCTGGTAAAGACGTCAACACCA

TCAACGTGTCTGACGTTAACATCGATGAACTGCT

GAACGAAGATATCCTGATCCTGGGTTGCTCTGCC

ATGGGCGATGAAGTTCTCGAGGAAAGCGAATTTG

Gene Function

> Protein sequence

MKIVYWSGTGNTEKMAELIAKGIIES

GKDVNTINVSDVNIDELLNEDILILGC

SAMGDEVLEESEFEPFIEEISTKISG

KKVALFGSYGWGDGKWMRDFEER

MNGYGCVVVETPLIVQNEPDEAEQD

CIEFGKKIANI

The Central Dogma of Molecular Biology

RNA DNA PROTEIN

Page 16: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Cell Information: Instruction book of Life

• DNA, RNA, and Proteins are examples of strings written in either the four-letter nucleotide of DNA and RNA (A C G T/U)

• or the twenty-letter amino acid of proteins. Each amino acid is coded by 3 nucleotides called codon. (Leu, Arg, Met, etc.)

Page 17: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

DNA, RNA, and the Flow of

Information

Translation Transcription

Replication

Page 18: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin
Page 19: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin
Page 20: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

The Human Genome: numbers

• 23 pairs of chromosomes

• ~3,000,000,000 bases

• ~20,000 protein-coding genes

• up to 1,000,000 protein variants

Page 21: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Some Terminology

• Genome: an organism’s genetic material

• Gene: a discrete units of hereditary information located on the chromosomes and consisting of DNA.

• Nucleic acid: Biological molecules(RNA and DNA) that allow organisms to reproduce;

Page 22: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

More Terminology

• The genome is an organism’s complete set of DNA.

• a bacteria contains about 600,000 DNA base pairs

• human and mouse genomes have some 3 billion.

• human genome has 24 distinct chromosomes.

• Each chromosome contains many genes.

• Gene

• basic physical and functional units of heredity.

• specific sequences of DNA bases that encode instructions on how to make proteins.

• Proteins

• Make up the cellular structure

• large, complex molecules made up of smaller subunits called amino acids.

Page 23: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Major events in the history of Molecular

Biology 1880 - 1900

• 1881 Edward Zacharias showed chromosomes are

composed of nuclein.

• 1899 Richard Altmann renamed nuclein to nucleic acid.

• By 1900, chemical structures of all 20 amino acids had

• been identified

Page 24: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Major events in the history of Molecular

Biology 1900-1911 • 1902 - Emil Hermann Fischer wins Nobel

prize: showed amino acids are linked and form proteins • Postulated: protein properties are defined by

amino acid composition and arrangement, which we nowadays know as fact

• 1911 – Thomas Hunt Morgan discovers genes on chromosomes are the discrete units of heredity

• 1911 Pheobus Aaron Theodore Lerene discovers RNA

Emil

Fischer

Thomas

Morgan

Page 25: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Major events in the history of Molecular

Biology 1940 - 1950 • 1941 – George Beadle and

Edward Tatum identify that genes

make proteins

• 1950 – Edwin Chargaff find

Cytosine complements Guanine

and Adenine complements

Thymine

George

Beadle Edward

Tatum

Edwin

Chargaff

Page 26: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Major events in the history of Molecular

Biology 1952 - 1960

• 1952-1953 James D.

Watson and Francis H. C.

Crick deduced the double

helical structure of DNA

• 1956 George Emil Palade

showed the site of enzymes

manufacturing in the

cytoplasm is made on RNA

organelles called ribosomes.

James Watson

and Francis Crick

George Emil Palade

Page 27: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Major events in the history of Molecular Biology

1986 - 1995 • 1986 Leroy Hood: Developed

automated sequencing mechanism

• 1986 Human Genome Initiative announced

• 1990 The 15 year Human Genome project is launched by congress

• 1995 Moderate-resolution maps of chromosomes 3, 11, 12, and 22 maps published (These maps provide the locations of “markers” on each chromosome to make locating genes easier)

Leroy Hood

Page 28: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

30

From DNA to Genome

Watson and

Crick DNA

model 1955

1960

1965

1970

1975

1980

1985

Page 29: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

31

1995

1990

2000 First human

genome draft

First genome Hemophilus Influenzae

Yeast genome

Fly genome

Page 30: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

32

Total 1379 294

Eukaryotes 133 39

Bacteria 1152 235

Archaea 94 23

Complete Genomes

2010 2005

Page 31: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

1,000 Genomes Project: Expanding the

Map of Human Genetics

Researchers hope the effort will speed up

the discovery of many diseases's genetic

roots

33

Page 32: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

Synthetic Biology: In 2001, Craig Venter made headlines for sequencing the

human genome.

In 2003, he started mapping the ocean's biodiversity.

In 2010 Craig Venter Boots up First Synthetic Cell: Craig Venter and team created the first fully functioning,

reproducing cell controlled by synthetic DNA, thus

creating the first synthetic lifeforms.

http://www.ted.com/talks/craig_venter_unveils_syntheti

c_life.html

Page 33: Introduction to Computational Genomics - BGUbccg131/wiki.files/Molecular_Biology... · Introduction to Computational Genomics Based on Slides by: Angela Brooks, Raymond Brown, Calvin

35

Main Goal:

To understand the living cell

Annotation Comparative

genomics Functional

genomics

25000 genomes… What’s Next ?

The “post-genomics” era

Systems

Biology