Molecular Genetics Part II

Molecular Genetics

Part IIFrom Gene to Protein

History

• Archibald Garrod – 1909• First to suggest that genes dictate

phenotype through production of proteins

• Believed that genetic diseases resulted from the inability to make particular enzymes

• “Inborn errors of metabolism”

One Gene – One Enzyme

• Beadle & Ephrussi – 1930’s• Studied mutations affecting eye

color in Drosophila • Concluded that each mutation

blocks pigment synthesis at a specific step by preventing production of the enzyme that catalyzes that step

• Specific pathways were not known, so results were inconclusive

Beadle & Tatum• Treated Neurospora (a mold) with X-rays• Looked for mutations in nutritional

requirements– Wild type Neurospora grows on minimal medium

(agar enriched with a few nutrients)– All mutants will grow on complete medium (agar plus

all 20 amino acids & other nutrients)

• Identified the specific amino acid required for growth by each mutant– That identified the defective synthetic pathway– Looked at each intermediate step in the blocked

synthetic pathway

• Concluded that mutation in a single gene blocked production of a single enzyme

One Gene – One Polypeptide

• Not all proteins are enzymes• Can extend one gene = one

enzyme doctrine to one gene = one polypeptide

• Many proteins are comprised of two or more polypeptides

Central Dogma• How does the sequence of a strand of DNA

correspond to the amino acid sequence of a protein?

• The central dogma of molecular biology, states that:

Transcription & Translation

• DNA is first copied (transcribed) to an RNA intermediate

• The RNA intermediate is then translated to protein

• Why have an intermediate between DNA and the proteins it encodes?

Why RNA?• The DNA remains protected in the

nucleus, away from caustic enzymes in the cytoplasm.

• Gene information can be amplified – Many copies of an RNA can be made from

one copy of DNA. • Greater regulation of gene expression

– Specific controls can act at each step in the pathway between DNA and proteins.

– The more elements there are in the pathway, the more opportunities there are for control

What is RNA?• RNA has the same primary structure as DNA

– consists of a sugar-phosphate backbone, with nucleotides attached to the 1' C of the sugar.

• Differences between DNA and RNA : – Contains the sugar ribose instead of

deoxyribose– The nucleotide, uracil, is substituted for

thymine – RNA exists as a single-stranded molecule.

• Because of the extra hydroxyl group on the sugar, RNA is too bulky to form a a stable double helix.

• Regions of double helix can form where there is some base pair complementation resulting in hairpin loops.

Types of RNA• mRNA - messenger RNA

– A copy of a gene. – Has a sequence complementary to one strand

of the DNA & identical to the other strand. – Carries the information stored in DNA in the

nucleus to the ribosomes in the cytoplasm where protein is made.

• tRNA - transfer RNA – A small RNA with a very specific structure

that can bind an amino acid at one end, and mRNA at the other end.

– Acts as an ‘adaptor’ to carry & attach amino acids to the appropriate place on the mRNA.

Types of RNA (Cont.)• rRNA - ribosomal RNA

– One of the structural components of the ribosome.

– Has a sequence complimentary to regions of the mRNA

– Allows ribosome to bind to an mRNA

• snRNA - small nuclear RNA – Is involved in the machinery that

processes RNA's as they travel between the nucleus and the cytoplasm.

The Genetic Code • How does mRNA specify an amino acid

sequence? • It would be impossible for each amino

acid to be specified by one nucleotide – there are only 4 nucleotides and 20 amino

acids. – two nucleotide combinations could only

specify 16 amino acids.

• Each amino acid is specified by a combination of three nucleotides, called a codon

The Code is Redundant, Not Ambiguous

• Each amino acid may be specified by up to six codons– In many cases, codons that are synonyms differ

only in the third base of the triplet

• Different organisms have different frequencies of codon usage. – A giraffe might use CGC for arginine much more

often than CGA, and the reverse might be true for a sperm whale.

• Some codons specify “stop” (or “start)• There is no ambiguity

– the same codon ALWAYS codes for the same amino acid

Codons & Anticodons• How do tRNAs recognize to which

codon to bring an amino acid? • The tRNA has an anticodon on its

mRNA-binding end • The anticodon is complementary to

the codon on the mRNA. • Each tRNA only binds the

appropriate amino acid for its anticodon

t-RNA Structure

Transcription

• How does the sequence information from DNA get transferred to mRNA?

• How is this information carried to the ribosomes in the cytoplasm?

• This process is called transcription• Highly similar to DNA replication. • Different enzymes are used in

transcription. • The most important is RNA

polymerase

RNA Polymerase• RNA polymerase is a holoenzyme

– an agglomeration of many different factors

• Together, direct the synthesis of mRNA • Pries the DNA strands apart • Strings complimentary RNA nucleotides on

the DNA template• Like DNA polymerase, can only add to the

3’ end• So only one mRNA is made, elongating 5’

3’

Stages of Transcription

• Initiation• Elongation• Termination

Initiation• RNA polymerase must recognize the beginning of

a gene to know where to start synthesizing mRNA.

• One part of the enzyme has a high affinity for a particular DNA sequence that appears at the beginning of genes.

• The sequence where RNA polymerase attaches to the DNA and begins transcription = the promoter– a unidirectional sequence on one strand of the DNA

• Tells RNA polymerase both where to start and in which direction (that is, on which strand) to continue synthesis.

The Promoter• In prokaryotes, RNA polymerase

recognizes and binds the promoter• The bacterial promoter almost always

contains some version of the following elements:

Eukaryotic Promoters• In eukaryotes special proteins, transcription

factors, mediate binding RNA polymerase and the promoter

• RNA polymerase binds to the promoter only after transcription factors bind

• Transcription factors + RNA polymerase, bound to the promoter = transcription initiation complex

• Eukaryotic promoters usually include a TATA box– A nucleotide sequence containing TATA

about 25 nucleotides prior to the start point

Elongation• The RNA polymerase stretches open

the double helix at the start point in the DNA and begins synthesis of a complementary RNA strand on one of the DNA strands

• The RNA polymerase recruits RNA nucleotides in the same way that DNA polymerase recruits dNTPs.

• Since synthesis only proceeds in the 5' to 3' direction, there is no need for Okazaki fragments.

Sense & Antisense

• Synthesis only occurs in the 5’ to 3’ direction

• In transcription, only one DNA strand is copied

• We call the strand that is copied the antisense or template strand

• The other strand, which is identical to the mRNA made (substituting U for T), is the sense or coding strand.

Termination of Transcription • How does RNA polymerase know when to stop

transcribing a gene?• Sequence that signals the end of transcription =

terminator• RNA polymerase transcribes the terminator

– The transcribed terminator actually ends the process

• In prokaryotes there is no nucleus, so ribosomes can begin making protein from an mRNA immediately

• The terminator sequence of the mRNA allows it to form a hairpin loop, which blocks the ribosome. – The ribosome falls off the mRNA,– That signals termination by the RNA polymerase. – RNA polymerase falls off the DNA and transcription

ceases.

Eukaryotic Termination

• RNA polymerase continues for hundreds of nucleotides beyond the termination signal

• AAUAAA

• At a point 10 to 35 nucleotides past the AAUAAA, the forming m-RNA is cut free

• The cleavage site is the point of addition of a poly-A tail

Post Transcription Modification

• In eukaryotes, enzymes modify pre-mRNA before it is sent to the cytoplasm

• Both ends of the transcript are altered• The 5’ end is capped with modified guanine

– Protects mRNA from degradation– Helps attach the ribosome

• At the 3’ end an enzyme makes a poly-A tail formed from 50 to 250 adenine nucleotides– Inhibits degradation and helps ribosome attach– May also help export mRNA out of the nucleus

• Interior sections are cut out, and the remaining parts are spliced together

RNA Processing

Introns & Exons• Most eukaryotic genes and their RNA

transcripts have long noncoding stretches of nucleotides = introns

• Noncoding sequences are interspersed between coding sections

• Coding sections = exons• That is, the sequence of eukaryotic DNA

that codes for a polypeptide is not continuous

• RNA polymerase transcribes both introns and exons

RNA Splicing• Introns are cut out and exons are

spliced together before mRNA exits the nucleus

• Short nucleotide sequences at the end of introns are signals for RNA splicing

• Small nuclear ribonucleoproteins (snRNPs) recognize splice sites– Composed of snRNA & protein

• Several snRNPs and additional proteins form a complex = spliceosome

• At splice sites at the end of an intron, cuts out the intron and fuses the exons

The Spliceosome

Why Introns?• Introns may play regulatory role in the

cell• Split genes allow a single gene to code

more than one kind of polypeptide• Outcome depends on which sections are

treated as exons during RNA processing– Alternative RNA splicing

• May facilitate evolution of new proteins• Increase possibility of potentially

beneficial crossing-over of genes

Translation• How do messenger RNAs direct

protein synthesis?• The message encoded in the mRNA is

an amino acid sequence• mRNA travels to ribosome in the

cytoplasm, where the message is read• The specified amino acids are

assembled on the mRNA template on the ribosome

• Enzymes help form the sequenced amino acids into a polypeptide

The Ribosome • The cellular factory where proteins are

synthesized• Consists of structural RNA and ~ 80 different

proteins. • In its inactive state, it exists as two subunits

– a large subunit and a small subunit. • When the small subunit encounters an mRNA, it

begins translation of the mRNA to protein. • There are three sites in the large subunit

– The A site accepts a new tRNA bearing an amino acid

– the P site bears the tRNA attached to the growing chain.

– The E site contains the exiting tRNA

http://web.mit.edu/esgbio/www/cb/org/organelles.html#nucleus

Charging the tRNA • tRNA (transfer RNA) acts as a translator

between mRNA and protein • Each tRNA has a specific anticodon and an

amino acid acceptor site. • Each tRNA also has a specific charger protein;

– This protein can only bind to that particular tRNA and attach the correct amino acid to the acceptor site.

– These charger proteins are called aminoacyl tRNA synthetases

• The energy to make this bond comes from ATP.

Aminoacyl-tRNA Synthases• Each tRNA must match with the correct amino acid

– Each tRNA must attach only the amino acid specified by the mRNA codon to which the tRNA anticodon binds

• The amino acid is joined to the tRNA by an aminoacyl-tRNA synthase– There are 20 of these enzymes; one for each amino acid

• Catalyzes the covalent bond between the amino acid and tRNA

• The active site of each aminoacyl-tRNA synthase fits only a specific amino acid and tRNA

• Once the amino acid is bound, the tRNA is aminoacyl tRNA

Wobble• If there was one tRNA for each mRNA

codon, there would be 61 different tRNAs

• Actually, there are fewer• Some tRNAs have anticodons that

recognize 2 or more different codons• Base pairing rules between the third

base of a codon and its tRNA anticodon are not a rigid as DNA to mRNA pairing– Example: U in tRNA can pair with either A

or T in the third position of an mRNA codon• This flexibility is called wobble

Initiation of Translation• The start signal for translation is the

codon ATG– Codes for methionine. – Not every protein starts with methionine,– Often this first amino acid will be removed

in post-translational processing. • A tRNA charged with methionine binds

to the translation start signal. • The large subunit binds to the mRNA

and the small subunit• Elongation begins.

Elongation of the New Protein

• After the first charged tRNA appears in the A site, the ribosome shifts so that the tRNA is in the P site.

• New charged tRNAs, corresponding the codons of the mRNA, enter the A site, and a peptide bond is formed between the two amino acids.

• The first tRNA is now released • The ribosome shifts again so that a tRNA

carrying two amino acids is now in the P site• A new charged tRNA can bind to the A site. • This process of elongation continues until the

ribosome reaches a stop codon.

Termination of the Protein

• When the ribosome reaches a stop codon, no aminoacyl tRNA binds to the empty A site.

• This is the ribosome’s signal to break into its large and small subunits,

• Releasing the new protein and the mRNA.

Polyribosomes

• A single mRNA can be used to make many copies of a polypeptide at the same time

• Multiple ribosomes can read the same mRNA strand, like beads on a string

• These strings are called polyribosomes

Polyribosomes

Post-Translational Processing

• This isn't always the end of the story for the new protein.

• Often it will undergo post-translational modifications.

• Modifications include: • Cleavage by a proteolytic (protein-cutting)

enzyme at a specific place. • Having some amino acids altered.

– For example, a tyrosine residue might be phosphorylated.

• Become glycosylated. – Many proteins have carbohydrates covalently attached

to asparagine residues.

Mutations

• What kinds of errors can occur in DNA?

• What causes them? • What are their effects? • Types of mutations:

– Chromosomal mutations – Point mutations– Frameshift mutations

Chromosomal Mutations • Mutations that occur at a macroscopic

level. • Large sections of chromosomes can be

altered or shifted, leading to changes in the way genes are expressed.

• Types of chromosomal mutations:– Translocations– Inversions– Deletions– Nondisjunction

Translocations & Inversions

• Translocation– The interchange of large segments of DNA

between two chromosomes. – Can change gene expression if a gene is at

the translocation breakpoint or if it is reattached so that it is incorrectly regulated

• Inversion– Occurs when a region of DNA flips its

orientation with respect to the rest of the chromosome.

– Rotates, end for end– This can lead to the same problems as

translocations.

Deletions & Nondisjunction

• Deletion– Sometimes large regions of a chromosome are

deleted. – This can lead to a loss of important genes.

• Nondisjunction– Sometimes chromosomes do not divide correctly in

cell division– When large regions of a chromosome are altered

(such as translocation), the chromosome may not segregate properly during cell division

– One daughter cell will end up with extra genetic material, one will end up with less than its share

– This is called nondisjunction. – When there are extra or too few copies of a gene,

the cell will have problems

Point Mutations• Point mutations are single base pair changes. • Three possible outcomes:• Nonsense mutation

– Creates a stop codon where none previously existed. – This shortens the resulting protein, possibly

removing essential regions.

• Missense mutation – Changes the code of the mRNA. – Which changes the resulting amino acid– This may alter the shape and properties of the

protein.

• Silent mutation – Has no effect on protein sequence. – Because the genetic code is redundant, some

changes have no effect

Frameshift Mutations

• Insertions or deletions have a disastrous effect

• mRNA is “read” as a series of three letter words

• Insertions or deletions that are not multiples of three, shift the reading frame

Frameshift Example• Given the coding sequence:

AGA UCG ACG UUA AGC• corresponding to the protein:

arginine - serine - threonine - leucine - serine• The insertion of a C-G base pair between bases

6 and 7 would result in the following new code: AGA UCG CAC GUU AAG C

• which would result in a non-functional protein: arginine - serine - histidine - valine - lysine

• Every amino acid after the insertion will be wrong.

• The frame shift might even generate a stop codon which would prematurely end the protein.

DNA Repair• If replication of DNA proceeded as was

described previously, DNA polymerase would make a mistake on average about every 1000 base pairs.

• This level would be unacceptable, because too many genes would be rendered non-functional.

• Organisms have elaborate DNA proofreading and repair mechanisms, which can recognize false base-pairing and DNA damage, and repair it.

• The actual error rate is more in the region of one in a million to one in a billion.

The Beauty of Mutations• Why mutations?• Our environment constantly changes, the

Earth and its ecosystems change. • Populations must change to survive• Evolutionary change requires variation, the

raw material on which natural selection works• One mechanism for variation and change is at

the DNA level. • Mutations can be beneficial and enable the

organism to adapt to a changing environment. • However, most mutations are deleterious, and

cause varied genetic diseases

Molecular Genetics Part II

Education

rna intermediate

rna polymerase rna polymerase

small rna

initiation rna

thymine rna

amino acid sequence

specific amino acid

dna polymerase