Top Banner
RNA Secondary RNA Secondary Structure Structure Prediction Prediction
16

RNA Secondary Structure Prediction

Feb 09, 2016

Download

Documents

Callum

RNA Secondary Structure Prediction. 16s rRNA. RNA Secondary Structure. Pseudoknot. Dangling end. Single- Stranded. Interior Loop. Bulge. Junction (Multiloop). Stem. Hairpin loop. Image– Wuchty. RNA secondary structure. G A A A G G A-U U-G C-G - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: RNA Secondary Structure Prediction

RNA Secondary RNA Secondary Structure PredictionStructure Prediction

Page 2: RNA Secondary Structure Prediction
Page 3: RNA Secondary Structure Prediction

16s rRNA

Page 4: RNA Secondary Structure Prediction

RNA Secondary Structure

Hairpin loop

Junction (Multiloop)Bulge

Single-Stranded Interior Loop

Stem

Image– Wuchty

Pseudoknot

Dangling end

Page 5: RNA Secondary Structure Prediction

RNA secondary structure

G A

A A G G

A-U U-G C-G A-U G-C

Loop

Stem

wobble pair

canonical pair

Page 6: RNA Secondary Structure Prediction

Legitimate structurePseudoknots

RNA secondary structure representation

Page 7: RNA Secondary Structure Prediction

Non-canonical interactions of RNA Non-canonical interactions of RNA secondary-structure elementssecondary-structure elements

Pseudoknot

Kissing hairpins

Hairpin-bulge contact

These patterns are excluded from the prediction schemes as their computation is too intensive.

Page 8: RNA Secondary Structure Prediction

“Rules for 2D RNA prediction”• Base Pairs in stems: GOOD• Additional possible assumptions: e.g.,

G:C better than A:T• Bulges, Loops: BAD• Canonical Interactions (base pairs, stems,

bulges, loops): OK• Non canonical interactions (pseudoknots,

kissing hairpins): Forbidden• The more interactions: The better

Page 9: RNA Secondary Structure Prediction

Predicting RNA secondary Structure

• Allowed base pairing rules (Watson-Crick A:U, G:C, and Wobble pair G:U)

• Sequences may form different structures• An free energy value is associated with each

possible structure• Predict the structure with the minimal free

energy (MFE)

Page 10: RNA Secondary Structure Prediction

Simplifying Assumptions for Structure Prediction

• RNA folds into one minimum free-energy structure.

• There are no non-canonical interactions.• The energy of a particular base pair in a double

stranded regions is sequence independent– Neighbors have no influence.

Was solved by dynamic programmingZucker and Steigler 1981

Page 11: RNA Secondary Structure Prediction

Sequence-dependent free-energy (the nearest neighbor model)

U U

C G G C A UG CA UCGAC 3’

U U

C G U A A UG CA UCGAC 3’

Example values:GC GC GC GCAU GC CG UA -2.3 -2.9 -3.4 -2.1

Page 12: RNA Secondary Structure Prediction

Free energy computationFree energy computation

U UA A G C G C A G C U A A U C G A U A 3’A5’

-0.3

-0.3

-1.1 mismatch of hairpin-2.9 stacking

+3.3 (1 nt bulge) -2.9 stacking

-1.8 stacking

5’ dangling

-0.9 stacking-1.8 stacking

-2.1 stacking

G= -4.6 KCAL/MOL

+5.9 (4 nt loop)

Page 13: RNA Secondary Structure Prediction

Prediction Programs

• Mfoldhttp://www.bioinfo.rpi.edu/applications/mfold/old/rna/form1.cgi

• Vienna RNA Secondary Structure Predictionhttp://rna.tbi.univie.ac.at/cgi-bin/RNAfold.cgi

Page 14: RNA Secondary Structure Prediction

Mfold - Suboptimal FoldingMfold - Suboptimal Folding• For any sequence of N nucleotides, the expected

number of structures is greater than 1.8N

• A sequence of 100 nucleotides has ~31025 possible folds. If a computer can calculate 1000 folds/second, it would take 1015 years (age of universe = ~1010 years)!

• Mfold generates suboptimal folds whose free energy fall within a certain range of values. Many of these structures are different in trivial ways. These suboptimal folds can still be useful for designing experiments.

Page 15: RNA Secondary Structure Prediction

Example:

Page 16: RNA Secondary Structure Prediction

Output: