Top Banner
Predicting Protein Predicting Protein Structure: Structure: Comparative Modeling Comparative Modeling (homology modeling) (homology modeling)
33

Predicting Protein Structure: Comparative Modeling (homology modeling)

Dec 13, 2015

Download

Documents

Hubert Ross
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Predicting Protein Structure: Comparative Modeling (homology modeling)

Predicting Protein Structure:Predicting Protein Structure:Comparative ModelingComparative Modeling(homology modeling)(homology modeling)

Page 2: Predicting Protein Structure: Comparative Modeling (homology modeling)

??

KQFTKCELSQNLYDIDGYGRIALPELICTMFHTSGYDTQAIVENDESTEYGLFQISNALWCKSSQSPQSRNICDITCDKFLDDDITDDIMCAKKILDIKGIDYWIAHKALCTEKLEQWLCEKE

Predicting Protein Structure:Predicting Protein Structure:Comparative ModelingComparative Modeling

(formerly, homology modeling)(formerly, homology modeling)

Use as template & model

8lyz1alc

KVFGRCELAAAMKRHGLDNYRGYSLGNWVCAAKFESNFNTQATNRNTDGSTDYGILQINSRWWCNDGRTPGSRNLCNIPCSALLSSDITASVNCAKKIVSDGNGMNAWVAWRNRCKGTDVQAWIRGCRLShare

Similar Sequence

Homologous

Page 3: Predicting Protein Structure: Comparative Modeling (homology modeling)

• In an ideal world, we would be able to accurately predict protein structure from the sequence only!

• Because of the myriad possible configurations of a protein chain – This goal can’t reliably be achieved, yet.

• Knowledge based prediction vs. Simulation based on physical forces.

• Here we will only concern ourselves with knowledge-based methods, although we might use simulation in order to optimize our models.

Structure prediction

Page 4: Predicting Protein Structure: Comparative Modeling (homology modeling)

MNIFEMLRID EGLRLKIYKD TEGYYTIGIG HLLTKSPSLN AAKSELDKAI GRNCNGVITKDEAEKLFNQD VDAAVRGILR NAKLKPVYDS LDAVRRCALI NMVFQMGETG VAGFTNSLRMLQQKRWDEAA VNLAKSRWYN QTPNRAKRVI TTFRTGTWDA YKNL

Can we predict protein

structures ?

• ab initio folding simulation: not yet ...• Rosetta approach: neither ...• Fold recognition (threading):

Often works, but ...• ???

Page 5: Predicting Protein Structure: Comparative Modeling (homology modeling)

obtain sequence (target)

fold assignment

comparativemodeling

ab initiomodeling

build, assess model

Approaches to predicting protein structures

Page 6: Predicting Protein Structure: Comparative Modeling (homology modeling)

Homology Modelling of Proteins

• Definition: Prediction of three dimensional structure of a target protein from the

amino acid sequence (primary structure) of a homologous (template) protein for which an X-ray or NMR structure is available.

• Why a Model:A Model is desirable when either X-ray crystallography or NMR spectroscopy cannot determine the structure of a protein in time or at all. The built model provides a wealth of information of how the protein functions with information at residue property level. This information can than be used for mutational studies or for drug design.

Page 7: Predicting Protein Structure: Comparative Modeling (homology modeling)

Homology modeling

= Comparative protein modeling = Knowledge-based modeling

Idea: Extrapolation of the structure for a new (target) sequence from the known 3D-structures of related family members (templates).

Page 8: Predicting Protein Structure: Comparative Modeling (homology modeling)

Homology models have RMSDs less than 2Å more than 70% of the time.

Homology models can be very smart!

Page 9: Predicting Protein Structure: Comparative Modeling (homology modeling)

.

0

20

40

60

80

100

0 50 100 150 200 250

identity

Number of residues aligned

Perc

enta

ge s

equence

identi

ty/s

imila

rity

(B.Rost, Columbia, NewYork)

Sequence identity implies structural similarity

Don’t know region .....

Sequence similarity implies structural similarity?

Page 10: Predicting Protein Structure: Comparative Modeling (homology modeling)

Step 1 in Homology Modeling - Fold Identification

Aim: To find a template or templates structures from protein data base

Improved Multiple sequence alignment methods improves sensitivity - remote homologs

PSIBLAST, CLUSTAL

pairwise sequence alignment - finds high homology sequences BLAST

http://www.ncbi.nlm.nih.gov/BLAST/

Page 11: Predicting Protein Structure: Comparative Modeling (homology modeling)

Comparative ModelingKnown Structures

(Templates)

Target Sequence Template Selection

Alignment Template - Target

Structure modeling

Structure Evaluation &Assessment

HomologyModel(s)

• Protein Data Bank PDB http://www.pdb.org

Database of templates

• Separate into single chains• Remove bad structures

(models)• Create BLAST database

Page 12: Predicting Protein Structure: Comparative Modeling (homology modeling)

Model Building from template

Multiple templates

Protein Fold

Core conserved regions

Variable Loop regions

Side chains

Calculate the framework from average of all template structures

Generate one model for each template and evaluate

Page 13: Predicting Protein Structure: Comparative Modeling (homology modeling)
Page 14: Predicting Protein Structure: Comparative Modeling (homology modeling)

I. Manual Modeling

[ http://www.expasy.org/spdbv/ ]

Page 15: Predicting Protein Structure: Comparative Modeling (homology modeling)

• averaging core template backbone atoms

(weighted by local sequence similarity with the target sequence)

• Leave non-conserved regions (loops) for later ….

a) Build conserved core framework

II. Template based fragment assembly

Page 16: Predicting Protein Structure: Comparative Modeling (homology modeling)

Dressing up the Core ModelCore Model-Rigid Body Assembly

Add Side chains

Add loops

End Game in protein folding - Molecular dynamics of all atoms in

explicit solvent

Page 17: Predicting Protein Structure: Comparative Modeling (homology modeling)

• use the “spare part” algorithm to find

compatible fragments in a Loop-Database

• “ab-initio” rebuilding of loops (Monte Carlo,

molecular dynamics, genetic algorithms, etc.)

b) Loop modeling

II. Template based fragment assembly

Page 18: Predicting Protein Structure: Comparative Modeling (homology modeling)

Loop BuildersMini protein folding problem-

3 to 10 residues longer in membrane proteins

Ab Initio methods - generates various

random conformations of loops and score

Compare the loop sequence string to

DB and get hits and evaluate.

Some Homology modeling methods

have less number of loops to be added

because of extensive multiple

sequence alignment of profiles

Loops result from

substitutions, insertions and deletions in

the same family

Page 19: Predicting Protein Structure: Comparative Modeling (homology modeling)

Using database of loops which appear in known structures. The loops could be catagorised by their length or sequence

Ab initio methods - without any prior knowledge. This is done by empirical scoring functions that check large number of conformations and evaluates each of them.

Construction of loops might be done by:

Page 20: Predicting Protein Structure: Comparative Modeling (homology modeling)
Page 21: Predicting Protein Structure: Comparative Modeling (homology modeling)
Page 22: Predicting Protein Structure: Comparative Modeling (homology modeling)

c) Side Chain placement

Find the most probable side chain

conformation, using

• homologues structures • back-bone dependent rotamer libraries

• energetic and packing criteria

II. Template based fragment assembly

Page 23: Predicting Protein Structure: Comparative Modeling (homology modeling)

• modeling will produce unfavorable contacts and bonds

idealization of local bond and angle geometry

• extensive energy minimization will move coordinates away

keep it to a minimum

• SwissModel is using GROMOS 96 force field for a steepest descent

d) Energy minimization

II. Template based fragment assembly

Page 24: Predicting Protein Structure: Comparative Modeling (homology modeling)

d) Energy minimization

II. Template based fragment assembly

Page 25: Predicting Protein Structure: Comparative Modeling (homology modeling)

Homology Modeling Programs

Modeller (http://guitar.rockefeller.edu/modeller)

Swiss-Model (http://www.expasy.ch/swissmod)

Whatif (http://www.cmbi.kun.nl/whatif)

Page 26: Predicting Protein Structure: Comparative Modeling (homology modeling)

Swiss-Model• Method:

Knowledge-based approach.

• Requirements:At least one known 3D-structure of a related protein. Good quality sequence alignements.

• Procedures:Superposition of related 3D-structures. Generation of a multiple a alignement.Generation of a framework for the new sequence. Rebuild lacking loops. Complete and correct backbone. Correct and rebuild side chains. Verify model structure quality and check packing. Refine structure by energy minimisation and molecular dynamics.

Page 27: Predicting Protein Structure: Comparative Modeling (homology modeling)

Model Confidence Factors

The Model B-factors are determined as follows:

• The number of template structures used for model building.

• The deviation of the model from the template structures.

• The Distance trap value used for framework building.

The Model B-factor is computed as:

85.0 * (1/ # selected template str.) * (Distance trap / 2.5)

and

99.9 for all atoms added during loop and side-chain building

Page 28: Predicting Protein Structure: Comparative Modeling (homology modeling)

Verifying the Model

• PROCHECK

• WHAT IF

• PROSA II

• VERIFY 3D, Profile3D

Page 29: Predicting Protein Structure: Comparative Modeling (homology modeling)

Errors in Models !!!

• Incorrect template selection

• Incorrect alignments

• Errors in positioning of sidechains and loops

Page 30: Predicting Protein Structure: Comparative Modeling (homology modeling)

General Structure Prediction Scheme

Any given protein sequence

Structure selection

Check sequence identity with proteins with known structure

HomologyModeling

> 35%

Fold Recognition

ab initioFolding

< 35%< 35%

Structure refinement

Final Structure

Structure selection

Page 31: Predicting Protein Structure: Comparative Modeling (homology modeling)

Baker and Sali (2000)

Page 32: Predicting Protein Structure: Comparative Modeling (homology modeling)

EVA

Evaluation of Automatic protein structure prediction

[ Burkhard Rost, Andrej Sali, http://maple.bioc.columbia.edu/eva/ ]

CASPCommunity Wide Experiment on the Critical Assessment of Techniques for Protein Structure Predictionhttp://PredictionCenter.llnl.gov/casp5/

3D - Crunch

Very Large Scale Protein Modelling Project

http://www.expasy.org/swissmod/SM_LikelyPrecision.html

Model Accuracy Evaluation

Page 33: Predicting Protein Structure: Comparative Modeling (homology modeling)

Several web pages for homology modeling

COMPOSER – felix.bioccam.ac.uksoft-base.html

MODELLER – guitar.rockefeller.edu/modeller/modeller.html

WHAT IF – www.sander.embl-heidelberg.de/whatif/

SWISS-MODEL – www.expasy.ch/SWISS-MODEL.html