Top Banner
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure
34

Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Dec 25, 2015

Download

Documents

Nathan Jackson
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Protein Structure Prediction

Dr. G.P.S. Raghava

ProteinSequence +

Structure

Page 2: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Protein Structure Prediction• Experimental Techniques

– X-ray Crystallography

– NMR

• Limitations of Current Experimental Techniques– Protein DataBank (PDB) -> 23000 protein structures

– SwissProt -> 100,000 proteins

– Non-Redudant (NR) -> 10,00,000 proteins

• Importance of Structure Prediction– Fill gap between known sequence and structures

– Protein Engg. To alter function of a protein

– Rational Drug Design

Page 3: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Different Levels of Protein Structure

Page 4: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Page 5: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Protein Architecture

• Proteins consist of amino acids linked by peptide bonds

• Each amino acid consists of:– a central carbon atom– an amino group– a carboxyl group and– a side chain

• Differences in side chains distinguish the various amino acids

Page 6: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Amino Acid Side Chains

Vary in:

• Size

• Shape

• Polarity

Page 7: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Peptide Bond

Page 8: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Peptide Bonds

Page 9: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Dihedral Angles

Page 10: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Conformation Flexibility• Backbone (main

chain of atoms in peptide bonds, minus side chains)

conformation:– Torsion or rotation

angles around:• C-N bond ()• C-C bond ()

– Sterical hinderance:• Most – Pro• Least - Gly

Page 11: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Ramachandran Plot

Page 12: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Protein Secondary Structure

Secondary Structure

Regular Secondary Structure(-helices, -sheets)

Irregular SecondaryStructure(Tight turns, Random coils, bulges)

Page 13: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Secondary Structure:Helices

H-bond

IndividualAmino acid

ALPHA HELIX : a result of H-bonding between every fourth peptide bond (via amino and carbonyl groups) along the length of the polypeptide chain

Page 14: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Page 15: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Helix formation is localresidues

iandi+3

THYROID hormone receptor (2nll)

Page 16: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Secondary Structure:Beta Sheets

BETA PLEATED SHEET: a result of H-bonding between polypeptide chains

Page 17: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

-sheet formation is NOT local

Erabutoxin (3ebx)

Page 18: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Definition of -turn

A -turn is defined by four consecutive residues i, i+1, i+2 and i+3 that do not form a helix and have a C(i)-C(i+3) distance less than 7Å and the turn lead to reversal in the protein chain. (Richardson, 1981).

The conformation of -turn is defined in terms of and of two central residues, i+1 and i+2 and can be classified into different types on the basis of and .

i

i+1 i+2

i+3H-bond

D <7Å

Page 19: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Page 20: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Tight turns

Type No. of residues H-bonding

-turn 2 NH(i)-CO(i+1)

-turn 3 CO(i)-NH(i+2)

-turn 4 CO(i)-NH(i+3)

-turn 5 CO(i)-NH(i+4)

-turn 6 CO(i)-NH(i+5)

Page 21: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Secondary Structureshortcuts

Page 22: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Tertiary Structure: Hexokinase (6000 atoms, 48 kD, 457 amino acids)

polypeptides with a tertiary level of structure are usually referred to as

globular proteins, since their shape is irregular and globular in form

Page 23: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Quarternary Structure:Haemoglobin

Page 24: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

What determines fold?• Anfinsen’s experiments in 1957

demonstrated that proteins can fold spontaneously into their native conformations under physiological conditions. This implies that primary structure does indeed determine folding or 3-D stucture.

• Some exceptions exist– Chaperone proteins assist folding– Abnormally folded Prion proteins can

catalyze misfolding of normal prion proteins that then aggregate

Page 25: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Levels of Description of Structural Complexity

• Primary Structure (AA sequence)• Secondary Structure

– Spatial arrangement of a polypeptide’s backbone atoms without regard to side-chain conformations , , coil, turns (Venkatachalam, 1968)

– Super-Secondary Structure , , /, + (Rao and Rassman, 1973)

• Tertiary Structure – 3-D structure of an entire polypeptide

• Quarternary Structure– Spatial arrangement of subunits (2 or more polypeptide chains)

Page 26: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Techniques of Structure Prediction

• Computer simulation based on energy calculation– Based on physio-chemical principles

– Thermodynamic equilibrium with a minimum free energy

– Global minimum free energy of protein surface

• Knowledge Based approaches– Homology Based Approach

– Threading Protein Sequence

– Hierarchical Methods

Page 27: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Energy Minimization Techniques

Energy Minimization based methods in their pure form, make no priori assumptions and attempt to locate global minma.

• Static Minimization Methods– Classical many potential-potential can be construted

– Assume that atoms in protein is in static form

– Problems(large number of variables & minima and validity of potentials)

• Dynamical Minimization Methods– Motions of atoms also considered

– Monte Carlo simulation (stochastics in nature, time is not cosider)

– Molecular Dynamics (time, quantum mechanical, classical equ.)

• Limitations– large number of degree of freedom,CPU power not adequate

– Interaction potential is not good enough to model

Page 28: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Molecular Dynamics• Provides a way to observe the motion of

large molecules such as proteins at the atomic level – dynamic simulation

• Newton’s second law applied to molecules• Potential energy function

– Molecular coordinates– Force on all atoms can be calculated, given this

function– Trajectory of motion of molecule can be

determined

Page 29: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Knowledge Based Approaches

• Homology Modelling– Need homologues of known protein structure– Backbone modelling– Side chain modelling – Fail in absence of homology

• Threading Based Methods– New way of fold recognition– Sequence is tried to fit in known structures– Motif recognition– Loop & Side chain modelling– Fail in absence of known example

Page 30: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Homology Modeling

• Simplest, reliable approach

• Basis: proteins with similar sequences tend to fold into similar structures

• Has been observed that even proteins with 25% sequence identity fold into similar structures

• Does not work for remote homologs (< 25% pairwise identity)

Page 31: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Homology Modeling

• Given:– A query sequence Q– A database of known protein structures

• Find protein P such that P has high sequence similarity to Q

• Return P’s structure as an approximation to Q’s structure

Page 32: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Threading

• Given:– sequence of protein P with unknown structure

– Database of known folds

• Find:– Most plausible fold for P

– Evaluate quality of such arrangement

• Places the residues of unknown P along the backbone of a known structure and determines stability of side chains in that arrangement

Page 33: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.

Hierarcial Methods

Intermidiate structures are predicted, instead of predicting tertiary structure of protein from amino acids sequence

• Prediction of backbone structure– Secondary structure (helix, sheet,coil)

– Beta Turn Prediction

– Super-secondary structure

• Tertiary structure prediction

• Limitation

Accuracy is only 75-80 %

Only three state prediction

Page 34: Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.