Top Banner
USING MOTION PLANNING TO STUDY PROTEIN FOLDING PATHWAYS by Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi
26

By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

Dec 17, 2015

Download

Documents

Nelson Rogers
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

USING MOTION PLANNING TO

STUDY PROTEIN FOLDING

PATHWAYSby Guang Song and Nancy M. Amato

Journal of Computational Biology,

April 1, 2002

Presentation by Athina Ropodi

Page 2: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

OUTLINE Introduction- Definition

Motion planning Probabilistic Roadmap Method Denaturation C-space

Probabilistic Roadmap Method Basic steps Degrees of freedom/C-space Node Generation Sampling strategy Potential Energy Computations

Results

Page 3: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

INTRODUCTION- MOTION PLANNING Motion planning is a term used in

robotics for the process of detailing a task into atomic robotic motions.

This issue is also known as the “navigation problem”:

Given an environment, a start and a goal position of an object, the objective is to find a valid path (continuous sequence of validconfigurations) from start to goal.

Page 4: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

PRM- DENATURATION Probabilistic Roadmap (PRM) motion

planning techniques are applied to “small” proteins (up to 60 residues) in order to compute folding pathways from a denaturated state to its native fold.

Denaturate: To cause the structure to unfold, so that some of its original properties, especially its biological activity, are diminished or eliminated. Usually caused by extreme conditions, e.g. high temperature.

Page 5: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

HOW DENATURATION OCCURS AT LEVELS OF PROTEIN STRUCTURE In quaternary structure denaturation, protein sub-units

are dissociated and/or the spatial arrangement of protein subunits is disrupted.

Tertiary structure denaturation involves the disruption of: Covalent interactions between amino acid side chains (such as

disulfide bridges between cysteine groups) Noncovalent dipole-dipole interactions between polar amino acid

side chains (and the surrounding solvent) Van der Waals (induced dipole) interactions between nonpolar

amino acid side chains. In secondary structure denaturation, proteins lose all

regular repeating patterns such as alpha-helices and beta-pleated sheets, and adopt a random coil configuration.

Primary structure, such as the sequence of amino acids held together by covalent peptide bonds, is not disrupted by denaturation

Page 6: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

BASIC ASSUMPTION The article investigates the folding

mechanisms of a protein assuming we know its native fold.

Results are validated by comparing the formation order to pulse-labeling experimental results.

The configuration space (C-Space) of a movable object is the space consisting of all positions and orientations of that object.

Pulse labeling is a biochemistry technique of identifying the target molecule presence by inclusion of a pulse of a radioactive compound.

Page 7: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

OUTLINE Introduction- Definition

Motion planning Probabilistic Roadmap Method Denaturation C-space

Probabilistic Roadmap Method Basic steps Degrees of freedom/C-space Node Generation Sampling strategy Potential Energy Computations

Results

Page 8: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

PROBABILISTIC ROADMAP METHOD FOR PROTEIN FOLDING Any complete motion planner would require

time exponential in the number of degrees of freedom (dof).

Several methods such as energy minimization, molecular dynamics, Monte-Carlo and genetic algorithms have been used.

This method tries to simulate the true dynamics of the folding process using the classical Newton’s motion equations.

However, an exact simulation would depend on the start conformation and could result in local minima.

Page 9: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

PROBABILISTIC ROADMAP METHOD FOR PROTEIN FOLDING

The basic steps are: First of all, PRM samples points randomly

from C-space and retains those that satisfy certain requirements.

Then, the points are connected and form a graph using a simple planning method.

Finally, paths connecting the start and goal configurations are extracted using standard graph search techniques.

[Kavraki, Svestka, Latombe,Overmars 1996]

In this case, low-energy conformatio

ns are preferable.

Page 10: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

PROBABILISTIC ROADMAP METHOD FOR PROTEIN FOLDING

Page 11: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

C-SPACES All atomic bond lengths and angles are

considered to be constants. We only consider 2 dofs, φ and ψ angles. Side-chains are modeled as spheres

with no dof. Fold positions (atomic bonds)

correspond to joints and atoms correspond to links.

Thus, for k residues our model has 2k links and 2k revolute joints…

Page 12: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

NODE GENERATION Different configurations are produced by

assigning possible angle values. But, the nodes are accepted or rejected

based on their potential energy:

where Emin=50000 and Emax=89000KJouls/mol

A configuration with higher potential is

more likely to be rejected.

Page 13: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

SAMPLING STRATEGY Due to the high dimensionality of the

problem, a very dense uniform sampling is required.

Since prior knowledge of the native fold is assumed, a sampling strategy biased to the native fold is applied:

Sampling is performed from a set of normal distributions around the native fold. The standard deviations used are {5, 10, 20, 40, 80,160 degrees}.Small STDs capture the

detail around the goal, and larger ensure

adequate roadmap coverage

Page 14: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

CONSTRUCTION OF ROADMAP

For each node, the k-nearest neighbors are found(k=20 and the metric is Euclidean).

For each connection a feasibility check is performed. (two nodes are connected by a straight line)

For 2 consecutive intermediate conformations, i and i+1, we first check their potential energies and then the probability of moving from i to i+1:

RMSD metric proved inferior

Page 15: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

CONSTRUCTION OF ROADMAP

The total weight of the edge is:

Dijkstra’s algorithm is then used to find the smallest weight path.

Path optimization: resampling is performed around the nodes of paths with high potential.

Page 16: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

POTENTIAL ENERGY COMPUTATIONS

To reduce the cost of calculations, an approximation function is used. We only consider contribution from side chains and those are modeled as spheres.

The cost is then reduced by 2 orders of magnitude.

The 1st term represents

constraints that favor secondary structure,

hydrogen and disulphide bonds

The 2nd the van der Waals interactions

among atoms.

Page 17: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

OUTLINE Introduction- Definition

Motion planning Probabilistic Roadmap Method Denaturation C-space

Probabilistic Roadmap Method Basic steps Degrees of freedom/C-space Node Generation Sampling strategy Potential Energy Computations

Results

Page 18: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

RESULTS

Page 19: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

RESULTS Folding process of GB1:

Page 20: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

RESULTS Protein GB1(56

residues): 1 α-helix and 4 β-strands

Protein A (60 residues): 3 α-helices

Page 21: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

PROTEIN GB1: PHI/PSI DISTRIBUTION OF ROADMAP NODES

Page 22: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

PROTEIN GB1: POTENTIAL VS RMSDDISTRIBUTION OF ROADMAP NODES

Page 23: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

PROTEIN GB1: EFFECT OF RESOLUTION POTENTIAL PROFILES FOR DIFFERENT SIZE ROADMAPS

Peaks show where atoms are close and

Van der Waals interactions dominate.

Bigger roadmaps have smoother

paths.

Page 24: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

EFFECT OF RE-SAMPLING AROUND THE PEAKS

More samples around the peaks improve the path.

Page 25: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

WHAT ABOUT DIFFERENT START CONFORMATIONS?

Different pathways tend to come together and appear to have some common channels, as they approach the native fold.

Page 26: By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

THANK YOU!Bibliography:1. Probabilistic roadmaps for path

planning in high-dimensional configuration spacesL. Kavraki, P. Svestka, J-C. Latombe, M. H. Overmars, 1996

2. Protein Folding by Restrained Energy Minimization and Molecular Dynamics Michael Levitt,1983

3. http://faculty.cs.tamu.edu/amato/dsmft/research/folding/index.shtml.OLD2