Top Banner
Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude Latombe, Chris Varma Computer Science Department Stanford University 1 Department of Biochemistry, Stanford University 2 Computer Science Department, University of North Carolina
42

Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Dec 21, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Probabilistic Roadmaps: A Tool for Computing

Ensemble Properties of Molecular Motions

Serkan Apaydin, Doug Brutlag1

Carlos Guestrin, David Hsu2

Jean-Claude Latombe, Chris Varma

Computer Science DepartmentStanford University

1 Department of Biochemistry, Stanford University2 Computer Science Department, University of North Carolina

Page 2: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Goal of our ResearchGoal of our Research

Develop efficient computational representations and algorithms to study molecular pathways for protein folding and ligand-protein binding

Protein folding RECOMB ’02Ligand-protein binding ECCB ‘02

Page 3: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

AcknowledgementsAcknowledgements

People: Leo Guibas Michael Levitt, Structural BiologyItay Lotan Vijay Pande, ChemistryFabian Schwarzer Amit SinghRohit Singh

Funding: NSF-ITR ACI-0086013Stanford’s Bio-X and Graduate Fellowship

programs

Page 4: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Analogy with RoboticsAnalogy with Robotics

Page 5: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Configuration SpaceConfiguration Space

Approximate the free space by random sampling

Probabilistic Roadmaps

Page 6: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Probabilistic RoadmapProbabilistic Roadmap

free space

[Kavraki, Svetska, Latombe,Overmars, 95][Kavraki, Svetska, Latombe,Overmars, 95]

Page 7: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Probabilistic CompletenessProbabilistic Completeness

The probability that a roadmap fails to correctly capture the connectivity of the

free space goes to 0 exponentially in the number of milestones (~ running time).

Random sampling is convenient incremental

scheme for approximating the free space

Page 8: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Computed ExamplesComputed Examples

Page 9: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Biology Biology Robotics Robotics

Energy field, instead of joint controlContinuous energy field, instead of binary free and in-collision spacesMultiple pathways, instead of single collision-free pathPotentially many more degrees of freedomRelation to real world is more complex

Page 10: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Initial WorkInitial Work[Singh, Latombe, Brutlag, 99][Singh, Latombe, Brutlag, 99]

Study of ligand-protein bindingProbabilistic roadmaps with edges weighted by energetic plausibilitySearch of most plausible paths

Page 11: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Initial WorkInitial Work[Singh, Latombe, Brutlag, 99][Singh, Latombe, Brutlag, 99]

Study of ligand-protein bindingProbabilistic roadmaps with edges weighted by energetic plausibilitySearch of most plausible pathsStudy of energy profiles along such paths

CatalyticSite

energy

Page 12: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Initial WorkInitial Work[Singh, Latombe, Brutlag, 99][Singh, Latombe, Brutlag, 99]

Study of ligand-protein bindingProbabilistic roadmaps with edges weighted by energetic plausibilitySearch of most plausible pathsStudy of energy profiles along such pathsExtensions to protein folding[Song and Amato, 01] [Apaydin et al., 01]

Page 13: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

New Idea: New Idea: Capture the stochastic nature of molecular Capture the stochastic nature of molecular motion by assigning probabilities to edgesmotion by assigning probabilities to edges

vi

vj

Pij

Page 14: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Why is this a good idea?Why is this a good idea?

1) We can approximate Monte Carlo simulation as closely as we wish

2) Unlike with MC simulation, we avoid the local-minima problem

3) We can consider all pathways in the roadmap at once to compute ensemble properties

Page 15: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Edge probabilitiesEdge probabilities

Follow Metropolis criteria:

otherwise. ,

1

;0 if ,)/exp(

i

iji

Bij

ij

N

EN

TkE

P

Self-transition probability:

ijijii PP 1

vj

vi

Pij

Pii

Page 16: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Stochastic simulation on roadmap and Monte Carlo simulation converge to same Boltzmann distribution

S

Stochastic Roadmap SimulationStochastic Roadmap Simulation

Pij

Page 17: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Problems with Problems with Monte Carlo SimulationMonte Carlo Simulation

Much time is wasted in local minima Each run generates a single pathway

Page 18: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

SolutionSolution

Pij

Treat roadmap as a Markov chain and use the First-Step Analysis tool

Page 19: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Example #1: Example #1:

Probability of Folding pProbability of Folding pfoldfold

Unfolded set Folded set

pfold1- pfold

“We stress that we do not suggest using pfold as a transition coordinate for practical purposes as it is

very computationally intensive.” Du, Pande, Grosberg, Tanaka, and Shakhnovich “On the Transition

Coordinate for Protein Folding” Journal of Chemical Physics (1998).

HIV integrase[Du et al. ‘98]

Page 20: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Pii

F: Folded setU: Unfolded set

First-Step AnalysisFirst-Step Analysis

Pij

i

k

j

l

m

Pik Pil

Pim

Let fi = pfold(i)After one step: fi = Pii fi + Pij fj + Pik fk + Pil fl + Pim fm

=1 =1

One linear equation per node Solution gives pfold for all nodes

No explicit simulation run All pathways are taken into account Sparse linear system

Page 21: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

In Contrast …In Contrast …

Computing pfold with MC simulation requires:

Performing many MC simulation runs

Counting the number of times F is attained firstfor every conformation of interest:

Page 22: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Computational TestsComputational Tests• 1ROP (repressor of

primer)• 2 helices• 6 DOF

• 1HDD (Engrailed homeodomain)

• 3 helices• 12 DOF

H-P energy model with steric clash exclusion [Sun et al., 95]

Page 23: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

1ROP

Correlation with MC ApproachCorrelation with MC Approach

Page 24: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Correlation with MC ApproachCorrelation with MC Approach

1HDD

Page 25: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Computation Times (1ROP)Computation Times (1ROP)

Monte Carlo:

49 conformations Over 11 days ofcomputer time

Over 106 energy

computations

Roadmap:

5000 conformations1 - 1.5 hours ofcomputer time

~15,000 energycomputations

~4 orders of magnitude speedup!

Page 26: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Example #2: Example #2: Ligand-Protein InteractionLigand-Protein Interaction

Computation of escape time from funnels of attraction around potential binding sites(funnel = ball of 10A rmsd)

Page 27: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Computing Escape Time with Computing Escape Time with RoadmapRoadmap

Funnel of Attraction

ij

kl

m

Pii

Pim

PilPikPij

i = 1 + Pii i + Pij j+ Pik k + Pil l + Pim m

(escape time is measured as number of stepsof stochastic simulation)

= 0

Page 28: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Similar Computation Similar Computation Through Simulation Through Simulation [Sept,

Elcock and McCammon `99]

10K to 30K independent simulations

Page 29: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

ApplicationsApplications

1) Distinguishing catalytic site: Given several potential binding sites, which one is the catalytic site?

Page 30: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Complexes StudiedComplexes Studied

ligand protein # random nodes

# DOFs

oxamate 1ldm 8000 7

Streptavidin 1stp 8000 11

Hydroxylamine 4ts1 8000 9

COT 1cjw 8000 21

THK 1aid 8000 14

IPM 1ao5 8000 10

PTI 3tpi 8000 13

Page 31: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Distinction Based on Distinction Based on EnergyEnergy

Protein Bound state

Best potential binding site

1stp -15.1 -14.6

4ts1 -19.4 -14.6

3tpi -25.2 -16.0

1ldm -11.8 -13.6

1cjw -11.7 -18.0

1aid -11.2 -22.2

1ao5 -7.5 -13.1 (kcal/mol)

Able to distinguish

catalytic site

Not able

Page 32: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Distinction Based on Escape Distinction Based on Escape TimeTime

Protein Bound state

Best potential binding site

1stp 3.4E+9 1.1E+7

4ts1 3.8E+10 1.8E+6

3tpi 1.3E+11 5.9E+5

1ldm 8.1E+5 3.4E+6

1cjw 5.4E+8 4.2E+6

1aid 9.7E+5 1.6E+8

1ao5 6.6E+7 5.7E+6(# steps)

Able to distinguishcatalytic

site

Not able

Page 33: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

ApplicationsApplications

1) Distinguishing catalytic site2) Computational mutagenesis

C

C

OO

O

GLN-101

ARG-106

ASP-195HIS-193

ASP-166

ARG-169

NADH

+

+

+

Loop

Chemical environment of LDH-NADH-substrate complex (pyruvate) (catalyzes conversion of pyruvate to lactate in the presence of NADH

CH3

Some amino acids aredeleted entirely, replaced by other amino acids, or sidechains altered

Page 34: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Binding of Pyruvate to LDHBinding of Pyruvate to LDH

ASP-195HIS-193

ASP-166

ARG-169

+

+

+

THR-245

C

C

OO

O

CH3

NADH

GLN-101

ARG-106Loop

Page 35: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

ResultsResults

C

C

OO

O

GLN-101

ARG-106

ASP-195HIS-193

ASP-166

ARG-169

NADH

+

+

+

Loop

CH3

THR-245

Mutant Escape Time

Change

Wildtype 3.216E6 N/A

Page 36: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

ResultsResults

C

C

OO

O

GLN-101

ALA-106

ASP-195ALA-193

ASP-166

ARG-169

NADH

+

Loop

CH3

Mutant Escape Time

Change

Wildtype 3.216E6 N/A

His193 AlaArg106 Ala

4.126E2

Page 37: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

ResultsResults

Mutant Escape Time Change

Wildtype 3.216E6 N/A

His193 AlaArg106 Ala

4.126E2

His193 Ala 3.381E3

Arg106 Ala 2.550E2

Asp195 Asn 5.221E7

Gln101 Arg 1.669E6 No change

Thr245 Gly 4.607E5

C

C

OO

O

GLN-101

ARG-106

ASP-195HIS-193

ASP-166

ARG-169

NADH

+

+

+

Loop

CH3

GLY-245

Page 38: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

ConclusionConclusion

Probabilistic roadmaps are a promising computational tool for studying ensemble properties of molecular pathwaysCurrent and future work: Better kinetic/energetic models Experimentally verifiable tests Non-uniform sampling strategies Encoding MD simulation

Page 39: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Stochastic simulation on a roadmap and MC simulation converge to the same distribution (Boltzman):For any set S, >0, >0,>0, there exists N such that a roadmap with N milestones has error bounded by:

with probability at least 1- )1)(()(ˆ)1)(( SSS

vs

vg

S

Stochastic Roadmap SimulationStochastic Roadmap Simulation

Page 40: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Ligand-Protein ModelingLigand-Protein Modeling

• DOF = 10 – 3 coordinates to position root atom;– 2 angles to specify first bond;– Angles for all remaining non-terminal atoms;– Bond angles are assumed constant;

• Protein assumed rigid[Singh, Latombe and Brutlag `99]

x,y,z

Page 41: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Energy of InteractionEnergy of Interaction

Ev

Rij

Ec

Rij

Ev = 0.2[(R0/Rij)12 - 2(R0/Rij)

6 ]Ec = 332 QiQj/(Rij)

Energy = van der Waals interaction (Ev)

+ electrostatic interaction (Ec)

Page 42: Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude.

Solvent Effects

• Is only valid for an infinite medium of uniform dielectric;• Dielectric discontinuities result in induced surface

charges;

Solution: Poisson-Boltzman equation

Ec = 332 QiQj/(Rij)

Use Delphi [Rocchia et al `01] Finite Difference solution is based on discretizing

the workspace into a uniform grid.

[(r) . (r)] - (r)k(r)2sinh([(r)] + 4rf(r)/kT = 0