Top Banner
Abcd Docking Pose Assessment: The importance of keeping your GARD up David C. Thompson J. Christian Baber [a] Jason B. Cross [b, c] [a] Wyeth Research, Chemical Sciences, Cambridge, MA [b] Wyeth Research, Chemical Sciences, Collegeville, PA [c] Cubist Pharmaceuticals, Inc. Lexington, MA
20

Docking Pose Assessment: The importance of keeping your GARD up

Nov 29, 2014

Download

Education

David Thompson

Presentation made at a regional Schroedinger UGM in 2009. Describing some of our work on docking pose assessment
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Docking Pose Assessment: The importance of keeping your GARD up

Abcd

Docking Pose Assessment:The importance of keeping your GARD up

David C. Thompson

J. Christian Baber[a]

Jason B. Cross[b, c]

[a] Wyeth Research, Chemical Sciences, Cambridge, MA[b] Wyeth Research, Chemical Sciences, Collegeville, PA[c] Cubist Pharmaceuticals, Inc. Lexington, MA

Page 2: Docking Pose Assessment: The importance of keeping your GARD up

AbcdThe Why

• Large-scale docking evaluation study[1]

— Glide, DOCK6, PhDOCK, SurFlex, FlexX, and ICM— Cognate ligand docking— Virtual Screening

• Project aims:— Assess our computational needs: Right tools for the job?— Assess and revise best practices

[1] J. B. Cross et al., J. Chem. Inf. Model. (In press) The Why

Page 3: Docking Pose Assessment: The importance of keeping your GARD up

Abcd

Measures of Accuracy: RMSD

Top scoringpose

Pose # Score1 -72.02 -56.03 -24.04 -9.00… …

RMSD1.92.31.82.7…

How do we assess a docking program’s ability to regenerate a known binding mode?

• We dock the native ligand back into the protein• We look at the RMSD of the top pose• We look at the best RMSD of all the poses

Best RMSD

The Why

Page 4: Docking Pose Assessment: The importance of keeping your GARD up

AbcdComparing docking programs is difficult …[2]

• RMSD, and statistics derived from RMSD, are used heavily in comparing docking programs

• This is fine as RMSD works a lot of the time, however there are some issues

— Not bounded (how big is too big?)— Large RMSDs can dominate aggregate statistics— RMSD is chemically ambivalent

• We may be losing useful information

[2] J. C. Cole et al., Proteins, 60, 325 (2005) The Why

Page 5: Docking Pose Assessment: The importance of keeping your GARD up

AbcdWhat has come before

• These observations on RMSD are not new• Relative Displacement Error (RDE)[3]

— Statistics compiled using the RDE measure are less dominated by very bad docking poses— Would still miss poses that contain correct binding modes

• Interaction-Based Accuracy Classification (IBAC)[4]

— Would not miss poses that have a correct binding mode— Highly subjective, not easily automated

• Real-space R-factor (RSR)[5]

— Inclusion of experimental information— Un-bounded (how big is too big?)

• All of these methods address some of the issues associated with RMSD, but not in one single measure

• RMSTanimoto[6]

[3] R. A. Abagyan et al., J. Mol. Bio., 268, 678 (1997)[4] R. T. Kroemer et al., J. Chem. Inf. Comput. Sci., 44, 871 (2004)[5] D. Yusuf et al., J. Chem. Inf. Model., 48, 1411 (2008)[6] OpenEye Scientific Software, Santa Fe, NM The Why

Page 6: Docking Pose Assessment: The importance of keeping your GARD up

AbcdThe Why: A Recap

RMSD works a lot of the time, so we need a function that preserves this feature, but that also accounts for those difficult cases where useful information maybe lost

We would also like:• To avoid the skewing problem associated with large RMSDs• To have an objective measure• An element of chemical awareness

The Why

Page 7: Docking Pose Assessment: The importance of keeping your GARD up

AbcdThe How

• A Generally Applicable Replacement for RMSD: GARD[7]

• GARD is a metric for analyzing docking poses

• It is bounded on [0,1] to remove arbitrary cutoffs which distortaverage measures

• It is based on an analysis performed by P. R. Andrews et al. [8]*

— Regression analysis of the binding constants and structural components of 200 drugs and enzyme inhibitors

• Automated, and no more expensive than RMSD

[7] Submitted, J. Chem. Inf. Model.[8] P. R. Andrews et al., J. Med. Chem., 27, 1648 1984* Yes, we know that this is an old study . . . The How

Page 8: Docking Pose Assessment: The importance of keeping your GARD up

AbcdGARD: The Algorithm

• For each atom compute an RMSD (di)• Use Andrews weight corresponding to the

atom type (wi)• Define a ‘good’ and ‘bad’ RMSD: dmin and

dmax

— dmin = 1Å— dmax = 2.5Å

Reference structure (cyan); Docking pose (tan)

∑∑

=

ii

iii

w

wδGARD

RMSD = 1.38ÅGARD = 0.90

The How

⎪⎪⎩

⎪⎪⎨

−−

=

0

)(1

minmax

min

ddddi

max

maxmin

min

ddddd

dd

i

i

i

≥≤≤

Atomic RMSD = 3.68Å

Page 9: Docking Pose Assessment: The importance of keeping your GARD up

AbcdGARD: Worked Example

di ATOM TYPE wi δiwi0.28 C (sp3) 0.8 0.8

0.48 C (sp3) 0.8 0.8

0.69 N 1.2 1.2

0.60 C (sp3) 0.8 0.8

0.36 C (sp3) 0.8 0.8

0.96 C (sp2) 0.7 0.7

0.96 N 1.2 1.2

3.68 C (sp3) 0.8 0

0.60 C (sp3) 0.8 0.8

SUM 7.9 7.1

GARD = 7.1/7.9 = 0.90

Reference structure (cyan); Docking pose (tan)

RMSD = 1.38ÅGARD = 0.90

The How

Page 10: Docking Pose Assessment: The importance of keeping your GARD up

AbcdComparing docking programs is difficult … but we do it anyway

“Cognate ligand docking to 68 diverse, high-resolution x-ray complexes revealed that ICM, GLIDE, and Surflex generated ligand poses close to the X-ray conformation more often than the other docking programs. GLIDE and Surflex also outperformed the other docking programs when used for virtual screening, based on mean ROC AUC and ROC enrichment . . .[1]”

Protocol:1. Initial ligand coordinates used as input for the docking were generated using

CORINA[9]

2. The 10 top scoring poses (or fewer, depending on the specific output for a particular X-ray complex/docking program combination) were retained for analysis

3. These poses were then evaluated using both the GARD and RMSD measures

[1] J. B. Cross et al., J. Chem. Inf. Model. (In press)[9] CORINA v1.82, Molecular Networks GmbH: Erlangen, Germany, 1997 The What

Page 11: Docking Pose Assessment: The importance of keeping your GARD up

AbcdThe What

y = -7.3x + 7.2R2 = 0.59

0

5

10

15

20

25

30

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

GARD

RM

SD

Correlation between GARD scores and RMSD across the top 10 poses of compounds from 68 different targets and 6 docking methods (4725 points) The What

Page 12: Docking Pose Assessment: The importance of keeping your GARD up

AbcdThe What: Some Specific Examples

R2 = 0.53

0

0.5

1

1.5

2

2.5

3

3.5

4

4.5

5

0.75 0.8 0.85 0.9 0.95 1

GARD

RMSD

1GLQRMSD = 4.44Å

GARD = 0.77

Correlation between GARD scores and RMSD for those poses with a GARD score of at least 0.75 across the top 10 poses of compoundsfrom 68 different targets and 6 docking methods (1469 points)

1A4QRMSD = 4.90Å

GARD = 0.78

The What

Page 13: Docking Pose Assessment: The importance of keeping your GARD up

Abcd1A4Q: Neuraminidase with dihydropyran-phenethyl-propy-carboxamide inhibitor (1.90Å)

1A4QRMSD = 4.90ÅGARD = 0.78

SurFlex Ringflex docking pose (green wire)X-tal (grey tube) The What

Page 14: Docking Pose Assessment: The importance of keeping your GARD up

Abcd1GLQ: Glutathione-S-transferase with p-nitrobenzyl

glutathione (1.80Å)

1GLQRMSD = 4.44ÅGARD = 0.77

ICM docking pose (green wire)X-tal (grey tube)

The What

Page 15: Docking Pose Assessment: The importance of keeping your GARD up

Abcd

Top ScoringGARD=0.75 / RMSD=2.35

GLIDE SP 4.5 (1/30)

Best RMSDGARD=0.63 / RMSD=1.89

GLIDE SP 4.5 (10/30)

Crystal Structure

1HPX: HIV Protease with KNI-272 inhibitor (2.00 Å)*

1

2

3 4

2

1

3 4

*Additional example, not in the original docking evaluation data set The What

Page 16: Docking Pose Assessment: The importance of keeping your GARD up

AbcdGPCR Model Validation: GLIDE SP 5.0

3

3.5

4

4.5

5

5.5

6

6.5

7

0 0.2 0.4 0.6

GARD

RMSD

25 poses, post-minimization

Pose # 24RMSD = 3.69ÅGARD = 0.48

Evaluate GPCR model’s ability to reproduce known crystallographic binding mode

β2 adrenergic receptor (2RH1)X-tal (green)

GLIDE pose (yellow)

The What

Page 17: Docking Pose Assessment: The importance of keeping your GARD up

AbcdGPCR Model Validation: IFD[9]

IFD, default parameters, Pose #1RMSD = 1.85ÅGARD = 0.65

The What

β2 adrenergic receptor (2RH1)X-tal ligand (cyan); model protein (cyan)

IFD pose (tan); IFD protein (tan)

[9] Schrödinger Suite 2008, Induced Fit Docking protocol; Glide version 5.0, Schrödinger, LLC, New York, NY, 2008; Prime version2.0, Schrödinger, LLC, New York, NY, 2008

Page 18: Docking Pose Assessment: The importance of keeping your GARD up

AbcdConcluding remarks

• RMSD is a good measure most of the time, although it has known drawbacks which can result in the discarding of useful information

• A Generally Applicable Replacement to RMSD (GARD) has been proposed which overcomes most of the drawbacks of RMSD, whilst preserving it’s strengths. This measure is:— Normalized— ‘Chemically aware’— Automated / objective

• Illustrated GARD utility showing specific examples from a large scale docking evaluation exercise, and examples from the Protein Data Bank

• Future application: Use with RMSD to triage docking results for protein model evaluation— Of particular utility when considering multiple models, and tens/hundreds of

docking poses

Page 19: Docking Pose Assessment: The importance of keeping your GARD up

AbcdCultural highlight

• Ethnographic examination of ‘simulators’— Crystallographers— Architects— Oceanographers

• “All models are wrong, but some models are useful” – G. E. P. Box

• “If exactitude is elusive, it is better to be approximately right than certifiably wrong” – B. B. Mandelbrot

Simulation and its discontents, Sherry Turkle, Cambridge, MA: MIT Press (2009)

Page 20: Docking Pose Assessment: The importance of keeping your GARD up

AbcdAcknowledgments

• Boehringer Ingelheim— Dr. Ingo Mügge— Dr. Sandy Farmer

• Wyeth Research— The Docking Evaluation Team

(Dr. YongBo Hu, Dr. Kristi Yi Fan and Dr. Brajesh K. Rai*)— Dr. Jack A. Bikker— Dr. Christine Humblet

* Pfizer Global Research and Development, Groton, CT