Abcd Docking Pose Assessment: The importance of keeping your GARD up David C. Thompson J. Christian Baber [a] Jason B. Cross [b, c] [a] Wyeth Research, Chemical Sciences, Cambridge, MA [b] Wyeth Research, Chemical Sciences, Collegeville, PA [c] Cubist Pharmaceuticals, Inc. Lexington, MA
20
Embed
Docking Pose Assessment: The importance of keeping your GARD up
Presentation made at a regional Schroedinger UGM in 2009. Describing some of our work on docking pose assessment
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Abcd
Docking Pose Assessment:The importance of keeping your GARD up
David C. Thompson
J. Christian Baber[a]
Jason B. Cross[b, c]
[a] Wyeth Research, Chemical Sciences, Cambridge, MA[b] Wyeth Research, Chemical Sciences, Collegeville, PA[c] Cubist Pharmaceuticals, Inc. Lexington, MA
— Would not miss poses that have a correct binding mode— Highly subjective, not easily automated
• Real-space R-factor (RSR)[5]
— Inclusion of experimental information— Un-bounded (how big is too big?)
• All of these methods address some of the issues associated with RMSD, but not in one single measure
• RMSTanimoto[6]
[3] R. A. Abagyan et al., J. Mol. Bio., 268, 678 (1997)[4] R. T. Kroemer et al., J. Chem. Inf. Comput. Sci., 44, 871 (2004)[5] D. Yusuf et al., J. Chem. Inf. Model., 48, 1411 (2008)[6] OpenEye Scientific Software, Santa Fe, NM The Why
AbcdThe Why: A Recap
RMSD works a lot of the time, so we need a function that preserves this feature, but that also accounts for those difficult cases where useful information maybe lost
We would also like:• To avoid the skewing problem associated with large RMSDs• To have an objective measure• An element of chemical awareness
The Why
AbcdThe How
• A Generally Applicable Replacement for RMSD: GARD[7]
• GARD is a metric for analyzing docking poses
• It is bounded on [0,1] to remove arbitrary cutoffs which distortaverage measures
• It is based on an analysis performed by P. R. Andrews et al. [8]*
— Regression analysis of the binding constants and structural components of 200 drugs and enzyme inhibitors
• Automated, and no more expensive than RMSD
[7] Submitted, J. Chem. Inf. Model.[8] P. R. Andrews et al., J. Med. Chem., 27, 1648 1984* Yes, we know that this is an old study . . . The How
AbcdGARD: The Algorithm
• For each atom compute an RMSD (di)• Use Andrews weight corresponding to the
atom type (wi)• Define a ‘good’ and ‘bad’ RMSD: dmin and
dmax
— dmin = 1Å— dmax = 2.5Å
Reference structure (cyan); Docking pose (tan)
∑∑
=
ii
iii
w
wδGARD
RMSD = 1.38ÅGARD = 0.90
The How
⎪⎪⎩
⎪⎪⎨
⎧
−−
=
0
)(1
minmax
min
ddddi
iδ
max
maxmin
min
ddddd
dd
i
i
i
≥≤≤
≤
Atomic RMSD = 3.68Å
AbcdGARD: Worked Example
di ATOM TYPE wi δiwi0.28 C (sp3) 0.8 0.8
0.48 C (sp3) 0.8 0.8
0.69 N 1.2 1.2
0.60 C (sp3) 0.8 0.8
0.36 C (sp3) 0.8 0.8
0.96 C (sp2) 0.7 0.7
0.96 N 1.2 1.2
3.68 C (sp3) 0.8 0
0.60 C (sp3) 0.8 0.8
SUM 7.9 7.1
GARD = 7.1/7.9 = 0.90
Reference structure (cyan); Docking pose (tan)
RMSD = 1.38ÅGARD = 0.90
The How
AbcdComparing docking programs is difficult … but we do it anyway
“Cognate ligand docking to 68 diverse, high-resolution x-ray complexes revealed that ICM, GLIDE, and Surflex generated ligand poses close to the X-ray conformation more often than the other docking programs. GLIDE and Surflex also outperformed the other docking programs when used for virtual screening, based on mean ROC AUC and ROC enrichment . . .[1]”
Protocol:1. Initial ligand coordinates used as input for the docking were generated using
CORINA[9]
2. The 10 top scoring poses (or fewer, depending on the specific output for a particular X-ray complex/docking program combination) were retained for analysis
3. These poses were then evaluated using both the GARD and RMSD measures
[1] J. B. Cross et al., J. Chem. Inf. Model. (In press)[9] CORINA v1.82, Molecular Networks GmbH: Erlangen, Germany, 1997 The What
AbcdThe What
y = -7.3x + 7.2R2 = 0.59
0
5
10
15
20
25
30
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
GARD
RM
SD
Correlation between GARD scores and RMSD across the top 10 poses of compounds from 68 different targets and 6 docking methods (4725 points) The What
AbcdThe What: Some Specific Examples
R2 = 0.53
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
5
0.75 0.8 0.85 0.9 0.95 1
GARD
RMSD
1GLQRMSD = 4.44Å
GARD = 0.77
Correlation between GARD scores and RMSD for those poses with a GARD score of at least 0.75 across the top 10 poses of compoundsfrom 68 different targets and 6 docking methods (1469 points)
1A4QRMSD = 4.90Å
GARD = 0.78
The What
Abcd1A4Q: Neuraminidase with dihydropyran-phenethyl-propy-carboxamide inhibitor (1.90Å)
1A4QRMSD = 4.90ÅGARD = 0.78
SurFlex Ringflex docking pose (green wire)X-tal (grey tube) The What
Abcd1GLQ: Glutathione-S-transferase with p-nitrobenzyl
glutathione (1.80Å)
1GLQRMSD = 4.44ÅGARD = 0.77
ICM docking pose (green wire)X-tal (grey tube)
The What
Abcd
Top ScoringGARD=0.75 / RMSD=2.35
GLIDE SP 4.5 (1/30)
Best RMSDGARD=0.63 / RMSD=1.89
GLIDE SP 4.5 (10/30)
Crystal Structure
1HPX: HIV Protease with KNI-272 inhibitor (2.00 Å)*
1
2
3 4
2
1
3 4
*Additional example, not in the original docking evaluation data set The What
AbcdGPCR Model Validation: GLIDE SP 5.0
3
3.5
4
4.5
5
5.5
6
6.5
7
0 0.2 0.4 0.6
GARD
RMSD
25 poses, post-minimization
Pose # 24RMSD = 3.69ÅGARD = 0.48
Evaluate GPCR model’s ability to reproduce known crystallographic binding mode
β2 adrenergic receptor (2RH1)X-tal ligand (cyan); model protein (cyan)
IFD pose (tan); IFD protein (tan)
[9] Schrödinger Suite 2008, Induced Fit Docking protocol; Glide version 5.0, Schrödinger, LLC, New York, NY, 2008; Prime version2.0, Schrödinger, LLC, New York, NY, 2008
AbcdConcluding remarks
• RMSD is a good measure most of the time, although it has known drawbacks which can result in the discarding of useful information
• A Generally Applicable Replacement to RMSD (GARD) has been proposed which overcomes most of the drawbacks of RMSD, whilst preserving it’s strengths. This measure is:— Normalized— ‘Chemically aware’— Automated / objective
• Illustrated GARD utility showing specific examples from a large scale docking evaluation exercise, and examples from the Protein Data Bank
• Future application: Use with RMSD to triage docking results for protein model evaluation— Of particular utility when considering multiple models, and tens/hundreds of
docking poses
AbcdCultural highlight
• Ethnographic examination of ‘simulators’— Crystallographers— Architects— Oceanographers
• “All models are wrong, but some models are useful” – G. E. P. Box
• “If exactitude is elusive, it is better to be approximately right than certifiably wrong” – B. B. Mandelbrot
Simulation and its discontents, Sherry Turkle, Cambridge, MA: MIT Press (2009)
AbcdAcknowledgments
• Boehringer Ingelheim— Dr. Ingo Mügge— Dr. Sandy Farmer
• Wyeth Research— The Docking Evaluation Team
(Dr. YongBo Hu, Dr. Kristi Yi Fan and Dr. Brajesh K. Rai*)— Dr. Jack A. Bikker— Dr. Christine Humblet
* Pfizer Global Research and Development, Groton, CT