Top Banner
SAMPL6 pK a Challenge: Predictions of ionization constants performed by the S+pKa method implemented in ADMET Predictor™ software Robert Fraczkiewicz and Marvin Waldman* Simulations Plus, Inc. February 2018
12

SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

May 11, 2018

Download

Documents

phambao
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa method

implemented in ADMET Predictor™ software

Robert Fraczkiewicz and Marvin Waldman*

Simulations Plus, Inc.

February 2018

Page 2: SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

Ionizable atom in a microstate

2D Atomic Descriptors for ionizable atom in its molecular environment

Predicted micro pka

QSPR

Simplified overview of pKa modeling

page 2 • Webinar

N

N

Cl

OO

O

N

NH

+

Cl

OO

O

optimal subset from~130 atomic descriptors

Page 3: SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

▪ 10 Artificial Neural Network Ensembles (ANNE);

one ANNE for each of the following 10 classes of ionizable atoms:

▪ (1) Hydroxyacids

▪ (2) Acidic amides

▪ (3) Acids of aromatic NH

▪ (4) Thioacids

▪ (5) Carboacids

▪ (6) Amines

▪ (7) Bases of aromatic N

▪ (8) N-oxides

▪ (9) Thiones

▪ (10) Carbobases (protonatable C in some π–excessive rings)

▪ ANNEs use localized atomic descriptors as inputs

▪ ANNEs predict ionization microconstants (micro pKa)

▪ Macroconstants calculated with microequilibria theory

page 3 • Webinar

The main factor determining an atom’s ionization is its type, followed by its local molecular environment

N

N

Cl

OOH

O

The predictive model, S+pKa

Fraczkiewicz, R.; Lobell, M.; Göller, A. H.; Krenz, U.; Schoenneis, R.; Clark, R. D.; Hillisch, A. “Best of Both Worlds: Combining Pharma Data and State of the Art Modeling Technology To Improve in Silico pKa Prediction.” Journal of Chemical Information and Modeling 2014, 55, 389-397.

Page 4: SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

Details of data split:Training pool: 25509 pKa values

• 10594 from public sources• 14915 from Bayer Pharma

Internal test set: 8131 pKa values• 3582 from public sources• 4549 from Bayer Pharma

Model training and initial testing with internal test set*

Internal test set prediction statistics:

MAE = 0.34

RMSE = 0.48

R2 = 0.97

*Internal test set compounds have not been used for model training but have been used to select the 10 ANNEs to appear in the final model

page 4 • Webinar

Page 5: SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

Atomic Partial Charge Descriptors

5

R² = 0.9863RMSE(train) = 0.048

Ntrain = 16332RMSE(test) = 0.049

Ntest = 2867

-1.5

-0.5

0.5

1.5

2.5

-1.5 -0.5 0.5 1.5 2.5

Ab

in

itio

EEM-Hückel Model

Total Partial Charge

Train

Test

Linear (Train)

NIH SBIR Grants:1R43CA130388-12R44CA130388-02A1

Page 6: SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

SAMPL6 pKa challenge results

RMSE = 0.73MAE = 0.579ME = -0.009R2 = 0.925Slope = 0.929

Computation time (wall clock) for all pKa in all 24 compounds:

Under 2 seconds*

* This includes 145 other properties and ~400 molecular descriptors

Page 7: SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

Deviations above 1 log unit

+1.95

-1.02

+1.22

-1.03

Page 8: SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

Deviations above 1 log unit

-1.18

+1.07

Page 9: SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

SM03 prediction

Experimental pKa = 7.02UV-metric psKa, with Yasuda-Shedlovskyextrapolation

Page 10: SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

SM03 prediction- Training set analogs

All analogs in training set contain sulfonamide group and have reported pKa’s ranging from 4.8-9.3

Example:First acidic dissociationIs a mixture of twoMicrostates.

Page 11: SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

SM10 prediction

Experimental pKa = 9.02UV-metric psKa, with Yasuda-Shedlovskyextrapolation

Page 12: SAMPL6 pKa Challenge: Predictions of ionization constants performed by the S+pKa …pKa_in_SAM… ·  · 2018-02-22ombining Pharma Data and State of the Art Modeling Technology To

SM21 prediction

Experimental pKa = 4.1UV-metric psKa, with Yasuda-Shedlovskyextrapolation