Top Banner
Two Parameter Persistence for Virtual Ligand Screening (joint work with Michael Lesnick and Ted Willke) Bryn Keller Intel Labs Institute for Mathematics and its Applications, University of Minnesota 15 Aug 2018
41

Two Parameter Persistence for Virtual Ligand Screening

Jan 28, 2022

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Two Parameter Persistence for Virtual Ligand Screening

Two Parameter Persistence for Virtual Ligand Screening

(joint work with Michael Lesnick and Ted Willke)

Bryn KellerIntel Labs

Institute for Mathematics and its Applications,University of Minnesota

15 Aug 2018

Page 2: Two Parameter Persistence for Virtual Ligand Screening

Drug Discovery

Page 3: Two Parameter Persistence for Virtual Ligand Screening

The problem

• Drug companies have massive databases of chemicals

• They don’t know what most of them do. Some are good (future) drugs. Some are poison, or just useless. Almost all of them have never been studied

• They want a way to take a known good drug and find other things similar to it in their databases, so they can see if those would be good drugs too

!3

Page 4: Two Parameter Persistence for Virtual Ligand Screening

Drug Development Pipeline

Pharmaceutical Research and Manufacturers of America (2016)

It costs about $2.6 billion to develop a single drug!

!4

Page 5: Two Parameter Persistence for Virtual Ligand Screening

�5

Some medicinal chemistry vocabularyVirtual Screening is the process of finding candidate drugs using a computer

Ligands (drugs and other substances) fit into binding pockets (aka targets) in proteins like a key in a lock

Two approaches:• Structure-based: use the lock to find a key

that fits• Ligand-based: use the key to find other keys

that might fit the same lock

Our task is to create a superior ligand-based method

Ligand in a binding pocket

https://en.wikipedia.org/wiki/Docking_(molecular)

Page 6: Two Parameter Persistence for Virtual Ligand Screening

�6

An aside: molecules have complicated shapes• Compounds have multiple conformations that have same

atoms, but are rotated or twisted differently

• How to account for these rotational conformations… without having to store them all in the database?

• How to match them as if they were different compounds?

• These rotations matter – up to 100x difference in drug effectiveness!’

• We’re not even talking about other forms of (structural) isomerism, those kinds of isomers are stored as different molecules.

Even simple compounds have multiple conformations

Page 7: Two Parameter Persistence for Virtual Ligand Screening

�7

Persistent Homology for virtual Screening (PHoS)

• Essence of the problem is nearest neighbor search

• Given a target substance, return the top N best matches for that in the database

• Use 2-parameter persistent homology to capture essential features of the 3D shape of molecules, and generate molecular signatures

• Store these signatures in a database

• Use smart metric data structures to minimize number of comparisons

• Parallelize and distribute!

Page 8: Two Parameter Persistence for Virtual Ligand Screening

Molecular Signatures

Page 9: Two Parameter Persistence for Virtual Ligand Screening

Idea

• Use 2-parameter persistence to capture three persistence modules (H0, H1, H2) for each molecule

• First parameter: Euclidean distance between atoms

• Second parameter: Some kind of chemical property of atoms, e.g. partial charge, mass, hybridization, aromaticity, etc.

!9

Page 10: Two Parameter Persistence for Virtual Ligand Screening

1-parameter persistence on Euclidean distance

�10

0 15 32 45 50 51 53 54 56 57 58 60

Page 11: Two Parameter Persistence for Virtual Ligand Screening

Two parameters

!11

Page 12: Two Parameter Persistence for Virtual Ligand Screening

Choice of parameters matters�12

Page 13: Two Parameter Persistence for Virtual Ligand Screening

RIVET• Rank Invariant Visualization and Exploration Tool

• Tool for calculating 2-parameter persistence modules from data and visualizing them

• Invented by Mike Lesnick & Matthew Wright in about 2013, with help for the last couple of years from me and a growing number of contributors

• Paper: Lesnick, M., & Wright, M. (2015). Interactive Visualization of 2-D Persistence Modules. Preprint ArXiv, 1–75. https://arxiv.org/abs/1512.00180

• Get it at http://rivet.online

• Python API available at https://github.com/rivettda/rivet-python

!13

Page 14: Two Parameter Persistence for Virtual Ligand Screening

RIVE

T

!14

Page 15: Two Parameter Persistence for Virtual Ligand Screening

RIVET in action

Page 16: Two Parameter Persistence for Virtual Ligand Screening

2 views of aspirin

How barcodes vary as we vary distance How barcodes vary as we vary partial charge

!16

Page 17: Two Parameter Persistence for Virtual Ligand Screening

Distances

Page 18: Two Parameter Persistence for Virtual Ligand Screening

2 notions of distance

• (Approximate) matching distance - more accurate, much more expensive

• L2 distance on the restricted Hilbert function - fairly accurate, much faster

• In both cases, we take the total distance between molecules A and B to be the sum of the distances between the ith persistence modules of A and B, i = 0,1,2

!18

Page 19: Two Parameter Persistence for Virtual Ligand Screening

Matching distance• Each choice of angle and offset

produces a (potentially) different barcode.

•We call these the fibered barcodes

•We want to compare 2-parameter persistence modules by considering (a subset of) all possible fibered barcodes

Page 20: Two Parameter Persistence for Virtual Ligand Screening

Matching distance

• B, C are 2-parameter persistence modules

• L ranges over affine lines of positive slope

• B(L) is the fibered barcode of B along L

• wL is a weight that depends only on the slope of L

• db is the bottleneck distance

!20

Page 21: Two Parameter Persistence for Virtual Ligand Screening

Hilbert Function

HilM (a) = dimMa<latexit sha1_base64="+T9bgmahpwPjxpeNvPtMzaxcaxk=">AAACD3icbVDLSgNBEJyNrxhfqx69DAYlXsKuCHoRgl5yCUQwD0jCMjuZJENmdpeZXjEs+wde/BUvHhTx6tWbf+PkIWhiQUNR1U13lx8JrsFxvqzM0vLK6lp2PbexubW9Y+/u1XUYK8pqNBShavpEM8EDVgMOgjUjxYj0BWv4w+ux37hjSvMwuIVRxDqS9APe45SAkTz7uA3sHpRMylykXlJJC+QEX+Iftctliise8ey8U3QmwIvEnZE8mqHq2Z/tbkhjyQKggmjdcp0IOglRwKlgaa4daxYROiR91jI0IJLpTjL5J8VHRuniXqhMBYAn6u+JhEitR9I3nZLAQM97Y/E/rxVD76KT8CCKgQV0uqgXCwwhHoeDu1wxCmJkCKGKm1sxHRBFKJgIcyYEd/7lRVI/LbpO0b05y5euZnFk0QE6RAXkonNUQmVURTVE0QN6Qi/o1Xq0nq03633amrFmM/voD6yPb7kLnG4=</latexit><latexit sha1_base64="+T9bgmahpwPjxpeNvPtMzaxcaxk=">AAACD3icbVDLSgNBEJyNrxhfqx69DAYlXsKuCHoRgl5yCUQwD0jCMjuZJENmdpeZXjEs+wde/BUvHhTx6tWbf+PkIWhiQUNR1U13lx8JrsFxvqzM0vLK6lp2PbexubW9Y+/u1XUYK8pqNBShavpEM8EDVgMOgjUjxYj0BWv4w+ux37hjSvMwuIVRxDqS9APe45SAkTz7uA3sHpRMylykXlJJC+QEX+Iftctliise8ey8U3QmwIvEnZE8mqHq2Z/tbkhjyQKggmjdcp0IOglRwKlgaa4daxYROiR91jI0IJLpTjL5J8VHRuniXqhMBYAn6u+JhEitR9I3nZLAQM97Y/E/rxVD76KT8CCKgQV0uqgXCwwhHoeDu1wxCmJkCKGKm1sxHRBFKJgIcyYEd/7lRVI/LbpO0b05y5euZnFk0QE6RAXkonNUQmVURTVE0QN6Qi/o1Xq0nq03633amrFmM/voD6yPb7kLnG4=</latexit><latexit sha1_base64="+T9bgmahpwPjxpeNvPtMzaxcaxk=">AAACD3icbVDLSgNBEJyNrxhfqx69DAYlXsKuCHoRgl5yCUQwD0jCMjuZJENmdpeZXjEs+wde/BUvHhTx6tWbf+PkIWhiQUNR1U13lx8JrsFxvqzM0vLK6lp2PbexubW9Y+/u1XUYK8pqNBShavpEM8EDVgMOgjUjxYj0BWv4w+ux37hjSvMwuIVRxDqS9APe45SAkTz7uA3sHpRMylykXlJJC+QEX+Iftctliise8ey8U3QmwIvEnZE8mqHq2Z/tbkhjyQKggmjdcp0IOglRwKlgaa4daxYROiR91jI0IJLpTjL5J8VHRuniXqhMBYAn6u+JhEitR9I3nZLAQM97Y/E/rxVD76KT8CCKgQV0uqgXCwwhHoeDu1wxCmJkCKGKm1sxHRBFKJgIcyYEd/7lRVI/LbpO0b05y5euZnFk0QE6RAXkonNUQmVURTVE0QN6Qi/o1Xq0nq03633amrFmM/voD6yPb7kLnG4=</latexit><latexit sha1_base64="+T9bgmahpwPjxpeNvPtMzaxcaxk=">AAACD3icbVDLSgNBEJyNrxhfqx69DAYlXsKuCHoRgl5yCUQwD0jCMjuZJENmdpeZXjEs+wde/BUvHhTx6tWbf+PkIWhiQUNR1U13lx8JrsFxvqzM0vLK6lp2PbexubW9Y+/u1XUYK8pqNBShavpEM8EDVgMOgjUjxYj0BWv4w+ux37hjSvMwuIVRxDqS9APe45SAkTz7uA3sHpRMylykXlJJC+QEX+Iftctliise8ey8U3QmwIvEnZE8mqHq2Z/tbkhjyQKggmjdcp0IOglRwKlgaa4daxYROiR91jI0IJLpTjL5J8VHRuniXqhMBYAn6u+JhEitR9I3nZLAQM97Y/E/rxVD76KT8CCKgQV0uqgXCwwhHoeDu1wxCmJkCKGKm1sxHRBFKJgIcyYEd/7lRVI/LbpO0b05y5euZnFk0QE6RAXkonNUQmVURTVE0QN6Qi/o1Xq0nq03633amrFmM/voD6yPb7kLnG4=</latexit>

For M a persistence module, and a a bigrade

Page 22: Two Parameter Persistence for Virtual Ligand Screening

Hilbert function visualization with RIVET

Aspirin Tylenol Doxorubicin!22

Page 23: Two Parameter Persistence for Virtual Ligand Screening

Restricted Hilbert Function

!23

• Informally, the restricted Hilbert function is the Hilbert function within the bounds shown in RIVET, and 0 elsewhere.

• Formally, for i >= 0, let Mi denote the ith module in a minimal free resolution of M, and let R(M) be the minimal rectangle containing all bigrades of elements in bases for M0 and M1. Then:

RHilM (a) :=

(HilM (a) for a 2 R(M),

0 otherwise.<latexit sha1_base64="9lsXxpJ3Lw/nLigYmNuEvL6wROY=">AAACYXicbVFdSxtBFJ3dajWpH1v76MtgbIlQwm4pWAqFoC++CCqNCtkQZid3k8HZmWXmrhqW/ZN986Uv/hEnyQrx4z4dzjn3Y84kuRQWw/DB8z+srH5cW280P21sbm0Hn3curS4Mhx7XUpvrhFmQQkEPBUq4zg2wLJFwldwcz/SrWzBWaPUXpzkMMjZWIhWcoaOGwX2McI8mKy9OhKyG5WnVZge//8QJjIUquZtsK/rsWbLQbwu2yMtUG7rPYqHoRfv0YP97Fcc0XNI1TsDcCQudKgY1qocOg1bYCedF34KoBi1S19kw+BePNC8yUMgls7YfhTkOSmZQcAlVMy4s5IzfsDH0HVQsAzso5wlV9KtjRnR2aaoV0jm73FGyzNppljhnxnBiX2sz8j2tX2D6a1AKlRcIii8WpYWkqOksbjoSBjjKqQOMG+FupXzCDOPoPqXpQoheP/ktuPzRicJOdP6z1T2q41gnu2SPtElEDkmXnJAz0iOc/PdWvE1vy3v0G37g7yysvlf3fCEvyt99AlBFtno=</latexit><latexit sha1_base64="9lsXxpJ3Lw/nLigYmNuEvL6wROY=">AAACYXicbVFdSxtBFJ3dajWpH1v76MtgbIlQwm4pWAqFoC++CCqNCtkQZid3k8HZmWXmrhqW/ZN986Uv/hEnyQrx4z4dzjn3Y84kuRQWw/DB8z+srH5cW280P21sbm0Hn3curS4Mhx7XUpvrhFmQQkEPBUq4zg2wLJFwldwcz/SrWzBWaPUXpzkMMjZWIhWcoaOGwX2McI8mKy9OhKyG5WnVZge//8QJjIUquZtsK/rsWbLQbwu2yMtUG7rPYqHoRfv0YP97Fcc0XNI1TsDcCQudKgY1qocOg1bYCedF34KoBi1S19kw+BePNC8yUMgls7YfhTkOSmZQcAlVMy4s5IzfsDH0HVQsAzso5wlV9KtjRnR2aaoV0jm73FGyzNppljhnxnBiX2sz8j2tX2D6a1AKlRcIii8WpYWkqOksbjoSBjjKqQOMG+FupXzCDOPoPqXpQoheP/ktuPzRicJOdP6z1T2q41gnu2SPtElEDkmXnJAz0iOc/PdWvE1vy3v0G37g7yysvlf3fCEvyt99AlBFtno=</latexit><latexit sha1_base64="9lsXxpJ3Lw/nLigYmNuEvL6wROY=">AAACYXicbVFdSxtBFJ3dajWpH1v76MtgbIlQwm4pWAqFoC++CCqNCtkQZid3k8HZmWXmrhqW/ZN986Uv/hEnyQrx4z4dzjn3Y84kuRQWw/DB8z+srH5cW280P21sbm0Hn3curS4Mhx7XUpvrhFmQQkEPBUq4zg2wLJFwldwcz/SrWzBWaPUXpzkMMjZWIhWcoaOGwX2McI8mKy9OhKyG5WnVZge//8QJjIUquZtsK/rsWbLQbwu2yMtUG7rPYqHoRfv0YP97Fcc0XNI1TsDcCQudKgY1qocOg1bYCedF34KoBi1S19kw+BePNC8yUMgls7YfhTkOSmZQcAlVMy4s5IzfsDH0HVQsAzso5wlV9KtjRnR2aaoV0jm73FGyzNppljhnxnBiX2sz8j2tX2D6a1AKlRcIii8WpYWkqOksbjoSBjjKqQOMG+FupXzCDOPoPqXpQoheP/ktuPzRicJOdP6z1T2q41gnu2SPtElEDkmXnJAz0iOc/PdWvE1vy3v0G37g7yysvlf3fCEvyt99AlBFtno=</latexit><latexit sha1_base64="9lsXxpJ3Lw/nLigYmNuEvL6wROY=">AAACYXicbVFdSxtBFJ3dajWpH1v76MtgbIlQwm4pWAqFoC++CCqNCtkQZid3k8HZmWXmrhqW/ZN986Uv/hEnyQrx4z4dzjn3Y84kuRQWw/DB8z+srH5cW280P21sbm0Hn3curS4Mhx7XUpvrhFmQQkEPBUq4zg2wLJFwldwcz/SrWzBWaPUXpzkMMjZWIhWcoaOGwX2McI8mKy9OhKyG5WnVZge//8QJjIUquZtsK/rsWbLQbwu2yMtUG7rPYqHoRfv0YP97Fcc0XNI1TsDcCQudKgY1qocOg1bYCedF34KoBi1S19kw+BePNC8yUMgls7YfhTkOSmZQcAlVMy4s5IzfsDH0HVQsAzso5wlV9KtjRnR2aaoV0jm73FGyzNppljhnxnBiX2sz8j2tX2D6a1AKlRcIii8WpYWkqOksbjoSBjjKqQOMG+FupXzCDOPoPqXpQoheP/ktuPzRicJOdP6z1T2q41gnu2SPtElEDkmXnJAz0iOc/PdWvE1vy3v0G37g7yysvlf3fCEvyt99AlBFtno=</latexit>

Page 24: Two Parameter Persistence for Virtual Ligand Screening

L2 Distance on Hilbert Functions

Page 25: Two Parameter Persistence for Virtual Ligand Screening

L2 distance on restricted Hilbert functions (discretely, concretely)

,

=

d2( )pP

00·2<latexit sha1_base64="78qz6XYtiYfgamMf+LS48SPoq9E=">AAACAXicbVC7SgNBFL0bXzG+1tgINoNBsQq7abQM2lhGMQ/IxjA7mSRDZh/O3BXCEht/xcZCEVv/ws7OT3HyKDTxwIXDOfdy7z1+LIVGx/myMkvLK6tr2fXcxubW9o69m6/pKFGMV1kkI9XwqeZShLyKAiVvxIrTwJe87g8uxn79nistovAGhzFvBbQXiq5gFI3Utvc9facw9XQSEMe5TT3WiZCURqO2XXCKzgRkkbgzUigfX+e/AaDStj+9TsSSgIfIJNW66ToxtlKqUDDJRzkv0TymbEB7vGloSAOuW+nkgxE5MkqHdCNlKkQyUX9PpDTQehj4pjOg2Nfz3lj8z2sm2D1rpSKME+Qhmy7qJpJgRMZxkI5QnKEcGkKZEuZWwvpUUYYmtJwJwZ1/eZHUSkXXKbpXbqF8DlNk4QAO4QRcOIUyXEIFqsDgAZ7gBV6tR+vZerPep60ZazazB39gffwARK2YPw==</latexit><latexit sha1_base64="DVWcy7T2b8PsFsDOlim1HuzAFxc=">AAACAXicbVC7SgNBFJ2Nrxhfa2wCNoNBsQq7abQM2lhGMQ/IxjA7O5sMmX04c1cIy9r4KzYWitj6Fzbiv1g4eRSaeODC4Zx7ufceNxZcgWV9Gbml5ZXVtfx6YWNza3vH3C02VZRIyho0EpFsu0QxwUPWAA6CtWPJSOAK1nKH52O/dcek4lF4DaOYdQPSD7nPKQEt9cySo24lpI5KAmxZN6lDvQhwNct6ZtmqWBPgRWLPSLl2dFX8VKXves/8cLyIJgELgQqiVMe2YuimRAKngmUFJ1EsJnRI+qyjaUgCprrp5IMMH2rFw34kdYWAJ+rviZQESo0CV3cGBAZq3huL/3mdBPzTbsrDOAEW0ukiPxEYIjyOA3tcMgpipAmhkutbMR0QSSjo0Ao6BHv+5UXSrFZsq2Jf2uXaGZoij/bRATpGNjpBNXSB6qiBKLpHj+gZvRgPxpPxarxNW3PGbGYP/YHx/gNChJm9</latexit><latexit sha1_base64="DVWcy7T2b8PsFsDOlim1HuzAFxc=">AAACAXicbVC7SgNBFJ2Nrxhfa2wCNoNBsQq7abQM2lhGMQ/IxjA7O5sMmX04c1cIy9r4KzYWitj6Fzbiv1g4eRSaeODC4Zx7ufceNxZcgWV9Gbml5ZXVtfx6YWNza3vH3C02VZRIyho0EpFsu0QxwUPWAA6CtWPJSOAK1nKH52O/dcek4lF4DaOYdQPSD7nPKQEt9cySo24lpI5KAmxZN6lDvQhwNct6ZtmqWBPgRWLPSLl2dFX8VKXves/8cLyIJgELgQqiVMe2YuimRAKngmUFJ1EsJnRI+qyjaUgCprrp5IMMH2rFw34kdYWAJ+rviZQESo0CV3cGBAZq3huL/3mdBPzTbsrDOAEW0ukiPxEYIjyOA3tcMgpipAmhkutbMR0QSSjo0Ao6BHv+5UXSrFZsq2Jf2uXaGZoij/bRATpGNjpBNXSB6qiBKLpHj+gZvRgPxpPxarxNW3PGbGYP/YHx/gNChJm9</latexit><latexit sha1_base64="Gc1+bFICWIx/a6x9Z2OQKvBP8wk=">AAACAXicbVBNS8NAEN3Ur1q/ol4EL4tF8FSSXvRY9OKxgv2AJpbNZtMu3U3i7kQooV78K148KOLVf+HNf+O2zUFbHww83pthZl6QCq7Bcb6t0srq2vpGebOytb2zu2fvH7R1kinKWjQRieoGRDPBY9YCDoJ1U8WIDATrBKOrqd95YErzJL6Fccp8SQYxjzglYKS+feTpewW5pzOJHecu92iYAK5PJn276tScGfAycQtSRQWaffvLCxOaSRYDFUTrnuuk4OdEAaeCTSpepllK6IgMWM/QmEim/Xz2wQSfGiXEUaJMxYBn6u+JnEitxzIwnZLAUC96U/E/r5dBdOHnPE4zYDGdL4oygSHB0zhwyBWjIMaGEKq4uRXTIVGEggmtYkJwF19eJu16zXVq7o1bbVwWcZTRMTpBZ8hF56iBrlETtRBFj+gZvaI368l6sd6tj3lrySpmDtEfWJ8/BfeWlQ==</latexit>

Page 26: Two Parameter Persistence for Virtual Ligand Screening

An Exampledi

stan

ce

partial charge partial charge

!26

Page 27: Two Parameter Persistence for Virtual Ligand Screening

Example Distances

+ + =

Page 28: Two Parameter Persistence for Virtual Ligand Screening

Results

Page 29: Two Parameter Persistence for Virtual Ligand Screening

�29

Scoring

Test databases have two kinds of molecules in them:• Active molecules are (probably) good drugs• Decoy molecules are poisons, inert, or otherwise bad, but have

many similar properties to the actives

Page 30: Two Parameter Persistence for Virtual Ligand Screening

Datasets

!30

• Two databases tested

• Cleves & Jain dataset is small: 979 compounds, about 850 of which are decoys. The same decoys are used for each protein target. Used for comparison with Shin et al. (2015) study.

• DUD-E (Directory of Useful Decoys - Extended) is a large publicly available testing dataset. We used a 1.5M compound drug-like subset, with samples of about 1000 substances per protein target, with a similarly high percentage of decoys. More realistic than Cleves & Jain.

Page 31: Two Parameter Persistence for Virtual Ligand Screening

Comparison with industry leaders

• OpenEye ROCS is the tool to beat

• Expensive ($60,000 per user), so we couldn’t run it ourselves

• Ultrafast Shape Recognition (USR) (Ballester 2007) is an open source tool that is fast and simple

• We’ll use USR as a reference to estimate our performance vs. ROCS, using the Shin et al. 2015 study

!31

Page 32: Two Parameter Persistence for Virtual Ligand Screening

One small caveat• Even to load the Cleves & Jain dataset, one needs to use the $60,000 package.

• Or you can use open source: RDKit, which we did

• It fails to load a few, so our dataset is slightly smaller than the Shin et al. one

• Based on performance of USR (10.16 on our dataset, 8.8 on Shin et al.), calculate adjustment factor of 1.155

• PHoS best average result: 18.598 / 1.155 = estimated 16.10 vs. ROCS 15.9.

• We judge PHoS is likely about as effective as ROCS, an industry leading system with > 10 years’ history and a company behind it

!32

Page 33: Two Parameter Persistence for Virtual Ligand Screening

2-parameter persistence matters (CJ)

Page 34: Two Parameter Persistence for Virtual Ligand Screening

Choice of parameters matters (DUD-E)

Page 35: Two Parameter Persistence for Virtual Ligand Screening

Different targets like different parameters (CJ)

Page 36: Two Parameter Persistence for Virtual Ligand Screening

Accuracy: L2/Hilbert vs Matching (DUD-E)

Page 37: Two Parameter Persistence for Virtual Ligand Screening

Runtime Performance L2 vs Matching (DUD-E)

Page 38: Two Parameter Persistence for Virtual Ligand Screening

Summary• Estimated performance competitive with the best in the industry

• 2-parameter persistence quite a bit better than 1-parameter persistence in this context

• Different choices for 2nd parameter win on different protein targets.

• L2 distance on restricted Hilbert function is surprisingly effective, and much faster than matching distance

• Preprint now available: https://chemrxiv.org/articles/PHoS_Persistent_Homology_for_Virtual_Screening/6969260

!38

Page 39: Two Parameter Persistence for Virtual Ligand Screening

Side effects• RIVET (http://rivet.online) enhanced:

• C, C++ API (included in standard RIVET)

• Python API (https://github.com/rivettda/rivet-python) (now available!)

• Rust API (https://github.com/rivettda/rivet-rust) (coming soon)

• RIVET console application & APIs can generate Betti information / Hilbert function, bounds information, barcode queries.

• Python & Rust APIs also support calculating both distances described today

• Hera enhanced with C API (to be donated soon if desired)

!39

Page 40: Two Parameter Persistence for Virtual Ligand Screening

Future Work• Understand variation in effectiveness of different 2nd parameters on different protein

targets

• Ways to handle conformations directly (e.g. treat molecules as configuration spaces)

• Combinations of signatures (e.g. include both partial charge and hybridization)

• Complexes other than VR (e.g. alpha, cubical)

• More fine tuning of distance metrics

• Possibly using machine learning, e.g. to learn weights for different persistence modules

• Performance tuning

!40

Page 41: Two Parameter Persistence for Virtual Ligand Screening

Thank you!

!41

[email protected]

• @BrynKeller on Twitter

• https://linkedin.com/in/brynkeller

• https://www.xoltar.org

• Thanks to people who improved RIVET in ways that mattered for this work:

• Mike Lesnick, Matthew Wright, Simon Segert, Roy Zhao

• Thanks to the Hera developers:

• Michael Kerber, Dmitriy Morozov, and Arnur Nigmetov