Top Banner
The Chemical Rediscovery Survey and the role of Openness in Chemistry Jean-Claude Bradley March 6, 2014 Research Mini-Symposia Associate Professor of Chemistry Drexel University Drexel University Department of Chemistry
47

A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

May 10, 2015

Download

Education

Jean-Claude Bradley provides examples of how detailed monitoring of chemical mixing can be advantageous for new discoveries and Green Chemistry. The role of openness to successfully accomplish this goal is also discussed.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

The Chemical Rediscovery Survey

and the role of Openness in Chemistry

Jean-Claude Bradley

March 6, 2014

Research Mini-Symposia

Associate Professor of ChemistryDrexel University

Drexel University Department of Chemistry

Page 2: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Forms of Openness in Science

1. Open Access Peer-Reviewed Publication

2. Informal Discussions3. Open Source Software4. Open Datasets and Models5. Open Notebook Science6. Open Research Proposals

Page 3: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Top 5 questions in chemistry according to Scientific American

(Nov 5, 2013)

1. Can we unravel the puzzle of life’s origins?

2. Can we ever beat photosynthesis?

3. How do we make chemistry environmentally friendly?

4. Can we design the perfect drug?5. How do we sell chemistry to the

public?

Page 4: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

The current paradigm of doing and sharing science in

chemistry1. Design experiments based on established or potentially new theories.

2. Execute and record experimental outcomes in private notebooks.

3. When a sufficient narrative emerges selective experimental data are combined to publish, with a limited amount of “supplementary supporting data”

Page 5: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

What kind of (chemical) worldview has this approach

created?1. Selective bias towards which experiment are even attempted.

2. Overconfidence in our understanding since deviant or ambiguous results are rarely reported.

Page 6: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Filling in the blind spots with the Chemical Rediscovery Survey

(chemrs.wikispaces.com)

1. Randomize the mixture of chemicals with certain criteria*

2. Identify “what happens” after convenient* periods of time.

3. Follow up on unexpected behavior with the traditional scientific method.

4. Openly share the entire process, including all raw data and preliminary hypotheses and discoveries as it happens.

Page 7: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

The current CRS criteria

1. Only small common cheap organic compounds

2. Only select relatively “Green” compounds

3. Avoid excessively unpleasant compounds (stench!)

Page 8: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Co-axial NMR tubes are used to isolate the reaction from the

deuterated solvent

Page 9: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

An example of a Chemical Rediscovery Survey experiment

Page 10: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

The overall reaction is easily identified by NMR

Page 11: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Raw NMR data is provided for open analysis

Page 12: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

The experiment is represented in a machine readable matrix: mole

fractions

Page 13: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

All assignable NMR peaks are also archived for machine readability

Page 14: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Report discoveries as they happen

Page 15: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Example of a more human readable format

Page 16: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

In the case of ethanol, hemiacetal OH appears as doublet (7.3 Hz)

Page 17: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

This level of detail for monitoring chemical interactions is not typically available from the

chemical literature (Open or Not)

Using NMR spectroscopy in this way to create an Open Survey of chemical behavior is analogous in astronomy to creating a new Survey of Space by introducing a new telescope

Page 18: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

NMR requires a homogeneous solution for proper measurement

However once an interesting reaction has been observed to occur slowly at 25C and low concentration, preparative scale-up conditions can be estimated (i.e. reaction rate doubles about every 10C)

Page 19: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

The Recrystallization App (Open)

(Andrew Lang)

Page 20: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

What are good solvents to recrystallize benzoic acid?

(Andrew Lang)

Page 21: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Click on the solvent to see temp curve (Open)

(Andrew Lang)

Page 22: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

The role of Openness in rethinking how to tackle the “big chemistry questions”

Q3. How do we make chemistry environmentally friendly?

By limiting ourselves to relatively Green compounds and by sharing all data in real time we are much more likely to find Green reactions from the CRS project and encourage others to benefit.

This would reduce student exposure in teaching labs and lower costs for waste disposal

Page 23: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Q4. Can we design the perfect drug?

(Andrew Lang)

We can try to do Open Drug Discovery – we have found active lead compounds against malaria for example and working on Taxol analogs

Page 24: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Q5. How do we sell chemistry to the public?We are approaching 1000 queries a day for specific solubility and melting point data. Some originate from academia and industry but many from high schools and the general public.

By concentrating on “Green” non toxic and readily available compounds and by providing Open resources to encourage their curiosity the public will become more engaged and understand the importance of chemistry.

Page 25: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Contributing to Science while Teaching it:

Chemical Information Retrieval Class

Page 26: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Chemical Information Validation Sheet 2012

Page 27: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Each entry validated with an image

Page 28: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Alfa Aesar donates melting points to the public

Page 29: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Outliers for ethanol: Alfa Aesar and Oxford MSDS

Page 30: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

OutliersMDPI

datasetEPI (donated all data to public

also)

Page 31: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Open Melting Point DatasetsCurrently 20,000 compounds with Open MPs

Page 32: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

American Petroleum Institute 5 CPHYSPROP -30 CPHYSPROP 125 Cpeer reviewed journal (2008) 97.5 Cgovernment database -30 Cgovernment database 4.58 C

What is the melting point of 4-benzyltoluene?

Page 33: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Open Lab Notebook page measuring the melting point of 4-benzyltoluene

Page 34: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

An example of a failed experiment in an Open Notebook with useful information

Page 35: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

A failed experiment reveals the importance of aldehyde solubility

Page 36: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Information from the literature on the target synthesis

Page 37: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Motivation: Faster Science, Better Science

Page 38: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

There are NO FACTS, only measurements embedded

within assumptions

Open Notebook Science maintains the integrity of data

provenance by making assumptions explicit

Page 39: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

An example of a successful experiment in an Open Notebook that was used to improve the

teaching lab manual

Page 40: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Open Random Forest modeling of Open Melting Point data using CDK descriptors

(Andrew Lang)

R2 = 0.78, TPSA and nHdon most important

Page 41: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Melting point prediction service

Page 42: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Web services for summary data

(Andrew Lang)

Page 43: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Using a Google Spreadsheet as a “dashboard interface” for reaction planning and analysis

Page 44: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Calling Google App Scripts

Page 45: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Calling Google App Scripts

(Andrew Lang and Rich Apodaca)

Page 46: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Google Apps Scripts web services

Page 47: A brief description of the Chemical Rediscovery Survey and Open Chemistry in the Bradley Lab at Drexel University

Conclusions

More openness in chemistry can make science more efficient and address many of the key current questions challenging chemistry community

Provide interfaces that make sense to the end users: Open Data, Open Models and Open Source Software to modelersApps (smartphones, Google App Scripts, etc.) for chemists at the bench Acknowledgements

Andrew Lang (code, modeling)Bill Acree (modeling, solubility data contribution)Antony Williams (ChemSpider services, mp data curation)Matthew McBride and Rida Atif (recrystallization and synthesis)Kayla Gogarty, Cuepil Choi, Matthew McBride, Alex Turfa (CRS)