Generating molecules with optimized aqueous solubility using iterative graph translation

Literature Information

Publication Date 2021-11-15
DOI 10.1039/D1RE00315A
Impact Factor 4.239
Authors

Camille Bilodeau, Wengong Jin, Hongyun Xu, Jillian A. Emerson, Sukrit Mukhopadhyay, Thomas H. Kalantar, Tommi Jaakkola, Regina Barzilay, Klavs F. Jensen


View Original

Abstract

While molecular discovery is critical for solving many scientific problems, the time and resource costs of experiments make it intractable to fully explore chemical space. Here, we present a generative modeling framework that proposes novel molecules that are 1) based on starting candidate structures and 2) optimized with respect to one or more objectives or constraints. We explore how this framework performs in an applied setting by focusing on the problem of optimizing molecules for aqueous solubility, using an experimental database containing data curated from the literature. The resulting model was capable of improving molecules with a range of starting solubilities. When synthetic feasibility was applied as a secondary optimization constraint (estimated using a combination of synthetic accessibility and retrosynthetic accessibility scores), the model generated synthetically feasible molecules 83.0% of the time (compared with 59.9% of the time without the constraint). To validate model performance experimentally, a set of candidate molecules was translated using the model and the solubilities of the candidate and generated molecules were verified experimentally. We additionally validated model performance via experimental measurements by holding out the top 100 most soluble molecules during training and showing that the model could rediscover 33 of those molecules. To determine the sensitivity of model performance to dataset size, we trained the model on different subsets of the initial training dataset. We found that model performance did not decrease significantly when the model was trained on a random 50% subset of the training data but did decrease when the model was trained on subsets containing only less soluble molecules (i.e., the bottom 50%). Overall, this framework serves as a tool for generating optimized, synthetically feasible molecules that can be applied to a range of problems in chemistry and chemical engineering.

Related Literature

The first synthetic studies on pestalotiopsin A. A stereocontrolled approach to the functionalised bicyclic core

Derek Johnston, Emmanuel Couché, David J. Edmonds, Kenneth W. Muir, David J. Procter

2002-12-13 Paper

DOI: 10.1039/B209066J

A pendant peptide endows a sunscreen with water-resistance

Aubrey J. Ellison

2018-09-10 Communication

DOI: 10.1039/C8OB01773E

Preparation and characterization of pyrene modified uridine derivatives as potential electron donors in RNA

Jennifer Frommer, Beatrice Karg, Klaus Weisz, Sabine Müller

2018-09-27 Paper

DOI: 10.1039/C8OB02246A

Assignment of absolute configuration of a chiral phenyl-substituted dihydrofuroangelicin

Gennaro Pescitelli, Nina Berova, Tom L. Xiao, Roman V. Rozhkov, Richard C. Larock, Daniel W. Armstrong

2002-12-03 Paper

DOI: 10.1039/B207652G

Functionalization of diazotetronic acid and application in a stereoselective modular synthesis of pulvinone, aspulvinones A–E, G, Q and their analogues

Amarender Manchoju, Ritesh A. Annadate, Lise Desquien, Sunil V. Pansare

2018-08-10 Paper

DOI: 10.1039/C8OB01511B

Eugenunilones A–H: rearranged sesquiterpenoids from Eugenia uniflora

Mu Chen, Jia-Qing Cao, Song Ang, Ting-Ni Zeng, Ni-Ping Li, Tang-Jia Yang, Jun-Shan Liu, Yan Wu, Wen-Cai Ye, Lei Wang

2021-12-03 Research Article

DOI: 10.1039/D1QO01629F

Ruthenium(ii)-catalyzed synthesis of indazolone-fused cinnolines via C–H coupling with diazo compounds

Lin Su, Zheng Yu, Peiling Ren, Zhi Luo, Wei Hou, Hongtao Xu

2018-09-13 Paper

DOI: 10.1039/C8OB02071J

Metal- and base-free synthesis of functionalized α,α-difluoroimines via electrophilic fluorination of N-substituted enamines

Fangyi Li, Changfu Qiu, Guangwei Yin, Chunhua Wang, Zheng Li

2018-09-13 Communication

DOI: 10.1039/C8OB01941J

You might also like

Compound Q&A

What is the market or research trend for N-(4-Methoxybenzyl)-2-pyridinamine (CAS: 52818-63-0)?

N-(4-Methoxybenzyl)-2-pyridinamine (CAS: 52818-63-0) is increasingly being used ...

52818-63-0N-(4-Methoxybenzyl)-...
Compound Q&A

What precautions should be taken when handling Ethyl 4-(2-chlorophenyl)-1,3-thiazole-2-carboxylate (CAS: 1050507-06-6)?

When handling Ethyl 4-(2-chlorophenyl)-1,3-thiazole-2-carboxylate, appropriate p...

1050507-06-6Ethyl 4-(2-chlorophe...
Compound Q&A

What regulatory guidelines apply to diethyldiselane (CAS: 628-39-7)?

Diethyldiselane (CAS: 628-39-7) is classified under the Globally Harmonized Syst...

628-39-7Diethyldiselane
Compound Q&A

What is the market or research trend for oxocopper (CAS: 12053-18-8)?

The market for oxocopper (CAS: 12053-18-8) is primarily driven by its use in cat...

12053-18-8oxocopper; oxo-(oxoc...
Compound Q&A

What is the market or research trend for 5-{[(2-Methyl-2-propanyl)oxy]carbonyl}-5-azaspiro[2.4]heptane-7-carboxylic acid?

The market for 5-{[(2-Methyl-2-propanyl)oxy]carbonyl}-5-azaspiro[2.4]heptane-7-c...

1268519-54-55-{[(2-Methyl-2-prop...
Compound Q&A

What is 2-(1-Pyrrolidinyl)-4-pyridinamine (CAS: 35981-63-6)?

2-(1-Pyrrolidinyl)-4-pyridinamine is a chemical compound with the CAS number 359...

35981-63-62-(1-Pyrrolidinyl)-4...
Compound Q&A

What are the physical and chemical properties of 2-(3-Pyridinyl)-1-azabicyclo[2.2.2]octane (CAS: 91556-75-1)?

2-(3-Pyridinyl)-1-azabicyclo[2.2.2]octane (CAS: 91556-75-1) is a crystalline sol...

91556-75-12-(3-Pyridinyl)-1-az...
Compound Q&A

How is (S)-Alpha-allyl-proline hydrochloride (CAS: 129704-91-2) typically synthesized?

(S)-Alpha-allyl-proline hydrochloride is usually synthesized via a Wittig reacti...

129704-91-2(S)-Alpha-allyl-prol...
Compound Q&A

What is 3-Methyl-1,2-oxazole-5-carboxylic acid (CAS: 4857-42-5)?

3-Methyl-1,2-oxazole-5-carboxylic acid (CAS: 4857-42-5) is an organic compound w...

4857-42-53-Methyl-1,2-oxazole...
Compound Q&A

How is Lys-SMCC-DM1 (CAS: 1281816-04-3) typically synthesized?

Lys-SMCC-DM1 is synthesized via a multi-step process involving the coupling of S...

1281816-04-3Lys-SMCC-DM1

Source Journal

Reaction Chemistry & Engineering

Reaction Chemistry & Engineering
CiteScore: 0
Self-citation Rate: 8.8%
Articles per Year: 284

Reaction Chemistry & Engineering is an interdisciplinary journal reporting cutting-edge research focused on enhancing the understanding and efficiency of reactions. Reaction engineering leverages the interface where fundamental molecular chemistry meets chemical engineering and technology. Challenges in chemistry can be overcome by the application of new technologies, while engineers may find improved solutions for process development from the latest developments in reaction chemistry. Reaction Chemistry & Engineering is a unique forum for researchers whose interests span the broad areas of chemical engineering and chemical sciences to come together in solving problems of importance to wider society. All papers should be written to be approachable by readers across the engineering and chemical sciences. Papers that consider multiple scales, from the laboratory up to and including plant scale, are particularly encouraged.

Recommended Compounds

Recommended Suppliers

Disclaimer
This page provides academic journal information for reference and research purposes only. We are not affiliated with any journal publishers and do not handle publication submissions. For publication-related inquiries, please contact the respective journal publishers directly.
If you notice any inaccuracies in the information displayed, please contact us at support@chemtradehub.com. We will promptly review and address your concerns.