Impact of noise on inverse design: the case of NMR spectra matching

Literature Information

Publication Date 2023-10-17
DOI 10.1039/D3DD00132F
Impact Factor 0
Authors


View Original

Abstract

Despite its fundamental importance and widespread use for assessing reaction success in organic chemistry, deducing chemical structures from nuclear magnetic resonance (NMR) measurements has remained largely manual and time consuming. To keep up with the accelerated pace of automated synthesis in self driving laboratory settings, robust computational algorithms are needed to rapidly perform structure elucidations. We analyse the effectiveness of solving the NMR spectra matching task encountered in this inverse structure elucidation problem by systematically constraining the chemical search space, and correspondingly reducing the ambiguity of the matching task. Numerical evidence collected for the twenty most common stoichiometries in the QM9-NMR database indicate systematic trends of more permissible machine learning prediction errors in constrained search spaces. Results suggest that compounds with multiple heteroatoms are harder to characterize than others. Extending QM9 by ∼10 times more constitutional isomers with 3D structures generated by Surge, ETKDG and CREST, we used ML models of chemical shifts trained on the QM9-NMR data to test the spectra matching algorithms. Combining both 13C and 1H shifts in the matching process suggests twice as permissible machine learning prediction errors than for matching based on 13C shifts alone. Performance curves demonstrate that reducing ambiguity and search space can decrease machine learning training data needs by orders of magnitude.

Related Literature

Effect of tetrabutylphosphonium cation on the physico-chemical properties of amino-acid ionic liquids

Junko Kagimoto, Kenta Fukumoto, Hiroyuki Ohno

2006-04-25 Communication

DOI: 10.1039/B600771F

The pentanuclear Feii cluster [(C5H4)6Fe5]2−: bringing together ferrocene sandwiches and homoleptic Feii-cyclopentadienyl σ-complexes

Ingeborg Sänger, Julia B. Heilmann, Michael Bolte, Hans-Wolfram Lerner, Matthias Wagner

2006-04-07 Communication

DOI: 10.1039/B602359B

Carbohydrate triazoles and isoxazoles as inhibitors of galectins-1 and -3

Denis Giguère, Ramesh Patnam, Marc-André Bellefleur, Christian St-Pierre, Sachiko Sato, René Roy

2006-03-16 Communication

DOI: 10.1039/B517529A

An electrochemical/photochemical information processing system using a monolayer-functionalized electrode

Ronan Baron, Avital Onopriyenko, Eugenii Katz, Oleg Lioubashevski, Itamar Willner, Sheng Wang, He Tian

2006-02-23 Communication

DOI: 10.1039/B518378B

On the influence of porphyrin π–π stacking on supramolecular chirality created in the porphyrin-based twisted tape structure

Masayuki Takeuchi, Satoshi Tanaka, Seiji Shinkai

2005-10-04 Communication

DOI: 10.1039/B512128K

Synthesis of pyrroles: reaction of chromium N-alkylaminocarbene complexes with α,β-unsaturated aldehydes

Kohei Fuchibe, Daisuke Ono, Takahiko Akiyama

2006-04-26 Communication

DOI: 10.1039/B602924H

Easy access to the family of thiazole N-oxides using HOF·CH3CN

Elizabeta Amir, Shlomo Rozen

2006-04-25 Communication

DOI: 10.1039/B602594C

High energy density materials from azido cyclophosphazenes

K. Muralidharan, Bamidele A. Omotowa, Brendan Twamley, Crystal Piekarski, Jean′ne M. Shreeve

2005-09-20 Communication

DOI: 10.1039/B510924H

The direct α-zincation of amides, phosphonates and phosphine oxides by H–Zn exchange

Mark L. Hlavinka, Jeffrey F. Greco, John R. Hagadorn

2005-09-23 Communication

DOI: 10.1039/B509190J

You might also like

Compound Q&A

What are the main uses of 1H-Indazole-6-carbonitrile (CAS: 141290-59-7)?

1H-Indazole-6-carbonitrile finds applications in pharmaceuticals, where it serve...

141290-59-71H-Indazole-6-carbon...
Compound Q&A

How should waste containing Dioctyl (2E)-2-butenedioate (CAS: 2997-85-5) be handled?

Waste containing Dioctyl (2E)-2-butenedioate (CAS: 2997-85-5) should be collecte...

2997-85-5Dioctyl (2E)-2-buten...
Compound Q&A

What industries use Sodium [(1,2-benzoxazol-3-ylmethyl)sulfonyl]azanide (CAS: 68291-98-5)?

Sodium [(1,2-benzoxazol-3-ylmethyl)sulfonyl]azanide is primarily used in pharmac...

68291-98-5Sodium [(1,2-benzoxa...
Compound Q&A

Are there alternatives to Dimethyl 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2,6-pyridinedicarboxylate (CAS: 741709-66-0) in synthesis?

Dimethyl 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2,6-pyridinedicarboxyla...

741709-66-0Dimethyl 4-(4,4,5,5-...
Compound Q&A

How should waste containing 2-Fluoro-6-hydrazinopyridine (CAS: 80714-39-2) be handled?

Waste containing 2-Fluoro-6-hydrazinopyridine (CAS: 80714-39-2) should be manage...

80714-39-22-Fluoro-6-hydrazino...
Compound Q&A

What is 6-Formyl-2-pyridinecarboxylic acid (CAS: 499214-11-8)?

6-Formyl-2-pyridinecarboxylic acid is an organic compound with the molecular for...

499214-11-86-Formyl-2-pyridinec...
900874-91-13-(3,4-dimethoxyphen...
Compound Q&A

How is 9H-Tribenzo[b,d,f]azepine (CAS: 29875-73-8) typically synthesized?

9H-Tribenzo[b,d,f]azepine is typically synthesized via a multi-step process invo...

29875-73-89H-Tribenzo[b,d,f]az...
Compound Q&A

How is 1-Cyclopropyl-7-ethoxy-6-fluoro-8-methoxy-4-oxo-1,4-dihydro-3-quinolinecarboxylic acid (CAS: 1797982-51-4) typically synthesized?

1-Cyclopropyl-7-ethoxy-6-fluoro-8-methoxy-4-oxo-1,4-dihydro-3-quinolinecarboxyli...

1797982-51-41-Cyclopropyl-7-etho...
Compound Q&A

How should waste containing Methyl 3-oxo-1,2,3,4-tetrahydro-6-quinoxalinecarboxylate (CAS: 671820-52-3) be handled?

Waste containing Methyl 3-oxo-1,2,3,4-tetrahydro-6-quinoxalinecarboxylate (CAS: ...

671820-52-3Methyl 3-oxo-1,2,3,4...
Disclaimer
This page provides academic journal information for reference and research purposes only. We are not affiliated with any journal publishers and do not handle publication submissions. For publication-related inquiries, please contact the respective journal publishers directly.
If you notice any inaccuracies in the information displayed, please contact us at support@chemtradehub.com. We will promptly review and address your concerns.