Impact of noise on inverse design: the case of NMR spectra matching

Literature Information

Publication Date 2023-10-17
DOI 10.1039/D3DD00132F

Abstract

Despite its fundamental importance and widespread use for assessing reaction success in organic chemistry, deducing chemical structures from nuclear magnetic resonance (NMR) measurements has remained largely manual and time-consuming. To keep up with the accelerated pace of automated synthesis in self-driving laboratory settings, robust computational algorithms are needed to rapidly perform structure elucidations. We analyse the effectiveness of solving the NMR spectra matching task encountered in this inverse structure elucidation problem by systematically constraining the chemical search space, and correspondingly reducing the ambiguity of the matching task. Numerical evidence collected for the twenty most common stoichiometries in the QM9-NMR database indicates a systematic trend: the more constrained the search space, the larger the machine learning prediction errors that can be tolerated. Results suggest that compounds with multiple heteroatoms are harder to characterize than others. Extending QM9 by ∼10 times more constitutional isomers with 3D structures generated by Surge, ETKDG and CREST, we used ML models of chemical shifts trained on the QM9-NMR data to test the spectra matching algorithms. Combining both 13C and 1H shifts in the matching process suggests that machine learning prediction errors twice as large are permissible compared with matching based on 13C shifts alone. Performance curves demonstrate that reducing ambiguity and search space can decrease machine learning training data needs by orders of magnitude.
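
The spectra matching task described above can be illustrated with a minimal sketch. It is not the authors' implementation: the distance measure (Euclidean distance between sorted shift lists), the function names and the toy shift values are all assumptions made for illustration. A noisy query spectrum is compared against every candidate in the search space, and the closest candidate is returned; the success rate then shows how much noise the matching tolerates.

```python
import numpy as np

# Hypothetical 13C shift lists (ppm) for three candidate isomers.
# These values are invented for illustration, not taken from QM9-NMR.
TOY_CANDIDATES = [
    [14.1, 22.8, 31.9, 62.5],
    [18.3, 25.7, 67.9, 72.4],
    [10.2, 29.5, 40.3, 208.1],
]


def spectrum_distance(query, candidate):
    """Euclidean distance between two shift lists after sorting both.

    Sorting removes the need to know which atom produced which shift
    (an assumption of this sketch).
    """
    q = np.sort(np.asarray(query, dtype=float))
    c = np.sort(np.asarray(candidate, dtype=float))
    if q.shape != c.shape:
        raise ValueError("spectra must contain the same number of shifts")
    return float(np.linalg.norm(q - c))


def match_spectrum(query, candidates):
    """Return the index of the candidate spectrum closest to the query."""
    distances = [spectrum_distance(query, c) for c in candidates]
    return int(np.argmin(distances))


def matching_success_rate(candidates, noise_std, n_trials=200, seed=0):
    """Fraction of noisy queries that are matched back to their true candidate.

    Gaussian noise of width noise_std (ppm) stands in for the prediction
    error of a machine learning model.
    """
    rng = np.random.default_rng(seed)
    hits = 0
    for _ in range(n_trials):
        true_idx = int(rng.integers(len(candidates)))
        clean = np.asarray(candidates[true_idx], dtype=float)
        noisy = clean + rng.normal(0.0, noise_std, size=clean.shape)
        if match_spectrum(noisy, candidates) == true_idx:
            hits += 1
    return hits / n_trials


if __name__ == "__main__":
    for sigma in (0.0, 1.0, 10.0, 100.0):
        rate = matching_success_rate(TOY_CANDIDATES, sigma)
        print(f"noise std {sigma:6.1f} ppm -> success rate {rate:.2f}")
```

In this toy setting, shrinking the candidate list corresponds to constraining the search space, and appending 1H shifts to each list would correspond to combining both nuclei in the matching process.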
