An interpretable machine learning framework for modelling macromolecular interaction mechanisms with nuclear magnetic resonance
Literature Information
Samantha Stuart, Jeffrey Watchorn
Macromolecular interactions, such as polymer–protein binding, determine the biological fate of biomaterials. However, in most macromolecular binding systems, underlying interaction mechanisms are unclear, limiting capabilities for in vitro prediction. In particular, the atomic-level structure–activity relationships that drive protein–polymer binding are confounding. To overcome this gap, we developed a machine learning framework that applies interaction data from direct saturation compensated nuclear magnetic resonance (DISCO NMR) to classify polymer proton descriptors to their interactive behaviors with mucin proteins. The framework constructs structure-interaction trends from cross-polymer atomic-level behavior patterns, and identifies “undervalued” inert polymer groups with potential to be engineered towards interaction. Trends are constructed from materials-agnostic interaction descriptors that combine chemical shift fingerprints, molecular weight, and cumulative DISCO effect from saturation transfer buildup, mapping proton chemical, physical, and conformational attributes together. In this work we constructed a fully-trained decision tree classifier to model structure–activity after applying principal component analysis (accuracy = 0.92, F1 = 0.87) and interpreted its decision rules to improve scientific understanding of mucin binding. Several undervalued inert protons identified by the model include: HPC 80 kDa (4.58 ppm), HPMC 120 kDa (4.48 ppm), PVA 105 kDa (1.58 ppm), DEX 150 kDa (5.20 ppm), PVP 55 kDa (3.89 ppm), CMC 90 kDa (4.58 ppm), and PEOZ 50 kDa (3.42 ppm). The model additionally suggested a structure–activity relationship is shared by HPC, CMC, DEX, and HPMC protons in the 80–150 kDa range. More broadly, the framework and its descriptors can be applied for data-driven discovery of new polymer formulations using previously obscure cross-polymer sub-group trends, and is similarly applicable to any receptor-ligand system compatible with DISCO-NMR screening.
Related Literature
A new alternative to Stryker's reagent in hydrosilylation: synthesis, structure, and reactivity of a well-defined carbene–copper(ii) acetate complex
Jaesook Yun, Daesung Kim, Hoseop Yun
DOI: 10.1039/B509964A
Smart amphiphiles: hydro/organogelators for in situreduction of gold
Praveen Kumar Vemula, George John
DOI: 10.1039/B518289A
Synthesis and spectroscopic observation of dendrimer-encapsulated gold nanoclusters
M. Linh Tran, Andrei V. Zvyagin, Taras Plakhotnik
DOI: 10.1039/B602079H
Direction of unusual mixed-ligand metal–organic frameworks: a new type of 3-D polythreading involving 1-D and 2-D structural motifs and a 2-fold interpenetrating porous network
Miao Du, Xiu-Juan Jiang, Xiao-Jun Zhao
DOI: 10.1039/B509875K
Insulated conducting polymers: manipulating charge transport using supramolecular complexes
Phoebe H. Kwan, Timothy M. Swager
DOI: 10.1039/B508399K
Nanoassembly of biocompatible microcapsules for urease encapsulation and their use as biomimetic reactors
Aimin Yu, Ian Gentle, Gaoqing Lu, Frank Caruso
DOI: 10.1039/B601490A
Intramolecular alkene hydroaminations catalyzed by a bis(thiophosphinic amidate) Zr(iv) complex
Hyunseok Kim, Phil Ho Lee, Tom Livinghouse
DOI: 10.1039/B505738H
Base-catalysed asymmetric hydroamination/cyclisation of aminoalkenes utilising a dimeric chiral diamidobinaphthyl dilithium salt
Patricia Horrillo Martínez, Kai C. Hultzsch, Frank Hampel
DOI: 10.1039/B518360J
An achiral form of the hexameric resorcin[4]arene capsule sustained by hydrogen bonding with alcohols
Onome Ugono, K. Travis Holman
DOI: 10.1039/B604148E
ortho-Hydroxylation of benzoic acids with hydrogen peroxide at a non-heme iron center
Sonia Taktak, Margaret Flook, Bruce M. Foxman, Lawrence Que, Jr., Elena V. Rybak-Akimova
DOI: 10.1039/B508004E
You might also like
What are the main uses of 1H-Indazole-6-carbonitrile (CAS: 141290-59-7)?
1H-Indazole-6-carbonitrile finds applications in pharmaceuticals, where it serve...
How should waste containing Dioctyl (2E)-2-butenedioate (CAS: 2997-85-5) be handled?
Waste containing Dioctyl (2E)-2-butenedioate (CAS: 2997-85-5) should be collecte...
What industries use Sodium [(1,2-benzoxazol-3-ylmethyl)sulfonyl]azanide (CAS: 68291-98-5)?
Sodium [(1,2-benzoxazol-3-ylmethyl)sulfonyl]azanide is primarily used in pharmac...
Are there alternatives to Dimethyl 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2,6-pyridinedicarboxylate (CAS: 741709-66-0) in synthesis?
Dimethyl 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2,6-pyridinedicarboxyla...
How should waste containing 2-Fluoro-6-hydrazinopyridine (CAS: 80714-39-2) be handled?
Waste containing 2-Fluoro-6-hydrazinopyridine (CAS: 80714-39-2) should be manage...
What is 6-Formyl-2-pyridinecarboxylic acid (CAS: 499214-11-8)?
6-Formyl-2-pyridinecarboxylic acid is an organic compound with the molecular for...
What is the market or research trend for 3-(3,4-dimethoxyphenyl)-2,5-dimethyl-N-(2-morpholin-4-ylethyl)pyrazolo[1,5-a]pyrimidin-7-amine (CAS: 900874-91-1)?
Research trends for this compound indicate a focus on its potential applications...
How is 9H-Tribenzo[b,d,f]azepine (CAS: 29875-73-8) typically synthesized?
9H-Tribenzo[b,d,f]azepine is typically synthesized via a multi-step process invo...
How is 1-Cyclopropyl-7-ethoxy-6-fluoro-8-methoxy-4-oxo-1,4-dihydro-3-quinolinecarboxylic acid (CAS: 1797982-51-4) typically synthesized?
1-Cyclopropyl-7-ethoxy-6-fluoro-8-methoxy-4-oxo-1,4-dihydro-3-quinolinecarboxyli...
How should waste containing Methyl 3-oxo-1,2,3,4-tetrahydro-6-quinoxalinecarboxylate (CAS: 671820-52-3) be handled?
Waste containing Methyl 3-oxo-1,2,3,4-tetrahydro-6-quinoxalinecarboxylate (CAS: ...















