Extracting structured seed-mediated gold nanorod growth procedures from scientific text with LLMs
Nicholas Walker, Anubhav Jain
Although gold nanorods have been studied extensively, the pathways for controlling their shape, and thereby their optical properties, remain largely heuristically understood. It is apparent that the simultaneous presence of and interaction between various reagents during synthesis control these properties, but computational and experimental approaches for exploring the synthesis space can be either intractable or too time-consuming in practice. This motivates an alternative approach: leveraging the wealth of synthesis information already embedded in the scientific literature by developing tools that extract relevant structured data in an automated, high-throughput manner. To that end, we present an approach that uses the GPT-3 language model to extract structured multi-step seed-mediated growth procedures and outcomes for gold nanorods from unstructured scientific text. GPT-3 is fine-tuned so that its prompt completions predict synthesis templates in the form of JSON documents from unstructured text input, with an overall accuracy of 86% aggregated by entities and 76% aggregated by papers. This performance is notable considering that the model performs entity recognition and relation extraction simultaneously. We present a dataset of 11 644 entities extracted from 1137 papers, resulting in 268 papers with at least one complete seed-mediated gold nanorod growth procedure and outcome, for a total of 332 complete procedures.
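The difference between accuracy "aggregated by entities" and "aggregated by papers" can be illustrated with a small sketch. Everything below is an assumption made for illustration: the JSON field names (`seed`, `outcome`, and so on), the reagent values, and the exact-match scoring rule are hypothetical and are not taken from the paper, whose template schema and evaluation procedure may differ. The sketch treats each leaf of a JSON template as one entity, pools all entities for the first score, and averages per-paper scores for the second.

```python
import json

def flatten(node, prefix=()):
    """Turn a nested template into {path: value} pairs, one per leaf entity."""
    if isinstance(node, dict):
        items = {}
        for key, value in node.items():
            items.update(flatten(value, prefix + (key,)))
        return items
    return {prefix: node}

def score(pred, gold):
    """Return (correct, total) over the entities of the reference template."""
    gold_flat, pred_flat = flatten(gold), flatten(pred)
    correct = sum(1 for path, value in gold_flat.items()
                  if pred_flat.get(path) == value)
    return correct, len(gold_flat)

def accuracy_by_entities(pairs):
    """Pool every entity from every paper, then take one ratio."""
    correct = total = 0
    for pred, gold in pairs:
        c, t = score(pred, gold)
        correct += c
        total += t
    return correct / total

def accuracy_by_papers(pairs):
    """Compute one accuracy per paper, then average across papers."""
    per_paper = [c / t for c, t in (score(p, g) for p, g in pairs)]
    return sum(per_paper) / len(per_paper)

# Invented example data: (predicted template, reference template) per paper.
paper_a_gold = json.loads("""
{"seed": {"HAuCl4": {"volume": "5 mL", "concentration": "0.5 mM"}},
 "outcome": {"length": "50 nm", "width": "12 nm"}}
""")
paper_a_pred = json.loads("""
{"seed": {"HAuCl4": {"volume": "5 mL", "concentration": "0.5 mM"}},
 "outcome": {"length": "50 nm", "width": "21 nm"}}
""")
paper_b_gold = json.loads('{"outcome": {"aspect_ratio": "4.0"}}')
paper_b_pred = json.loads('{"outcome": {}}')

pairs = [(paper_a_pred, paper_a_gold), (paper_b_pred, paper_b_gold)]
by_entities = accuracy_by_entities(pairs)
by_papers = accuracy_by_papers(pairs)
print(f"by entities: {by_entities:.3f}, by papers: {by_papers:.3f}")
```

In this toy case the entity-pooled score is higher than the paper-averaged score because the paper with few entities is the one extracted poorly. Under this assumed definition, a gap in the same direction (such as the reported 86% versus 76%) would be consistent with errors concentrating in a subset of papers.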