Domain-specific chatbots for science using embeddings
Literature Information
Kevin G. Yager
Large language models (LLMs) have emerged as powerful machine-learning systems capable of handling a myriad of tasks. Tuned versions of these systems have been turned into chatbots that can respond to user queries on a vast diversity of topics, providing informative and creative replies. However, their application to physical science research remains limited owing to their incomplete knowledge in these areas, contrasted with the needs of rigor and sourcing in science domains. Here, we demonstrate how existing methods and software tools can be easily combined to yield a domain-specific chatbot. The system ingests scientific documents in existing formats, and uses text embedding lookup to provide the LLM with domain-specific contextual information when composing its reply. We similarly demonstrate that existing image embedding methods can be used for search and retrieval across publication figures. These results confirm that LLMs are already suitable for use by physical scientists in accelerating their research efforts.
Related Literature
Dimeric phenanthroimidazole for blue electroluminescent materials: the effect of substituted position attached to biphenyl center
Zhiming Wang, Ying Feng, Hui Li, Zhao Gao, Xiaojuan Zhang, Ping Lu, Ping Chen, Yuguang Ma, Shiyong Liu
DOI: 10.1039/C4CP00209A
Electronic structure at nanocontacts of surface passivated CdSe nanorods with gold clusters
Deepashri Saraf, Anjali Kshirsagar
DOI: 10.1039/C4CP00069B
The influence of the electrolyte on chemical and morphological modifications of an iron sulfide thin film negative electrode
Feng Liao, Jolanta Światowska, Vincent Maurice, Antoine Seyeux, Lorena H. Klein, Sandrine Zanna, Philippe Marcus
DOI: 10.1039/C4CP04041D
Orientation effects in morphology and electronic properties of anatase TiO2 one-dimensional nanostructures. I. Nanowires
Dmitri B. Migas, Andrew B. Filonov, Victor E. Borisenko
DOI: 10.1039/C3CP54988G
The energy transfer mechanism in Pr3+ and Yb3+ codoped β-NaLuF4 nanocrystals
Jiahua Zhang, Zhendong Hao, Xia Zhang, Guohui Pan, Yongshi Luo, Shaozhe Lü, Haifeng Zhao
DOI: 10.1039/C4CP01184H
Systematic experimental charge density analysis of anion receptor complexes
Isabelle L. Kirby, Mark Brightwell, Mateusz B. Pitak, Claire Wilson, Simon J. Coles
DOI: 10.1039/C3CP54858A
An ab initio study of the CrHe diatomic molecule: the effect of van der Waals distortion on a highly magnetic multi-electron system
Johann V. Pototschnig, Martin Ratschek, Andreas W. Hauser, Wolfgang E. Ernst
DOI: 10.1039/C4CP00559G
First-principles study of ground-state properties of U2Mo
Xiyue Cheng, Yuting Zhang, Ronghan Li, Weiwei Xing, Pengcheng Zhang, Xing-Qiu Chen
DOI: 10.1039/C4CP03841J
Beyond the molecular orbital conception of electronically excited states through the quantum theory of atoms in molecules
David Ferro-Costas, Ángel Martín Pendás, Leticia González, Ricardo A. Mosquera
DOI: 10.1039/C4CP00431K
Copper clusters as novel fluorescent probes for the detection and photocatalytic elimination of lead ions
M. A. López-Quintela
DOI: 10.1039/C4CP02148G
You might also like
What are the main uses of 1H-Indazole-6-carbonitrile (CAS: 141290-59-7)?
1H-Indazole-6-carbonitrile finds applications in pharmaceuticals, where it serve...
How should waste containing Dioctyl (2E)-2-butenedioate (CAS: 2997-85-5) be handled?
Waste containing Dioctyl (2E)-2-butenedioate (CAS: 2997-85-5) should be collecte...
What industries use Sodium [(1,2-benzoxazol-3-ylmethyl)sulfonyl]azanide (CAS: 68291-98-5)?
Sodium [(1,2-benzoxazol-3-ylmethyl)sulfonyl]azanide is primarily used in pharmac...
Are there alternatives to Dimethyl 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2,6-pyridinedicarboxylate (CAS: 741709-66-0) in synthesis?
Dimethyl 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2,6-pyridinedicarboxyla...
How should waste containing 2-Fluoro-6-hydrazinopyridine (CAS: 80714-39-2) be handled?
Waste containing 2-Fluoro-6-hydrazinopyridine (CAS: 80714-39-2) should be manage...
What is 6-Formyl-2-pyridinecarboxylic acid (CAS: 499214-11-8)?
6-Formyl-2-pyridinecarboxylic acid is an organic compound with the molecular for...
What is the market or research trend for 3-(3,4-dimethoxyphenyl)-2,5-dimethyl-N-(2-morpholin-4-ylethyl)pyrazolo[1,5-a]pyrimidin-7-amine (CAS: 900874-91-1)?
Research trends for this compound indicate a focus on its potential applications...
How is 9H-Tribenzo[b,d,f]azepine (CAS: 29875-73-8) typically synthesized?
9H-Tribenzo[b,d,f]azepine is typically synthesized via a multi-step process invo...
How is 1-Cyclopropyl-7-ethoxy-6-fluoro-8-methoxy-4-oxo-1,4-dihydro-3-quinolinecarboxylic acid (CAS: 1797982-51-4) typically synthesized?
1-Cyclopropyl-7-ethoxy-6-fluoro-8-methoxy-4-oxo-1,4-dihydro-3-quinolinecarboxyli...
How should waste containing Methyl 3-oxo-1,2,3,4-tetrahydro-6-quinoxalinecarboxylate (CAS: 671820-52-3) be handled?
Waste containing Methyl 3-oxo-1,2,3,4-tetrahydro-6-quinoxalinecarboxylate (CAS: ...













![N-[(9H-Fluoren-9-ylmethoxy)carbonyl]serine structure N-[(9H-Fluoren-9-ylmethoxy)carbonyl]serine structure](https://static.chemtradehub.com/structs/737/73724-45-5-b0dc.webp)

