Domain-specific chatbots for science using embeddings

Literature Information

Publication Date 2023-10-10
DOI 10.1039/D3DD00112A
Impact Factor 0
Authors

Kevin G. Yager


View Original

Abstract

Large language models (LLMs) have emerged as powerful machine-learning systems capable of handling a myriad of tasks. Tuned versions of these systems have been turned into chatbots that can respond to user queries on a vast diversity of topics, providing informative and creative replies. However, their application to physical science research remains limited owing to their incomplete knowledge in these areas, contrasted with the needs of rigor and sourcing in science domains. Here, we demonstrate how existing methods and software tools can be easily combined to yield a domain-specific chatbot. The system ingests scientific documents in existing formats, and uses text embedding lookup to provide the LLM with domain-specific contextual information when composing its reply. We similarly demonstrate that existing image embedding methods can be used for search and retrieval across publication figures. These results confirm that LLMs are already suitable for use by physical scientists in accelerating their research efforts.

Related Literature

Dimeric phenanthroimidazole for blue electroluminescent materials: the effect of substituted position attached to biphenyl center

Zhiming Wang, Ying Feng, Hui Li, Zhao Gao, Xiaojuan Zhang, Ping Lu, Ping Chen, Yuguang Ma, Shiyong Liu

2014-03-07 Paper

DOI: 10.1039/C4CP00209A

Electronic structure at nanocontacts of surface passivated CdSe nanorods with gold clusters

Deepashri Saraf, Anjali Kshirsagar

2014-02-27 Paper

DOI: 10.1039/C4CP00069B

The influence of the electrolyte on chemical and morphological modifications of an iron sulfide thin film negative electrode

Feng Liao, Jolanta Światowska, Vincent Maurice, Antoine Seyeux, Lorena H. Klein, Sandrine Zanna, Philippe Marcus

2014-11-10 Paper

DOI: 10.1039/C4CP04041D

Orientation effects in morphology and electronic properties of anatase TiO2 one-dimensional nanostructures. I. Nanowires

Dmitri B. Migas, Andrew B. Filonov, Victor E. Borisenko

2014-03-26 Paper

DOI: 10.1039/C3CP54988G

The energy transfer mechanism in Pr3+ and Yb3+ codoped β-NaLuF4 nanocrystals

Jiahua Zhang, Zhendong Hao, Xia Zhang, Guohui Pan, Yongshi Luo, Shaozhe Lü, Haifeng Zhao

2014-04-03 Paper

DOI: 10.1039/C4CP01184H

Systematic experimental charge density analysis of anion receptor complexes

Isabelle L. Kirby, Mark Brightwell, Mateusz B. Pitak, Claire Wilson, Simon J. Coles

2014-04-28 Paper

DOI: 10.1039/C3CP54858A

An ab initio study of the CrHe diatomic molecule: the effect of van der Waals distortion on a highly magnetic multi-electron system

Johann V. Pototschnig, Martin Ratschek, Andreas W. Hauser, Wolfgang E. Ernst

2014-03-25 Paper

DOI: 10.1039/C4CP00559G

First-principles study of ground-state properties of U2Mo

Xiyue Cheng, Yuting Zhang, Ronghan Li, Weiwei Xing, Pengcheng Zhang, Xing-Qiu Chen

2014-10-20 Paper

DOI: 10.1039/C4CP03841J

Beyond the molecular orbital conception of electronically excited states through the quantum theory of atoms in molecules

David Ferro-Costas, Ángel Martín Pendás, Leticia González, Ricardo A. Mosquera

2014-03-17 Paper

DOI: 10.1039/C4CP00431K

Copper clusters as novel fluorescent probes for the detection and photocatalytic elimination of lead ions

M. A. López-Quintela

2014-08-01 Communication

DOI: 10.1039/C4CP02148G

You might also like

Compound Q&A

What are the main uses of 1H-Indazole-6-carbonitrile (CAS: 141290-59-7)?

1H-Indazole-6-carbonitrile finds applications in pharmaceuticals, where it serve...

141290-59-71H-Indazole-6-carbon...
Compound Q&A

How should waste containing Dioctyl (2E)-2-butenedioate (CAS: 2997-85-5) be handled?

Waste containing Dioctyl (2E)-2-butenedioate (CAS: 2997-85-5) should be collecte...

2997-85-5Dioctyl (2E)-2-buten...
Compound Q&A

What industries use Sodium [(1,2-benzoxazol-3-ylmethyl)sulfonyl]azanide (CAS: 68291-98-5)?

Sodium [(1,2-benzoxazol-3-ylmethyl)sulfonyl]azanide is primarily used in pharmac...

68291-98-5Sodium [(1,2-benzoxa...
Compound Q&A

Are there alternatives to Dimethyl 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2,6-pyridinedicarboxylate (CAS: 741709-66-0) in synthesis?

Dimethyl 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2,6-pyridinedicarboxyla...

741709-66-0Dimethyl 4-(4,4,5,5-...
Compound Q&A

How should waste containing 2-Fluoro-6-hydrazinopyridine (CAS: 80714-39-2) be handled?

Waste containing 2-Fluoro-6-hydrazinopyridine (CAS: 80714-39-2) should be manage...

80714-39-22-Fluoro-6-hydrazino...
Compound Q&A

What is 6-Formyl-2-pyridinecarboxylic acid (CAS: 499214-11-8)?

6-Formyl-2-pyridinecarboxylic acid is an organic compound with the molecular for...

499214-11-86-Formyl-2-pyridinec...
900874-91-13-(3,4-dimethoxyphen...
Compound Q&A

How is 9H-Tribenzo[b,d,f]azepine (CAS: 29875-73-8) typically synthesized?

9H-Tribenzo[b,d,f]azepine is typically synthesized via a multi-step process invo...

29875-73-89H-Tribenzo[b,d,f]az...
Compound Q&A

How is 1-Cyclopropyl-7-ethoxy-6-fluoro-8-methoxy-4-oxo-1,4-dihydro-3-quinolinecarboxylic acid (CAS: 1797982-51-4) typically synthesized?

1-Cyclopropyl-7-ethoxy-6-fluoro-8-methoxy-4-oxo-1,4-dihydro-3-quinolinecarboxyli...

1797982-51-41-Cyclopropyl-7-etho...
Compound Q&A

How should waste containing Methyl 3-oxo-1,2,3,4-tetrahydro-6-quinoxalinecarboxylate (CAS: 671820-52-3) be handled?

Waste containing Methyl 3-oxo-1,2,3,4-tetrahydro-6-quinoxalinecarboxylate (CAS: ...

671820-52-3Methyl 3-oxo-1,2,3,4...

Source Journal

Digital Discovery

Digital Discovery
CiteScore: 0
Self-citation Rate: 0%
Articles per Year: 0

Recommended Compounds

Recommended Suppliers

Disclaimer
This page provides academic journal information for reference and research purposes only. We are not affiliated with any journal publishers and do not handle publication submissions. For publication-related inquiries, please contact the respective journal publishers directly.
If you notice any inaccuracies in the information displayed, please contact us at support@chemtradehub.com. We will promptly review and address your concerns.