ChemDataWriter: a transformer-based toolkit for auto-generating books that summarise research

Literature Information

Publication Date 2023-10-04
DOI 10.1039/D3DD00159H
Impact Factor 0
Authors

Shu Huang


View Original

Abstract

Since the number of scientific papers has grown substantially over recent years, scientists spend much time searching, screening, and reading papers to follow the latest research trends. With the development of advanced natural-language-processing (NLP) models, transformer-based text-generation algorithms have the potential to summarise scientific papers and automatically write a literature review from numerous scientific publications. In this paper, we introduce a Python-based toolkit, ChemDataWriter, which auto-generates books about research in a completely unsupervised fashion. ChemDataWriter adopts a conservative book-generation pipeline to automatically write the book by suggesting potential book content, retrieving and re-ranking the relevant papers, and then summarising and paraphrasing the text within the paper. To the best of our knowledge, ChemDataWriter is the first open-source toolkit in the area of chemistry to be able to compose a literature review entirely via artificial intelligence once one has suggested a broad topic. We also provide an example of a book that ChemDataWriter has auto-generated about battery-materials research. To aid the use of ChemDataWriter, its code is provided with associated documentation to serve as a user guide.

Related Literature

Fatigue-resistant photochromic dithienylethenes by controlling the oxidation state

Yong-Chul Jeong, Dae Gyu Park, Eunkyoung Kim, Kwang-Hyun Ahn, Sung Ik Yang

2006-03-21 Communication

DOI: 10.1039/B600754F

Non-catalytic and template-free growth of aligned CdS nanowires exhibiting high field emission current densities

Yi-Feng Lin, Yung-Jung Hsu, Shih-Yuan Lu, Sheng-Chin Kung

2006-05-02 Communication

DOI: 10.1039/B604309G

Highly stable cyclic dimers based on non-covalent interactions

Valérie G. H. Lafitte, Abil E. Aliev, Peter N. Horton, Michael B. Hursthouse, Helen C. Hailes

2006-04-20 Communication

DOI: 10.1039/B600459H

Monitoring the formation of TTF dimers by Na+ complexation

Marc Sallé, Jean-Yves Balandier, Franck Le Derf, Eric Levillain, Magali Allain, Pascal Viel, Serge Palacin

2006-03-30 Communication

DOI: 10.1039/B518275A

Insulated conducting polymers: manipulating charge transport using supramolecular complexes

Phoebe H. Kwan, Timothy M. Swager

2005-09-21 Communication

DOI: 10.1039/B508399K

Samarium diiodide-induced intramolecular pinacol coupling of dinitrones: synthesis of cyclic cis-vicinal diamines

Jean-Philippe Ebran, Rita G. Hazell, Troels Skrydstrup

2005-09-30 Communication

DOI: 10.1039/B511491H

Switching a molecular shuttle on and off: simple, pH-controlled pseudorotaxanes based on cucurbit[7]uril

Vladimir Sindelar, Serena Silvi, Angel E. Kaifer

2006-03-31 Communication

DOI: 10.1039/B601959E

Fluorescence based strategies for genetic analysis

Rohan T. Ranasinghe, Tom Brown

2005-09-30 Feature Article

DOI: 10.1039/B509522K

Inside front cover

Front/Back Matter

DOI: 10.1039/B606514G

Synthesis of organic–inorganic hybrid mesoporous tin oxophosphate in the presence of anionic surfactant

Masahiro Fujiwara, Masahiko Matsukata

2005-09-20 Communication

DOI: 10.1039/B508589F

You might also like

Compound Q&A

What are the main uses of 1H-Indazole-6-carbonitrile (CAS: 141290-59-7)?

1H-Indazole-6-carbonitrile finds applications in pharmaceuticals, where it serve...

141290-59-71H-Indazole-6-carbon...
Compound Q&A

How should waste containing Dioctyl (2E)-2-butenedioate (CAS: 2997-85-5) be handled?

Waste containing Dioctyl (2E)-2-butenedioate (CAS: 2997-85-5) should be collecte...

2997-85-5Dioctyl (2E)-2-buten...
Compound Q&A

What industries use Sodium [(1,2-benzoxazol-3-ylmethyl)sulfonyl]azanide (CAS: 68291-98-5)?

Sodium [(1,2-benzoxazol-3-ylmethyl)sulfonyl]azanide is primarily used in pharmac...

68291-98-5Sodium [(1,2-benzoxa...
Compound Q&A

Are there alternatives to Dimethyl 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2,6-pyridinedicarboxylate (CAS: 741709-66-0) in synthesis?

Dimethyl 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2,6-pyridinedicarboxyla...

741709-66-0Dimethyl 4-(4,4,5,5-...
Compound Q&A

How should waste containing 2-Fluoro-6-hydrazinopyridine (CAS: 80714-39-2) be handled?

Waste containing 2-Fluoro-6-hydrazinopyridine (CAS: 80714-39-2) should be manage...

80714-39-22-Fluoro-6-hydrazino...
Compound Q&A

What is 6-Formyl-2-pyridinecarboxylic acid (CAS: 499214-11-8)?

6-Formyl-2-pyridinecarboxylic acid is an organic compound with the molecular for...

499214-11-86-Formyl-2-pyridinec...
900874-91-13-(3,4-dimethoxyphen...
Compound Q&A

How is 9H-Tribenzo[b,d,f]azepine (CAS: 29875-73-8) typically synthesized?

9H-Tribenzo[b,d,f]azepine is typically synthesized via a multi-step process invo...

29875-73-89H-Tribenzo[b,d,f]az...
Compound Q&A

How is 1-Cyclopropyl-7-ethoxy-6-fluoro-8-methoxy-4-oxo-1,4-dihydro-3-quinolinecarboxylic acid (CAS: 1797982-51-4) typically synthesized?

1-Cyclopropyl-7-ethoxy-6-fluoro-8-methoxy-4-oxo-1,4-dihydro-3-quinolinecarboxyli...

1797982-51-41-Cyclopropyl-7-etho...
Compound Q&A

How should waste containing Methyl 3-oxo-1,2,3,4-tetrahydro-6-quinoxalinecarboxylate (CAS: 671820-52-3) be handled?

Waste containing Methyl 3-oxo-1,2,3,4-tetrahydro-6-quinoxalinecarboxylate (CAS: ...

671820-52-3Methyl 3-oxo-1,2,3,4...
Disclaimer
This page provides academic journal information for reference and research purposes only. We are not affiliated with any journal publishers and do not handle publication submissions. For publication-related inquiries, please contact the respective journal publishers directly.
If you notice any inaccuracies in the information displayed, please contact us at support@chemtradehub.com. We will promptly review and address your concerns.