ChemDataWriter: a transformer-based toolkit for auto-generating books that summarise research
Literature Information
Shu Huang
Since the number of scientific papers has grown substantially over recent years, scientists spend much time searching, screening, and reading papers to follow the latest research trends. With the development of advanced natural-language-processing (NLP) models, transformer-based text-generation algorithms have the potential to summarise scientific papers and automatically write a literature review from numerous scientific publications. In this paper, we introduce a Python-based toolkit, ChemDataWriter, which auto-generates books about research in a completely unsupervised fashion. ChemDataWriter adopts a conservative book-generation pipeline to automatically write the book by suggesting potential book content, retrieving and re-ranking the relevant papers, and then summarising and paraphrasing the text within the paper. To the best of our knowledge, ChemDataWriter is the first open-source toolkit in the area of chemistry to be able to compose a literature review entirely via artificial intelligence once one has suggested a broad topic. We also provide an example of a book that ChemDataWriter has auto-generated about battery-materials research. To aid the use of ChemDataWriter, its code is provided with associated documentation to serve as a user guide.
Recommended Journals
Related Literature
Fatigue-resistant photochromic dithienylethenes by controlling the oxidation state
Yong-Chul Jeong, Dae Gyu Park, Eunkyoung Kim, Kwang-Hyun Ahn, Sung Ik Yang
DOI: 10.1039/B600754F
Non-catalytic and template-free growth of aligned CdS nanowires exhibiting high field emission current densities
Yi-Feng Lin, Yung-Jung Hsu, Shih-Yuan Lu, Sheng-Chin Kung
DOI: 10.1039/B604309G
Highly stable cyclic dimers based on non-covalent interactions
Valérie G. H. Lafitte, Abil E. Aliev, Peter N. Horton, Michael B. Hursthouse, Helen C. Hailes
DOI: 10.1039/B600459H
Monitoring the formation of TTF dimers by Na+ complexation
Marc Sallé, Jean-Yves Balandier, Franck Le Derf, Eric Levillain, Magali Allain, Pascal Viel, Serge Palacin
DOI: 10.1039/B518275A
Insulated conducting polymers: manipulating charge transport using supramolecular complexes
Phoebe H. Kwan, Timothy M. Swager
DOI: 10.1039/B508399K
Samarium diiodide-induced intramolecular pinacol coupling of dinitrones: synthesis of cyclic cis-vicinal diamines
Jean-Philippe Ebran, Rita G. Hazell, Troels Skrydstrup
DOI: 10.1039/B511491H
Switching a molecular shuttle on and off: simple, pH-controlled pseudorotaxanes based on cucurbit[7]uril
Vladimir Sindelar, Serena Silvi, Angel E. Kaifer
DOI: 10.1039/B601959E
Fluorescence based strategies for genetic analysis
Rohan T. Ranasinghe, Tom Brown
DOI: 10.1039/B509522K
Synthesis of organic–inorganic hybrid mesoporous tin oxophosphate in the presence of anionic surfactant
Masahiro Fujiwara, Masahiko Matsukata
DOI: 10.1039/B508589F
You might also like
What are the main uses of 1H-Indazole-6-carbonitrile (CAS: 141290-59-7)?
1H-Indazole-6-carbonitrile finds applications in pharmaceuticals, where it serve...
How should waste containing Dioctyl (2E)-2-butenedioate (CAS: 2997-85-5) be handled?
Waste containing Dioctyl (2E)-2-butenedioate (CAS: 2997-85-5) should be collecte...
What industries use Sodium [(1,2-benzoxazol-3-ylmethyl)sulfonyl]azanide (CAS: 68291-98-5)?
Sodium [(1,2-benzoxazol-3-ylmethyl)sulfonyl]azanide is primarily used in pharmac...
Are there alternatives to Dimethyl 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2,6-pyridinedicarboxylate (CAS: 741709-66-0) in synthesis?
Dimethyl 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2,6-pyridinedicarboxyla...
How should waste containing 2-Fluoro-6-hydrazinopyridine (CAS: 80714-39-2) be handled?
Waste containing 2-Fluoro-6-hydrazinopyridine (CAS: 80714-39-2) should be manage...
What is 6-Formyl-2-pyridinecarboxylic acid (CAS: 499214-11-8)?
6-Formyl-2-pyridinecarboxylic acid is an organic compound with the molecular for...
What is the market or research trend for 3-(3,4-dimethoxyphenyl)-2,5-dimethyl-N-(2-morpholin-4-ylethyl)pyrazolo[1,5-a]pyrimidin-7-amine (CAS: 900874-91-1)?
Research trends for this compound indicate a focus on its potential applications...
How is 9H-Tribenzo[b,d,f]azepine (CAS: 29875-73-8) typically synthesized?
9H-Tribenzo[b,d,f]azepine is typically synthesized via a multi-step process invo...
How is 1-Cyclopropyl-7-ethoxy-6-fluoro-8-methoxy-4-oxo-1,4-dihydro-3-quinolinecarboxylic acid (CAS: 1797982-51-4) typically synthesized?
1-Cyclopropyl-7-ethoxy-6-fluoro-8-methoxy-4-oxo-1,4-dihydro-3-quinolinecarboxyli...
How should waste containing Methyl 3-oxo-1,2,3,4-tetrahydro-6-quinoxalinecarboxylate (CAS: 671820-52-3) be handled?
Waste containing Methyl 3-oxo-1,2,3,4-tetrahydro-6-quinoxalinecarboxylate (CAS: ...















