Predictive modelling of colossal ATR-FTIR spectral data using PLS-DA: empirical differences between PLS1-DA and PLS2-DA algorithms
Literature Information
Loong Chuen Lee, Abdul Aziz Jemain
In response to our review paper [L. C. Lee et al., Analyst, 2018, 143, 3526–3539], we present a study that compares empirical differences between PLS1-DA and PLS2-DA algorithms in modelling a colossal ATR-FTIR spectral dataset. Over the past two decades, partial least squares-discriminant analysis (PLS-DA) has gained wide acceptance and huge popularity in the field of applied research, partly due to its dimensionality reduction capability and ability to handle multicollinear and correlated variables. To solve a K-class problem (K > 2) using PLS-DA and high-dimensional data like infrared spectra, one can construct either K one-versus-all PLS1-DA models or only one PLS2-DA model. The aim of this work is to explore empirical differences between the two PLS-DA algorithms in modeling a colossal ATR-FTIR spectral dataset. The practical task is to build a prediction model using the imbalanced, high dimensional, colossal and multi-class ATR-FTIR spectra of blue gel pen inks. Four different sub-datasets were prepared from the principal dataset by considering the raw and asymmetric least squares (AsLS) preprocessed forms: (a) Raw-global region; (b) Raw-local region; (c) AsLS-global region; and (d) AsLS-local region. A series of 50 models which includes the first 50 PLS components incrementally was constructed repeatedly using the four sub-datasets. Each model was evaluated using six different variants of v-fold cross validation, autoprediction and external testing methods. As a result, each PLS-DA algorithm was represented by a number of figures of merit. The differences between PLS1-DA and PLS2-DA algorithms were assessed using hypothesis tests with respect to model accuracy, stability and fitting. On the other hand, confusion matrices of the two PLS-DA algorithms were inspected carefully for assessment of model parsimony. Overall, both the algorithms presented satisfactory model accuracy and stability. Nonetheless, PLS1-DA models showed significantly higher accuracy rates than PLS2-DA models, whereas PLS2-DA models seem to be much more stable compared to PLS1-DA models. Eventually, PLS2-DA also proved to be less prone to overfitting and is more parsimonious than PLS1-DA. In conclusion, the relatively high accuracy of the PLS1-DA algorithm is achieved at the cost of rather low parsimony and stability, and with an increased risk of overfitting.
Recommended Journals

Chemical & Pharmaceutical Bulletin

Canadian Metallurgical Quarterly

Advances in Colloid and Interface Science

Journal of the Chinese Chemical Society

Anti-Corrosion Methods and Materials

Corrosion Science

Carbon

Cement and Concrete Research

Journal of the American Chemical Society

Chemistry of Heterocyclic Compounds
Related Literature
Bioconjugation onto biological surfaces with fluorescently labeled polymers
Julien Nicolas, Ezat Khoshdel, David M. Haddleton
DOI: 10.1039/B617596A
Isophthalamides and 2,6-dicarboxamidopyridines with pendant indole groups: a ‘twisted’ binding mode for selective fluoride recognition
Gareth W. Bates, Philip A. Gale, Mark E. Light
DOI: 10.1039/B703905K
Unusual carbon–sulfur bond cleavage in the reaction of a new type of bulky hexathioether with a zerovalent palladium complex
Daisuke Shimizu, Nobuhiro Takeda, Norihiro Tokitoh
DOI: 10.1039/B513339D
The hexamethylpentalene dianion and other reagents for organometallic pentalene chemistry
Andrew E. Ashley, Andrew R. Cowley, Dermot O'Hare
DOI: 10.1039/B702150J
Coordination chemistry of the hexavacant tungstophosphate [H2P2W12O48]12−: synthesis and characterization of iron(iii) complexes derived from the unprecedented {P2W14O54} fragment
Béatrice Godin, Jacqueline Vaissermann, Patrick Herson, Laurent Ruhlmann, Michel Verdaguer, Pierre Gouzerh
DOI: 10.1039/B510434C
Reducing charge recombination losses in solid state dye sensitized solar cells: the use of donor–acceptor sensitizer dyes
Samantha Handa, Helga Wietasch, Mukundan Thelakkat, James R. Durrant, Saif A. Haque
DOI: 10.1039/B618700E
Straightforward construction of diarylmethane skeletons viaaryne insertion into carbon–carbon σ-bonds
Hiroto Yoshida, Masahiko Watanabe, Takami Morishita, Joji Ohshita, Atsutaka Kunai
DOI: 10.1039/B616768C
You might also like
What precautions should be taken when handling lithium chloride hydrate (1:1:1) (CAS: 16712-20-2)?
When handling lithium chloride hydrate (1:1:1) (CAS: 16712-20-2), it is importan...
Is 4-(4H-1,2,4-Triazol-4-yl)piperidine (CAS: 690261-92-8) safe?
4-(4H-1,2,4-Triazol-4-yl)piperidine is generally considered safe for use in phar...
How should waste containing 1,3-Thiazole-2-carboxamide (CAS: 16733-85-0) be handled?
Waste containing 1,3-Thiazole-2-carboxamide (CAS: 16733-85-0) should be collecte...
What regulatory guidelines apply to 5-(Difluoromethyl)-2-fluorobenzonitrile (CAS: 934175-58-3)?
5-(Difluoromethyl)-2-fluorobenzonitrile (CAS: 934175-58-3) is subject to regulat...
How is Methyl 3-acetamido-2-thiophenecarboxylate (CAS: 22288-79-5) typically synthesized?
Methyl 3-acetamido-2-thiophenecarboxylate can be synthesized by the reaction of ...
What is 4-Isoquinolinecarbonitrile (CAS: 34846-65-6)?
4-Isoquinolinecarbonitrile is a chemical compound with the CAS number 34846-65-6...
How should Methyl 1H-1,2,3-triazole-4-carboxylate (CAS: 877309-59-6) be stored?
Store Methyl 1H-1,2,3-triazole-4-carboxylate (CAS: 877309-59-6) in a cool, dry p...
What regulatory guidelines apply to 6-Bromo[1,3]thiazolo[5,4-b]pyridin-2-amine (CAS: 1160791-13-8)?
6-Bromo[1,3]thiazolo[5,4-b]pyridin-2-amine (CAS: 1160791-13-8) is subject to the...
Is (2S,3S)-2-Ammonio-3-(3,4-dihydroxyphenyl)-3-hydroxypropanoate (CAS: 23651-95-8) safe?
(2S,3S)-2-Ammonio-3-(3,4-dihydroxyphenyl)-3-hydroxypropanoate (CAS: 23651-95-8) ...
What are the physical and chemical properties of 7-bromo-3-methyl-3,4-dihydroquinazolin-4-one (CAS: 1293987-84-4)?
7-Bromo-3-methyl-3,4-dihydroquinazolin-4-one is a solid with a crystalline form....
Source Journal
Analyst

Analyst publishes analytical and bioanalytical research that reports premier fundamental discoveries and inventions, and the applications of those discoveries, unconfined by traditional discipline barriers.




