Predictive modelling of colossal ATR-FTIR spectral data using PLS-DA: empirical differences between PLS1-DA and PLS2-DA algorithms

Literature Information

Publication Date: 2019-02-21
DOI: 10.1039/C8AN02074D
Impact Factor: 4.616
Authors: Loong Chuen Lee, Abdul Aziz Jemain


Abstract

In response to our review paper [L. C. Lee et al., Analyst, 2018, 143, 3526–3539], we present a study that compares the empirical differences between the PLS1-DA and PLS2-DA algorithms in modelling a colossal ATR-FTIR spectral dataset. Over the past two decades, partial least squares-discriminant analysis (PLS-DA) has gained wide acceptance in applied research, partly because of its dimensionality reduction capability and its ability to handle multicollinear variables. To solve a K-class problem (K > 2) using PLS-DA and high-dimensional data such as infrared spectra, one can construct either K one-versus-all PLS1-DA models or a single PLS2-DA model. The aim of this work is to explore the empirical differences between the two algorithms on a practical task: building a prediction model from the imbalanced, high-dimensional, colossal and multi-class ATR-FTIR spectra of blue gel pen inks. Four sub-datasets were prepared from the principal dataset by considering its raw and asymmetric least squares (AsLS) preprocessed forms: (a) Raw-global region; (b) Raw-local region; (c) AsLS-global region; and (d) AsLS-local region. For each sub-dataset, a series of 50 models was constructed, with the number of PLS components increased incrementally from 1 to 50. Each model was evaluated using six different variants of v-fold cross-validation, autoprediction and external testing methods. As a result, each PLS-DA algorithm was represented by a number of figures of merit. The differences between the PLS1-DA and PLS2-DA algorithms were assessed using hypothesis tests with respect to model accuracy, stability and fitting. In addition, the confusion matrices of the two algorithms were inspected to assess model parsimony. Overall, both algorithms presented satisfactory model accuracy and stability. Nonetheless, PLS1-DA models showed significantly higher accuracy rates than PLS2-DA models, whereas PLS2-DA models appear to be much more stable. PLS2-DA also proved less prone to overfitting and more parsimonious than PLS1-DA. In conclusion, the relatively high accuracy of the PLS1-DA algorithm is achieved at the cost of rather low parsimony and stability, and with an increased risk of overfitting.

Source Journal

Analyst
CiteScore: 7.8
Self-citation Rate: 5.6%
Articles per Year: 653

Analyst publishes analytical and bioanalytical research that reports premier fundamental discoveries and inventions, and the applications of those discoveries, unconfined by traditional discipline barriers.
