A bootstrapping soft shrinkage approach and interval random variables selection hybrid model for variable selection in near-infrared spectroscopy

Al-Kaf, H.A.G. and Alduais, N.A.M. and Saad, A.-M.H.Y. and Chia, K.S. and Mohsen, A.M. and Alhussian, H. and Mahdi, A.A.M.H. and Salam, W.S.-I.W. (2020) A bootstrapping soft shrinkage approach and interval random variables selection hybrid model for variable selection in near-infrared spectroscopy. IEEE Access, 8. pp. 168036-168052.

Full text not available from this repository.
Official URL: https://www.scopus.com/inward/record.uri?eid=2-s2....

Abstract

High dimensionality problem in spectra datasets is a significant challenge to researchers and requires the design of effective methods that can extract the optimal variable subset that can improve the accuracy of predictions or classifications. In this study, a hybrid variable selection method, based on the incremental number of variables using bootstrapping soft shrinkage method (BOSS) and interval random variable selection (IRVS) method is proposed and named BOSS-IRVS. The BOSS method is used to determine the informative intervals, while the IRVS method is used to search for informative variables in the informative interval determined by BOSS method. The proposed BOSS-IRVS method was tested using seven different public accessible near-infrared (NIR) spectroscopic datasets of corn, diesel fuel, soy, wheat protein, and hemoglobin types. The performance of the proposed method was compared with that of two outstanding variable selection methods i.e. BOSS and hybrid variable selection strategy based on continuous shrinkage of variable space (VCPA-IRIV). The experimental results showed clearly that the proposed method BOSS-IRVS outperforms VCPA-IRIV and BOSS methods in all tested datasets and improved the percentage of the prediction accuracy, by 15.4 and 15.3 for corn moisture,13.4 and 49.8 for corn oil, 41.5 and 50.6 for corn protein, 12.6 and 5.6 for soy moisture, 0.6 and 6.3 for total diesel fuel, 19.9 and 14.3 for wheat protein, and 5.8 and 20.3 for hemoglobin. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/

Item Type: Article
Impact Factor: cited By 0
Uncontrolled Keywords: Classification (of information); Diesel engines; Diesel fuels; Hemoglobin; Moisture; Near infrared spectroscopy; Proteins; Random variables; Shrinkage, Creative Commons; High dimensionality; Optimal variables; Prediction accuracy; Shrinkage methods; Variable selection; Variable selection methods; Variables selections, Infrared devices
Depositing User: Ms Sharifah Fahimah Saiyed Yeop
Date Deposited: 19 Aug 2021 07:23
Last Modified: 19 Aug 2021 07:23
URI: http://scholars.utp.edu.my/id/eprint/23380

Actions (login required)

View Item
View Item