A Variable Selection Approach of Near Infrared Spectra Based on Window Competitive Adaptive Reweighted Sampling Strategy
LI Pao1, ZHOU Jun2, JIANG Li-wen1, LIU Xia1, DU Guo-rong1,2*
1. College of Food Science and Technology, Hunan Agricultural University, Changsha 410128, China
2. Beijing Work Station, Technology Center, Shanghai Tobacco Group Co., Ltd., Beijing 101121, China
Abstract:Variable selection plays an important role in the quantitative analysis of near infrared spectra. The accuracy of near infrared spectroscopy can be improved by eliminating the redundant variables and selecting the characteristic variables. Competitive adaptive reweighted sampling (CARS) method is a newly developed strategy for wavelength selection by employing the principle “survival of the fittest” on which Darwin’s Evolution Theory is based. The number of selected wavelengths by CARS is much smaller than those of other methods with fast calculating speed and high accuracy. However, it is easy to get inconsistent results between the calibration and validation set due to the excessive attention on the cross validation results. In order to develop a robust variable selection method, by combining the advantages of CARS and “window”, a new tactic called window competitive adaptive reweighted sampling (WCARS) is employed to select characteristic variables and applied to the analysis of the near infrared spectra of the complex plant samples and the contents of the chemical components. Compared with the results of CARS method, accurate quantitative results can be obtained by the WCARS method. Furthermore, the results of correction set are consistent with those of the prediction set, and the problem of overfitting can be avoided. The results show that WCARS tactic can efficiently improve the accuracy and stability of variables selection and optimize the precision of prediction model, which has a certain application value.
李 跑,周 骏,蒋立文,刘 霞,杜国荣. 窗口竞争性自适应重加权采样策略的近红外特征变量选择方法[J]. 光谱学与光谱分析, 2019, 39(05): 1428-1432.
LI Pao, ZHOU Jun, JIANG Li-wen, LIU Xia, DU Guo-rong. A Variable Selection Approach of Near Infrared Spectra Based on Window Competitive Adaptive Reweighted Sampling Strategy. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39(05): 1428-1432.
[1] ZHANG Yi-ting, WANG Cui-cui, FAN Meng-li, et al(张伊挺,王翠翠,樊梦丽,等). Spectroscopy and Spectral Analysis (光谱学与光谱分析), 2017, 36(12): 4100.
[2] Wang C C, Cai W S, Shao X G. Analytical Letters, 2017, 51(4): 537.
[3] Wei H Y, Li H, Liu P, et al. Spectroscopy Letters, 2017, 50(7): 470.
[4] CHU Xiao-li, YUAN Hong-fu, LU Wan-zhen(褚小立,袁洪福,陆婉珍). Progress Chemistry(化学进展), 2004, 16(4): 528.
[5] Tan Z, Lou T T, Huang Z X, et al. Journal of Agricultural and Food Chemistry, 2017, 65: 6274.
[6] Han X, Huang Z X, Chen X D, et al. Fuel, 2017, 207: 146.
[7] Zhang C X, Bian X H, Liu P, et al. Chemometrics and Intelligent Laboratory Systems, 2016, 161: 43.
[8] Saudland A, Wagner J, Nielsen J P, et al. Applied Spectroscopy, 2007, 54(3): 413.
[9] Ranzan C, Trierweiler L F, Hitzmann B, et al. Chemometrics and Intelligent Laboratory Systems, 2015, 142: 78.
[10] Li H D, Liang Y Z, Xu Q S, et al. Analytica Chimica Acta, 2009, 648(1): 77.
[11] Cai W S, Li Y K, Shao X G. Chemometrics and Intelligent Laboratory Systems, 2008, 90(2): 188.
[12] Xu H, Liu Z C, Cai W S, et al. Chemometrics and Intelligent Laboratory Systems, 2009, 97(2): 189.
[13] Han X, Tan Z, Huang Z X, et al. Analytical Methods 2017, 9: 3720.
[14] Nie L X, Dai Z, Ma S C. Analytical Letters, 2016, 49(14): 2259.
[15] Wu L J, Wang B X, Yin Y F, et al. Analytical Letters, 2016, 49(14): 2290.
[16] Li H D, Xu Q S, Liang Y Z. Chemometrics and Intelligent Laboratory Systems, 2018, 176: 34.