光谱学与光谱分析 |
|
|
|
|
|
A New Wavelength Selection Algorithm Based on the Fusion of Multiple Models |
HONG Ming-jian1,2,WEN Zhi-yu1 |
1. Micro-Electromechanical System Research Center of Chongqing University, Chongqing 400030, China 2. School of Software Engineering, Chongqing University, Chongqing 400030, China |
|
|
Abstract NIR spectroscopy makes a feature of a large number of wavelengths with a much smaller set of samples. However, some of the wavelengths contribute no information to the modeling. Even worse, they may contain the irrelevant information such as noise and background, which may result in a complex model and/or bad predictive ability of the model. So, it’s important to do research in-depth to eliminate these wavelengths and improve the quality of the final model. The present paper firstly summarizes the variable selection methods based on a single PLS regression model and concludes that (1) the cross-validation can be used to select optimal model with good predictive ability, but the resulting model may be not suitable for selecting variables; (2) selecting variables based on a single regression model is inaccurate and instable because a single vector of regression coefficients may not measure the importance of the variables correctly and may vary with models of different complexity. On basis of this analysis, this paper proposed a new method for variable selection based on the fusion of multiple PLS models. This method fuses the multiple PLS regression coefficients to form a vector, then a threshold is determined to eliminate the variables whose corresponding element in the vector is lower than this threshold. Finally, this method is verified by 3 well-known NIR datasets and compared with the UVE-PLS and GA-PLS algorithms. The experiments show that this method may result in a model with less complexity and/or better predictive ability. Moreover, the proposed method is elegant and efficient and therefore can be put in practical use.
|
Received: 2009-10-09
Accepted: 2010-01-16
|
|
Corresponding Authors:
HONG Ming-jian
E-mail: hongmingjian@gmail.com,hmj@cqu.edu.cn
|
|
[1] LU Wan-zhen, YUAN Hong-fu, XU Guang-tong,et al(陆婉珍,袁洪福,徐广通,等). Modern Analysis Techniques for Near Infrared Spectroscopy(现代近红外光谱分析技术). Beijing: China Petrochemical Press(北京:中国石化出版社), 2000. [2] Martens H,Naes T. Multivariate Calibration, Chichester. UK: John Wiley & Sons, Inc., 1989. [3] Geladi P,Kowalski B R. Analytica Chimica Acta, 1986, 185: 1. [4] Pierna J A F, Abbas O, Baeten V. Analytica Chimica Acta, 2009, 642: 89. [5] Leardi R. Journal of Chemometrics, 2000, 14: 643. [6] Sutter J M,Kalivas J H. Microchemical Journal, 1993, 47: 60. [7] Hageman J A, Streppel M, Wehrens R. Journal of Chemometrics, 2003, 17: 427. [8] Frenich A G, Jouan-Rimbaud D, Massart D L. Analyst, 1995, 120: 2787. [9] Teófilo R F, Martins J A P A, Ferreira M M C. Journal of Chemometrics, 2009, 23: 32. [10] Centner V, Massart D, Noord O E D. Analytical Chemistry, 1996, 68: 3851. [11] Xu H, Liu Z, Cai W. Chemometrics and Intelligent Laboratory Systems, 2009, 97: 189. [12] Benoudjit N, Cools E, Meurens M. Chemometrics and Intelligent Laboratory Systems, 2004, 70: 47. [13] CHEN Xiao-jing, WU Di, YU Jia-jia(陈孝敬, 吴 迪, 虞佳佳). Acta Optica Sinica(光学学报), 2008, 28: 2153. [14] Xu Q,Liang Y. Chemometrics and Intelligent Laboratory Systems, 2001, 56: 1. [15] Eldén L. Computational Statistics & Data Analysis, 2004, 46: 11. [16] Martens H A,Dardenne P. Chemometrics and Intelligent Laboratory Systems, 1998, 44: 99. [17] Snee R D. Technometrics, 1977, 19: 415. [18] Kalivas J H. Chemometrics and Intelligent Laboratory Systems, 1997, 37: 255. |
[1] |
FAN Ping-ping,LI Xue-ying,QIU Hui-min,HOU Guang-li,LIU Yan*. Spectral Analysis of Organic Carbon in Sediments of the Yellow Sea and Bohai Sea by Different Spectrometers[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 52-55. |
[2] |
YANG Chao-pu1, 2, FANG Wen-qing3*, WU Qing-feng3, LI Chun1, LI Xiao-long1. Study on Changes of Blue Light Hazard and Circadian Effect of AMOLED With Age Based on Spectral Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 36-43. |
[3] |
GAO Feng1, 2, XING Ya-ge3, 4, LUO Hua-ping1, 2, ZHANG Yuan-hua3, 4, GUO Ling3, 4*. Nondestructive Identification of Apricot Varieties Based on Visible/Near Infrared Spectroscopy and Chemometrics Methods[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 44-51. |
[4] |
LIU Jia, ZHENG Ya-long, WANG Cheng-bo, YIN Zuo-wei*, PAN Shao-kui. Spectra Characterization of Diaspore-Sapphire From Hotan, Xinjiang[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 176-180. |
[5] |
BAO Hao1, 2,ZHANG Yan1, 2*. Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 148-157. |
[6] |
ZHU Wen-jing1, 2,FENG Zhan-kang1, 2,DAI Shi-yuan1, 2,ZHANG Ping-ping3,JI Wen4,WANG Ai-chen1, 2,WEI Xin-hua1, 2*. Multi-Feature Fusion Detection of Wheat Lodging Information Based on UAV Multispectral Images[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 197-206. |
[7] |
GAO Hong-sheng1, GUO Zhi-qiang1*, ZENG Yun-liu2, DING Gang2, WANG Xiao-yao2, LI Li3. Early Classification and Detection of Kiwifruit Soft Rot Based on
Hyperspectral Image Band Fusion[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 241-249. |
[8] |
BAI Xue-bing1, 2, SONG Chang-ze1, ZHANG Qian-wei1, DAI Bin-xiu1, JIN Guo-jie1, 2, LIU Wen-zheng1, TAO Yong-sheng1, 2*. Rapid and Nndestructive Dagnosis Mthod for Posphate Dficiency in “Cabernet Sauvignon” Gape Laves by Vis/NIR Sectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3719-3725. |
[9] |
WANG Qi-biao1, HE Yu-kai1, LUO Yu-shi1, WANG Shu-jun1, XIE Bo2, DENG Chao2*, LIU Yong3, TUO Xian-guo3. Study on Analysis Method of Distiller's Grains Acidity Based on
Convolutional Neural Network and Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3726-3731. |
[10] |
HE Qing-yuan1, 2, REN Yi1, 2, LIU Jing-hua1, 2, LIU Li1, 2, YANG Hao1, 2, LI Zheng-peng1, 2, ZHAN Qiu-wen1, 2*. Study on Rapid Determination of Qualities of Alfalfa Hay Based on NIRS[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3753-3757. |
[11] |
LI Wei1, TAN Feng2*, ZHANG Wei1, GAO Lu-si3, LI Jin-shan4. Application of Improved Random Frog Algorithm in Fast Identification of Soybean Varieties[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3763-3769. |
[12] |
HU Cai-ping1, HE Cheng-yu2, KONG Li-wei3, ZHU You-you3*, WU Bin4, ZHOU Hao-xiang3, SUN Jun2. Identification of Tea Based on Near-Infrared Spectra and Fuzzy Linear Discriminant QR Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3802-3805. |
[13] |
LIU Xin-peng1, SUN Xiang-hong2, QIN Yu-hua1*, ZHANG Min1, GONG Hui-li3. Research on t-SNE Similarity Measurement Method Based on Wasserstein Divergence[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3806-3812. |
[14] |
LI Qi-chen1, 2, LI Min-zan1, 2*, YANG Wei2, 3, SUN Hong2, 3, ZHANG Yao1, 3. Quantitative Analysis of Water-Soluble Phosphorous Based on Raman
Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3871-3876. |
[15] |
LIANG Jin-xing1, 2, 3, XIN Lei1, CHENG Jing-yao1, ZHOU Jing1, LUO Hang1, 3*. Adaptive Weighted Spectral Reconstruction Method Against
Exposure Variation[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3330-3338. |
|
|
|
|