|
|
|
|
|
|
Least Angle Regression Combined With Competitive Adaptive Re-Weighted Sampling for NIR Spectral Wavelength Selection |
LU Hao-xiang1, ZHANG Jing2, LI Ling-qiao1*, LIU Zhen-bing1, YANG Hui-hua1,3, FENG Yan-chun4, YIN Li-hui4 |
1. College of Computer and Information Security, Guilin University of Electronic Technology, Guilin 541004, China
2. College of Business, Guilin University of Electronic Technology, Guilin 541004, China
3. College of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
4. National Institutes for Food and Drug Control, Beijing 100050, China |
|
|
Abstract Near-infrared spectroscopy is widely used in drug detection, petrochemical industry, etc., because it has no damage to the samples, and the detection speed is fast, and the accuracy is high. In particular, it has more accurate detection performance with the in-depth application of machine learning and deep learning modeling methods in recent years makes. However, the NIR spectral data of the sample has relatively high dimensions and has problems such as spectral overlap, collinearity and noise, which will negatively impact the performance of the NIR spectral model. In this case, the selection of effective characteristic wavelength points of the sample is extremely important. In order to improve the accuracy and reliability of the quantitative and qualitative analysis models of NIR spectra, a variable selection method for NIR spectra is proposed, which combines the advantages of Least Angle Regression and Competitive Adaptive Re-weighted Sampling, and has better performance. In this method, LAR was used to preliminarily screen the characteristic wavelengths in the whole spectrum of the sample, and then CARS was used to further select the selected characteristic wavelengths to effectively remove the irrelevant characteristic wavelengths. In order to verify the effectiveness of the method, the method was evaluated from two aspects of quantitative and qualitative analysis. In the quantitative analysis experiment, PLS regression analysis model was established using FULL, LAR, CARS, SPA and UVE as comparison methods and drug sample data set as example. PLS model established by variables screened by LAR-CARS showed higher predictive determination coefficient and lower predictive standard deviation in drug data set. In the qualitative analysis experiment, the classification model was established with SVM, ELM, SWELM and BP as comparison methods and drug data sets with different proportions of training sets. The accuracy of the SVM classification model established by the variables screened by LAR-CARS reached the highest 100%. From the experimental results, it can be seen that LAR-CARS can effectively select the wavelength points that the characteristics of the sample, and the quantitative and qualitative analysis model established by using the selected wavelength points has better robustness and can be used for the characteristic wavelength screening of the sample spectrum.
|
Received: 2020-07-01
Accepted: 2020-10-28
|
|
Corresponding Authors:
LI Ling-qiao
E-mail: 54pe@163.com
|
|
[1] Yazici A, Tiryaki G Y, Ayvaz H, et al. Journal of the Science of Food and Agriculture, 2020, 100(5): 1980.
[2] Greene T P, Gullysantiago M A, Barsony M. The Astrophysical Journal, 2018, 862(1): 85.
[3] Li S, Xing B, Lin D, et al. Industrial Crops and Products, 2020, 152(11): 112539.
[4] ZHANG Feng, TANG Xiao-jun, TONG Ang-xin, et al(张 峰,汤晓君,仝昂鑫,等). Chinese Journal of Scientific Instrument(仪器仪表学报), 2020, 41(1): 64.
[5] Felipe C B, Espinosa R M, Hevia G, et al. Journal of Sports Sciences, 2019, 37(23): 1.
[6] Liu J, Zhang Y, Wang H, et al. Spectrochimica Acta Part A: Molecular & Biomolecular Spectroscopy, 2018, 199(21): 43.
[7] Wang Y, Guo W, Zhu X, et al. International Journal of Food Science & Technology, 2019, 54(2): 387.
[8] Tsakiridis N L, Tziolas N V, Theocharis J B, et al. European Journal of Soil Science, 2019, 70(3): 578.
[9] WANG Kun, WU Jing-zhu, WANG Dong, et al(王 坤,吴静珠,王 冬,等). Journal of Food Safety and Quality Detection Technology(食品安全质量检测学报),2020,11(16):5569.
[10] LI Xin-xing, YAO Jiu-bin, CHENG Jian-hong, et al(李鑫星, 姚久彬, 成建红, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2020, 40(1): 189.
[11] ZHAO Huan, HUAN Ke-wei, SHI Xiao-guang, et al(赵 环, 宦克为, 石晓光, 等). Chinese Journal of Analytical Chemistry(分析化学), 2018, 65(1): 136.
[12] Chen F C, Jahanshahi M R. IEEE Transactions on Industrial Electronics, 2018, 65(5): 4392.
[13] Zhang J, Lu Z, Li M, et al. IEEE Access, 2019, 7(1): 183118.
[14] YANG Zhen-fa, XIAO Hang, ZHANG Lei, et al(杨振发,肖 航,张 雷,等). Chinese Journal of Analytical Chemistry(分析化学),2020,48(2):275.
[15] Krongchai C, Wongsaipun S, Funsueb S, et al. Chiang Mai Journal of Science, 2020, 41(1): 160.
[16] Lu B, Liu N, Li H, et al. Soil & Tillage Research, 2019, 191(12): 266.
[17] Zhang R, Zhang F, Chen W, et al. Chemometrics & Intelligent Laboratory Systems, 2018, 175(11): 47. |
[1] |
GAO Feng1, 2, XING Ya-ge3, 4, LUO Hua-ping1, 2, ZHANG Yuan-hua3, 4, GUO Ling3, 4*. Nondestructive Identification of Apricot Varieties Based on Visible/Near Infrared Spectroscopy and Chemometrics Methods[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 44-51. |
[2] |
BAO Hao1, 2,ZHANG Yan1, 2*. Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 148-157. |
[3] |
HU Cai-ping1, HE Cheng-yu2, KONG Li-wei3, ZHU You-you3*, WU Bin4, ZHOU Hao-xiang3, SUN Jun2. Identification of Tea Based on Near-Infrared Spectra and Fuzzy Linear Discriminant QR Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3802-3805. |
[4] |
LIU Xin-peng1, SUN Xiang-hong2, QIN Yu-hua1*, ZHANG Min1, GONG Hui-li3. Research on t-SNE Similarity Measurement Method Based on Wasserstein Divergence[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3806-3812. |
[5] |
BAI Xue-bing1, 2, SONG Chang-ze1, ZHANG Qian-wei1, DAI Bin-xiu1, JIN Guo-jie1, 2, LIU Wen-zheng1, TAO Yong-sheng1, 2*. Rapid and Nndestructive Dagnosis Mthod for Posphate Dficiency in “Cabernet Sauvignon” Gape Laves by Vis/NIR Sectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3719-3725. |
[6] |
WANG Qi-biao1, HE Yu-kai1, LUO Yu-shi1, WANG Shu-jun1, XIE Bo2, DENG Chao2*, LIU Yong3, TUO Xian-guo3. Study on Analysis Method of Distiller's Grains Acidity Based on
Convolutional Neural Network and Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3726-3731. |
[7] |
LUO Li, WANG Jing-yi, XU Zhao-jun, NA Bin*. Geographic Origin Discrimination of Wood Using NIR Spectroscopy
Combined With Machine Learning Techniques[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3372-3379. |
[8] |
ZHANG Shu-fang1, LEI Lei2, LEI Shun-xin2, TAN Xue-cai1, LIU Shao-gang1, YAN Jun1*. Traceability of Geographical Origin of Jasmine Based on Near
Infrared Diffuse Reflectance Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3389-3395. |
[9] |
YANG Qun1, 2, LING Qi-han1, WEI Yong1, NING Qiang1, 2, KONG Fa-ming1, ZHOU Yi-fan1, 2, ZHANG Hai-lin1, WANG Jie1, 2*. Non-Destructive Monitoring Model of Functional Nitrogen Content in
Citrus Leaves Based on Visible-Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3396-3403. |
[10] |
HUANG Meng-qiang1, KUANG Wen-jian2, 3*, LIU Xiang1, HE Liang4. Quantitative Analysis of Cotton/Polyester/Wool Blended Fiber Content by Near-Infrared Spectroscopy Based on 1D-CNN[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3565-3570. |
[11] |
HUANG Zhao-di1, CHEN Zai-liang2, WANG Chen3, TIAN Peng2, ZHANG Hai-liang2, XIE Chao-yong2*, LIU Xue-mei4*. Comparing Different Multivariate Calibration Methods Analyses for Measurement of Soil Properties Using Visible and Short Wave-Near
Infrared Spectroscopy Combined With Machine Learning Algorithms[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3535-3540. |
[12] |
KANG Ming-yue1, 3, WANG Cheng1, SUN Hong-yan3, LI Zuo-lin2, LUO Bin1*. Research on Internal Quality Detection Method of Cherry Tomatoes Based on Improved WOA-LSSVM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3541-3550. |
[13] |
HUANG Hua1, LIU Ya2, KUERBANGULI·Dulikun1, ZENG Fan-lin1, MAYIRAN·Maimaiti1, AWAGULI·Maimaiti1, MAIDINUERHAN·Aizezi1, GUO Jun-xian3*. Ensemble Learning Model Incorporating Fractional Differential and
PIMP-RF Algorithm to Predict Soluble Solids Content of Apples
During Maturing Period[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3059-3066. |
[14] |
CHEN Jia-wei1, 2, ZHOU De-qiang1, 2*, CUI Chen-hao3, REN Zhi-jun1, ZUO Wen-juan1. Prediction Model of Farinograph Characteristics of Wheat Flour Based on Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3089-3097. |
[15] |
GUO Ge1, 3, 4, ZHANG Meng-ling3, 4, GONG Zhi-jie3, 4, ZHANG Shi-zhuang3, 4, WANG Xiao-yu2, 5, 6*, ZHOU Zhong-hua1*, YANG Yu2, 5, 6, XIE Guang-hui3, 4. Construction of Biomass Ash Content Model Based on Near-Infrared
Spectroscopy and Complex Sample Set Partitioning[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3143-3149. |
|
|
|
|