|
|
|
|
|
|
Elastic Net Modeling for Near Infrared Spectroscopy |
ZHENG Nian-nian, LUAN Xiao-li*, LIU Fei |
Key Laboratory for Advanced Process Control of Light Industry of Ministry of Education,Institute of Automation,Jiangnan University,Wuxi 214122,China |
|
|
Abstract It is important and challenging to select the variable for the spectral information automatically and establish a sparse linear model between the spectrum and the sample content under the circumstance that the near-infrared spectral information is much larger than the sample size. In this paper, Elastic Net was used for the measurement of o-cresol in the polyphenylene ether by utilizing the near infrared spectroscopy and a quantitative calibration model between near infrared spectroscopy and o-cresol content was established. Then, the model prediction effect is compared with the Lasso method. In the case where the number of variables is much larger than that of the samples. Although Lasso method can achieve variable selection, the prediction accuracy of the model is affected due to excessive compression to variable coefficients. Elastic Net avoids excessive censorship by increasing L2 penalty, which can improve model prediction accuracy. In order to verifymodel performance indicators ofElastic Net method, we use the complex correlation coefficient R2 and the adjusted complex correlation coefficient R2a to evaluate the interpretability of the model, meanwhile, the prediction accuracy of the model is evaluated by using the mean relative prediction error MRPE and the prediction correlation coefficient Rp. Lasso method to establish the model performance indicators are: R2=0.94,R2a=0.93,MRPE=4.51%,Rp=0.96;Elastic Net method performance indicators are: R2=0.97,R2a=1,MRPE=3.25%,Rp=0.98. From the result we could draw the conclusion that Elastic Net’s model is better than Lasso method. A sparse linear model with higher interpretability and high prediction accuracy can be obtained by the Elastic Net regression.
|
Received: 2017-10-07
Accepted: 2018-02-14
|
|
Corresponding Authors:
LUAN Xiao-li
E-mail: xlluan@jiangnan.edu.cn
|
|
[1] LU Wan-zhen(陆婉珍). Modern Near Infrared Spectroscopy Analytical Technology,2nd Ed.(现代近红外光谱分析技术, 第2版). Beijing: China Petrochemical Press(北京:中国石化出版社),2010.
[2] Huang X Y,Teye E,Sam-Amoah L K,et al. Analytical Methods,2014,6(14):5008.
[3] SHI Ting,LUAN Xiao-li,LIU Fei(史 婷,栾小丽,刘 飞). Spectroscopy and Spectral Analysis(光谱学与光谱分析),2017, 37(4): 1058.
[4] TANG Shou-peng,YAO Xin-feng,YAO Xia,et al(汤守鹏,姚鑫锋,姚 霞,等). Chinese Journal of Analytical Chemistry(分析化学),2009, 37(10): 1445.
[5] Zou H,Hastie T. Journal of the Royal Statistical Society. Series B(Methodological),2005, 67(1):301.
[6] Tibshirani R. Journal of the Royal Statistical Society. Series B(Methodological),1996, 15(1):267.
[7] LIU Liu,TAO Da-cheng(刘 柳,陶大程). Journal of Data Acquisition and Processing(数据采集与处理),2015, 30(1):35.
[8] HE Xiao-qun,LIU Wen-qing(何晓群,刘文卿). Applied Regression Analysis(应用回归分析). Beijing:China Renmin University Press(北京:中国人民大学出版社),2015.
[9] ZHANG Yu(张 玉). Journal of Jiangsu Institute of Education·Natural Sciences(江苏教育学院学报·自然科学),2012, 28(3):28.
[10] LIU Ran,YANG Xiao-li,XU Yun-hui,et al(刘 冉,杨晓丽,徐云惠,等). Guangzhou Chemical Industry(广州化工),2013,41(15):128.
[11] WANG Xin(王 欣). Science & Technology Information(科技资讯),2013, 15: 2.
[12] Tong Peijin,Du Yiping,Zheng Kaiyi,et al. Chemometrics and Intelligent Laboratory Systems,2015, 143: 40.
[13] Khan M H R, Shaw J E H. Statistics and Computing, 2016, 26(3): 725.
[14] WANG Da-rong,ZHANG Zhong-zhan(王大荣,张忠占). Journal of Applied Statistics and Management(数理统计与管理), 2010, 29(4):616.
|
[1] |
GAO Feng1, 2, XING Ya-ge3, 4, LUO Hua-ping1, 2, ZHANG Yuan-hua3, 4, GUO Ling3, 4*. Nondestructive Identification of Apricot Varieties Based on Visible/Near Infrared Spectroscopy and Chemometrics Methods[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 44-51. |
[2] |
BAO Hao1, 2,ZHANG Yan1, 2*. Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 148-157. |
[3] |
HU Cai-ping1, HE Cheng-yu2, KONG Li-wei3, ZHU You-you3*, WU Bin4, ZHOU Hao-xiang3, SUN Jun2. Identification of Tea Based on Near-Infrared Spectra and Fuzzy Linear Discriminant QR Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3802-3805. |
[4] |
LIU Xin-peng1, SUN Xiang-hong2, QIN Yu-hua1*, ZHANG Min1, GONG Hui-li3. Research on t-SNE Similarity Measurement Method Based on Wasserstein Divergence[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3806-3812. |
[5] |
BAI Xue-bing1, 2, SONG Chang-ze1, ZHANG Qian-wei1, DAI Bin-xiu1, JIN Guo-jie1, 2, LIU Wen-zheng1, TAO Yong-sheng1, 2*. Rapid and Nndestructive Dagnosis Mthod for Posphate Dficiency in “Cabernet Sauvignon” Gape Laves by Vis/NIR Sectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3719-3725. |
[6] |
WANG Qi-biao1, HE Yu-kai1, LUO Yu-shi1, WANG Shu-jun1, XIE Bo2, DENG Chao2*, LIU Yong3, TUO Xian-guo3. Study on Analysis Method of Distiller's Grains Acidity Based on
Convolutional Neural Network and Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3726-3731. |
[7] |
LUO Li, WANG Jing-yi, XU Zhao-jun, NA Bin*. Geographic Origin Discrimination of Wood Using NIR Spectroscopy
Combined With Machine Learning Techniques[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3372-3379. |
[8] |
ZHANG Shu-fang1, LEI Lei2, LEI Shun-xin2, TAN Xue-cai1, LIU Shao-gang1, YAN Jun1*. Traceability of Geographical Origin of Jasmine Based on Near
Infrared Diffuse Reflectance Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3389-3395. |
[9] |
YANG Qun1, 2, LING Qi-han1, WEI Yong1, NING Qiang1, 2, KONG Fa-ming1, ZHOU Yi-fan1, 2, ZHANG Hai-lin1, WANG Jie1, 2*. Non-Destructive Monitoring Model of Functional Nitrogen Content in
Citrus Leaves Based on Visible-Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3396-3403. |
[10] |
HUANG Meng-qiang1, KUANG Wen-jian2, 3*, LIU Xiang1, HE Liang4. Quantitative Analysis of Cotton/Polyester/Wool Blended Fiber Content by Near-Infrared Spectroscopy Based on 1D-CNN[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3565-3570. |
[11] |
HUANG Zhao-di1, CHEN Zai-liang2, WANG Chen3, TIAN Peng2, ZHANG Hai-liang2, XIE Chao-yong2*, LIU Xue-mei4*. Comparing Different Multivariate Calibration Methods Analyses for Measurement of Soil Properties Using Visible and Short Wave-Near
Infrared Spectroscopy Combined With Machine Learning Algorithms[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3535-3540. |
[12] |
KANG Ming-yue1, 3, WANG Cheng1, SUN Hong-yan3, LI Zuo-lin2, LUO Bin1*. Research on Internal Quality Detection Method of Cherry Tomatoes Based on Improved WOA-LSSVM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3541-3550. |
[13] |
HUANG Hua1, LIU Ya2, KUERBANGULI·Dulikun1, ZENG Fan-lin1, MAYIRAN·Maimaiti1, AWAGULI·Maimaiti1, MAIDINUERHAN·Aizezi1, GUO Jun-xian3*. Ensemble Learning Model Incorporating Fractional Differential and
PIMP-RF Algorithm to Predict Soluble Solids Content of Apples
During Maturing Period[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3059-3066. |
[14] |
CHEN Jia-wei1, 2, ZHOU De-qiang1, 2*, CUI Chen-hao3, REN Zhi-jun1, ZUO Wen-juan1. Prediction Model of Farinograph Characteristics of Wheat Flour Based on Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3089-3097. |
[15] |
GUO Ge1, 3, 4, ZHANG Meng-ling3, 4, GONG Zhi-jie3, 4, ZHANG Shi-zhuang3, 4, WANG Xiao-yu2, 5, 6*, ZHOU Zhong-hua1*, YANG Yu2, 5, 6, XIE Guang-hui3, 4. Construction of Biomass Ash Content Model Based on Near-Infrared
Spectroscopy and Complex Sample Set Partitioning[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3143-3149. |
|
|
|
|