Abstract:Elastic net is an improvement of the least-squares method by introducing in L1 and L2 penalties, and it has the advantages of the variable selection. The quantitative analysis model build by Elastic net can improve the prediction accuracy. Using 89 wheat samples as the experiment material, the spectrum principal components of the samples were selected by Elastic net. The analysis model was established for the near-infrared spectrum and the wheat’s protein content, and the feasibility of using Elastic net to establish the quantitative analysis model was confirmed. In experiment, the 89 wheat samples were randomly divided into two groups, with 60 samples being the model set and 29 samples being the prediction set. The 60 samples were used to build analysis model to predict the protein contents of the 29 samples, and correlation coefficient (R) of the predicted value and chemistry observed value was 0.984 9, with the mean relative error being 2.48%. To further investigate the feasibility and stability of the model, the 89 samples were randomly selected five times, with 60 samples to be model set and 29 samples to be prediction set. The five groups of principal components which were selected by Elastic net for building model were basically consistent, and compared with the PCR and PLS method, the model prediction accuracies were all better than PCR and similar with PLS. In view of the fact that Elastic net can realize the variable selection and the model has good prediction, it was shown that Elastic net is suitable method for building chemometrics quantitative analysis model.
陈万会,刘旭华,何雄奎,闵顺耕,张录达* . 基于Elastic net主成分优选的近红外光谱定量分析模型 [J]. 光谱学与光谱分析, 2010, 30(11): 2932-2935.
CHEN Wan-hui, LIU Xu-hua, HE Xiong-kui, MIN Shun-geng, ZHANG Lu-da* . Near-Infrared Spectrum Quantitative Analysis Model Based on Principal Components Selected by Elastic Net . SPECTROSCOPY AND SPECTRAL ANALYSIS, 2010, 30(11): 2932-2935.
[1] YAN Yan-lu, ZHAO Long-lian, ZHANG Lu-da, et al(严衍禄,赵龙莲,张录达, 等). Near-Infrared Spectroscopy Fundamentals and Applications(近红外光谱分析基础与应用). Beijing:China Light Industry Press(北京: 中国轻工业出版社),2005. [2] FTNIRDRSA Study Group(FTNIRDRSA研究组). Journal of Beijing Agricultural University (北京农业大学学报), 1990, 16(增刊): 1. [3] LIU Ke, LIU Bao-guo, ZHANG Yun-sen(刘 珂,刘保国,张运森). Journal of Henan University of Technology·Natural Science Edition(河南工业大学学报·自然科学版), 2009, 30(1), 88. [4] LI Wei, XIAO Ai-ping, LENG Juan(李 伟, 肖爱平, 冷 鹃). Chinese Agricultural Science Bulletin(中国农学通报), 2009, 25(3): 56. [5] DONG Shou-long, REN Qian, HUANG You-zhi(董守龙, 任 芊, 黄友之). Analysis and Detection (分析与检测), 2004, 11(4): 44. [6] WANG Tao, ZHANG Lu-da, LAO Cai-lian, et al(王 韬,张录达, 劳彩莲,等). Journal of China Agricultural University(中国农业大学学报),2004,9(6):76. [7] WANG Hui-wen(王惠文). Partial Least-Squares Regression Method and Its Applications(偏最小二乘回归方法及其应用). Beijing: National Defense Industry Press(北京:国防工业出版社),1999. [8] QI Xiao-ming, ZHANG Lu-da(齐小明,张录达). Journal of Beijing Agricultural College(北京农学院学报), 1999, 14(2):45. [9] LIAO Bu-yan, ZHANG Zheng-zhu, XIA Tao, et al(廖步岩,张正竹,夏 涛, 等). Journal of Anhui Agricultural University(安徽农业大学学报), 2009, 36(2):287. [10] Hui Zou,Trevor Hastie. J. Royal Statist, Soc. B,2005,67:301. [11] Hoerl A, Kennard R. Encyclopedia of Statistical Sciences, 1998, 8:129. [12] Tibshirani R. J. R. Statist. Soc. B,1996, 58:267. [13] LI Xia, LIU Chao(李 霞,刘 超). Statistics and Decision(统计与决策),2008,5:30. [14] Eric B, Trevor H, Debashis P, et al. Journal of the American Statistical Association, 2006, 101(473):119.