Multiple Dependent Variables LS-SVM Regression Algorithm and Its Application in NIR Spectral Quantitative Analysis
AN Xin1,3,XU Shuo2,ZHANG Lu-da1*,SU Shi-guang1
1. College of Science, China Agricultural University,Beijing 100094, China 2. College of Information and Electrical Engineering, China Agricultural University,Beijing 100083, China 3. School of International Trade and Economics, University of International Business and Economics, Beijing 100029, China
Abstract:In the present paper, on the basis of LS-SVM algorithm, we built a multiple dependent variables LS-SVM (MLS-SVM) regression model whose weights can be optimized, and gave the corresponding algorithm. Furthermore, we theoretically explained the relationship between MLS-SVM and LS-SVM. Sixty four broomcorn samples were taken as experimental material, and the sample ratio of modeling set to predicting set was 51∶13. We first selected randomly and uniformly five weight groups in the interval [0, 1], and then in the way of leave-one-out (LOO) rule determined one appropriate weight group and parameters including penalizing parameters and kernel parameters in the model according to the criterion of the minimum of average relative error. Then a multiple dependent variables quantitative analysis model was built with NIR spectrum and simultaneously analyzed three chemical constituents containing protein, lysine and starch. Finally, the average relative errors between actual values and predicted ones by the model of three components for the predicting set were 1.65%, 6.47% and 1.37%, respectively, and the correlation coefficients were 0.994 0, 0.839 2 and 0.882 5, respectively. For comparison, LS-SVM was also utilized, for which the average relative errors were 1.68%, 6.25% and 1.47%, respectively, and the correlation coefficients were 0.994 1, 0.831 0 and 0.880 0, respectively. It is obvious that MLS-SVM algorithm is comparable to LS-SVM algorithm in modeling analysis performance, and both of them can give satisfying results. The result shows that the model with MLS-SVM algorithm is capable of doing multi-components NIR quantitative analysis synchronously. Thus MLS-SVM algorithm offers a new multiple dependent variables quantitative analysis approach for chemometrics. In addition, the weights have certain effect on the prediction performance of the model with MLS-SVM, which is consistent with our intuition and is validated in this study. Therefore, it is necessary to optimize weights in multiple dependent variables NIR modeling analysis.
[1] PU Rui-liang, GONG Peng(蒲瑞良,宫 鹏). High-spectrum Remote Sensing and Application (高光谱遥感及其应用). Beijing: Higher Education Press(北京:高等教育出版社), 2000. 18. [2] YAN Yan-lu, ZHAO Long-lian, HAN Dong-hai,et al(严衍禄,赵龙莲,韩东海,等). Foundation of Near Infrared Spectral Analysis and its Application(近红外光谱分析基础与应用). Beijing: China Light Industry Press(北京:中国轻工业出版社), 2005. [3] ZHANG Lu-da, JIN Ze-chen, SHEN Xiao-nan, et al(张录达,金泽宸,沈晓南,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2005, 25(9): 1400. [4] ZHANG Lu-da, SU Shi-guang, WANG Lai-sheng, et al(张录达,苏时光,王来生,等). Spectroscopy and Spectral Analysis (光谱学与光谱分析), 2005, 25(1): 33. [5] MA Qun, HAO Gui-qi, QIAO Yan-jiang, et al(马 群,郝贵奇,乔延江,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2006, 26(10): 1842. [6] LIN Ji-peng, LIU Jun-hua(林继鹏,刘君华). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2006, 26(12): 2232. [7] AN Xin, SU Shi-guang, WANG Tao, et al(安 欣,苏时光,王 韬,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2007, 27(8): 1619. [8] Suykens J A K, Vandewalle J. Neural Processing Letter, 1999, 9(3): 293. [9] YAN Hui, ZHANG Xue-gong, LI Yan-da (阎 辉, 张学工, 李衍达). Journal of Tsinghua University(Science and Technology)(清华大学学报·自然科学版), 2001, 41(9): 77. [10] YU Ke, CHENG Yi-yu(虞 科,程翼宇). Chinese Joumal of Analytical Chemistry(分析化学), 2006, 34(4): 561. [11] YAN Wei-wu, ZHU Hong-dong, SHAO Hui-he(阎威武, 朱宏栋, 邵惠鹤). Journal of System Simulation(系统仿真学报), 2003, 15(10): 1494. [12] CAI Dong-song, JING Ji-peng (蔡冬松, 靖继鹏). Information Science(情报科学), 2005, 23(12): 1877. [13] ZHANG Xiao-hui, ZHU Jia-yuan, ZHANG Heng-xi(张晓晖,朱家元,张恒喜). Computer Engineering and Applications(计算机工程与应用),2004, 40(27): 203. [14] Shawe-Taylor John, Cristianini Nello. Kernel Methods for Pattern Analysis. Cambridge University Press, 2004. 75. [15] Bernhard Schlkopf, Alexander J Smola. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. Cambridge, MA: MIT Press, 2002. 408. [16] Hsu Chih-Wei, Chang Chih-Chung, Lin Chih-Jen. A Practical Guide to Support Vector Classification. Available[online]: http://www.csie.ntu.edu.tw/~cjlin/papers/guide/guide.pdf.