Abstract:The varieties of cabbage seeds directly affect the yield and quality of cabbage, in order to rapidly and nondestructively identify the varieties of cabbage seeds, near infrared spectra technique were applied in this study and reflectance spectrum of the cabbage seeds was obtained. Firstly, to excavate the effective information in the spectral data and improve signal to noise ratio, the raw spectra was pre-processed with the method of standard normal variate (SNV) and multiplicative scatter correction (MSC). Secondly, principal component analysis (PCA) was used to analyze the clustering of cabbage samples, then the characteristic differentia of three cabbage varieties was obtained through qualitative analysis. Six Effective wavelengths were selected by successive projections algorithm (SPA). Finally, the full spectra variable, the first three principal components (PCs) using PCA and selected effective wavelengths using SPA were respectively set as inputs of the partial least squares discriminant analysis (PLS-DA) and least-squares support vector machine (LS-SVM) models for the classification of cabbage seeds. As can be seen from the two dimensional plot drawn with the scores of PC1 and PC2 (the first two principle components), PC1 and PC2 had a good clustering effect for different kinds of cabbage seeds. LS-SVM models performed better than PLS-DA models, the correct rates of discrimination were 100% achieved with LS-SVM models. PLS-DA and LS-SVM models built based on the selected wavelengths performed better than the models built based on the first three principal components, moreover, the SPA-LS-SVM model obtained the best results among all models, with 100% discrimination accuracy for both the calibration set and the prediction set. The overall results show that SPA can extract wavelengths, and the LS-SVM model combined with SPA can obtain optimal classification results. So the present paper could offer an alternate approach for the rapid discrimination of cabbage seeds variety.
Key words:Near infrared spectral;Principal component analysis (PCA);Successive projections algorithm (SPA);Partial least squares discriminant analysis (PLS-DA);Least-squares support vector machine (LS-SVM)
罗 微,杜焱喆*,章海亮. PCA和SPA的近红外光谱识别白菜种子品种研究[J]. 光谱学与光谱分析, 2016, 36(11): 3536-3541.
LUO Wei, DU Yan-zhe*, ZHANG Hai-liang. Discrimination of Varieties of Cabbage with Near Infrared Spectra Based on Principal Component Analysis and Successive Projections Algorithm. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2016, 36(11): 3536-3541.
[1] Monti L L, Bustamante C A, Osorio S, et al. Food Chemistry, 2016, 190: 879. [2] Heras-Roger J, Díaz-Romero C, Darias-Martín J. Food Chemistry, 2016, 196: 1224. [3] Jakobek L, Barron A R. Journal of Food Composition and Analysis, 2016, 45: 9. [4] ZHANG Chu,LIU Fei,KONG Wen-wen, et al(张 初,刘 飞,孔汶汶, 等). Transactions of the Chinese Society of Agricultural Engineering(农业工程学报), 2013,(20): 270. [5] BAO Yi-dan,CHEN Na,HE Yong, et al(鲍一丹,陈 纳,何 勇, 等). Optics and Precision Engineering(光学精密工程), 2015, (2): 349. [6] Cai J, Wang Y, Xi X, et al. International Journal of Biological Macromolecules, 2015, 78: 439. [7] LIU Wei,LIU Chang-hong,ZHENG Lei(刘 伟,刘长虹,郑 磊). Transactions of the Chinese Society of Agricultural Engineering(农业工程学报), 2014,(10): 145. [8] Miskelly D M, Wrigley C W. Identification of Varieties of Food Grains: Elsevier, 2016. [9] YANG Hang,ZHANG Li-fu,TONG Qing-xi(杨 杭,张立福,童庆禧). Infrared and Laser Engineering(红外与激光工程), 2013,(9): 2437. [10] Mazivila S J, de Santana F B, Mitsutake H, et al. Fuel, 2015, 142: 222. [11] Cheng J, Sun D. LWT-Food Science and Technology, 2015, 63(2): 892. [12] FANG Xiao-rong,ZHANG Hai-liang,HUANG Ling-xia, et al(方孝荣,章海亮,黄凌霞, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2015,35(5): 1248.