Vis-NIR Spectroscopic Pattern Recognition Combined with SG Smoothing Applied to Breed Screening of Transgenic Sugarcane
LIU Gui-song1, GUO Hao-song1, PAN Tao1*, WANG Ji-hua2, CAO Gan2
1. Key Laboratory of Optoelectronic Information and Sensing Technologies of Guangdong Higher Educational Institutes, Jinan University, Guangzhou 510632, China 2. Research Institute of Crops, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China
Abstract:Based on Savitzky-Golay (SG) smoothing screening, principal component analysis (PCA) combined with separately supervised linear discriminant analysis (LDA) and unsupervised hierarchical clustering analysis (HCA) were used for non-destructive visible and near-infrared (Vis-NIR) detection for breed screening of transgenic sugarcane. A random and stability-dependent framework of calibration, prediction, and validation was proposed. A total of 456 samples of sugarcane leaves planting in the elongating stage were collected from the field, which was composed of 306 transgenic (positive) samples containing Bt and Bar gene and 150 non-transgenic (negative) samples. A total of 156 samples (negative 50 and positive 106) were randomly selected as the validation set; the remaining samples (negative 100 and positive 200, a total of 300 samples) were used as the modeling set, and then the modeling set was subdivided into calibration (negative 50 and positive 100, a total of 150 samples) and prediction sets (negative 50 and positive 100, a total of 150 samples) for 50 times. The number of SG smoothing points was expanded, while some modes of higher derivative were removed because of small absolute value, and a total of 264 smoothing modes were used for screening. The pairwise combinations of first three principal components were used, and then the optimal combination of principal components was selected according to the model effect. Based on all divisions of calibration and prediction sets and all SG smoothing modes, the SG-PCA-LDA and SG-PCA-HCA models were established, the model parameters were optimized based on the average prediction effect for all divisions to produce modeling stability. Finally, the model validation was performed by validation set. With SG smoothing, the modeling accuracy and stability of PCA-LDA, PCA-HCA were significantly improved. For the optimal SG-PCA-LDA model, the recognition rate of positive and negative validation samples were 94.3%, 96.0%; and were 92.5%, 98.0% for the optimal SG-PCA-LDA model, respectively. Conclusion: Vis-NIR spectroscopic pattern recognition combined with SG smoothing could be used for accurate recognition of transgenic sugarcane leaves, and provided a convenient screening method for transgenic sugarcane breeding.
Key words:Breed screening of transgenic sugarcane;Vis-NIR spectroscopy;SG smoothing;PCA-LDA;PCA-HCA
[1] ZHANG Xian-yong, YANG Ben-peng, ZHANG Shu-zhen(张显勇,杨本鹏,张树珍). Molecular Plant Breeding(分子植物育种), 2007, 5(6): 155. [2] WANG Guang-yin, FAN Wen-xiu, CHEN Bi-hua, et al(王广印,范文秀,陈碧华,等). Food Science(食品科学), 2008, 29(10): 698. [3] Pan T, Li M M, Chen J M. Appl. Spectrosc., 2014, 68(3): 263. [4] PAN Tao, WU Zhen-tao, CHEN Hua-zhou(潘 涛, 吴振涛, 陈华舟), Chinese J. Anal. Chem.(分析化学), 2012, 40(6): 920. [5] Liu Z Y, Liu B, Pan T, et al. Spectrochim. Acta A, 2013, 102: 269. [6] CAO Pu,PAN Tao, CHEN Xing-dan(曹 璞,潘 涛,陈星旦). Optics Preci. Eng.(光学精密工程), 2007,15(12): 1952. [7] CHU Xiao-li, YUAN Hong-fu, LU Wan-zhen(褚小立, 袁洪福, 陆婉珍). Prog. Chem.(化学进展), 2004, 16(4): 528. [8] CHU Xiao-li, XU Yu-peng, LU Wan-zhen(褚小立, 许育鹏, 陆婉珍). Chinese J. Anal. Chem.(分析化学), 2008, 36(5): 702. [9] LIU Zhen-yao, PAN Tao(刘振尧,潘 涛). Optics Preci. Eng.(光学精密工程), 2012, 20(10): 2171. [10] Pan T, Liu J M, Chen J M, et al. Anal. Methods, 2013, 5(4): 4355. [11] Xie L J, Ying Y B, Ying T J, et al. Anal. Chim. Acta, 2007, 584(2): 379. [12] Errikson L, Johansson E, Kettaneh W N, et al. Multi- and Megavariate Data Analysis Part I Basic Principles and Applications, Second Edition. Umea Sweden: Umetrics Academy, 2006. [13] Savitzky A, Golay M J E. Anal. Chem., 1964, 36(8): 1627. [14] Chen H Z, Pan T, Chen J M, etal. Chemometr. Intell. Lab., 2011, 107(1): 139. [15] Pan T, Chen Z H, Chen J M, et al. Anal. Methods, 2012, 4(4): 1046. [16] XIE Jun, PAN Tao, CHEN Jie-mei, et al (谢 军,潘 涛,陈洁梅,等), Chinese J. Anal. Chem.(分析化学), 2010, 38(3): 342.