|
|
|
|
|
|
Near Infrared Spectroscopy Analysis Based Machine Learning to Identify Haploids in Maize |
LI Wei1, LI Jin-long1, LI Wei-jun2, LIU Li-wei3, LI Hao-guang2, CHEN Chen1, CHEN Shao-jiang1* |
1. National Maize Improvement Center of China, Engineering Center for Maize Breeding of MOE, China Agricultural University, Beijing 100193, China
2. Institute of Semiconductores, Chinese Academy of Sciences, Beijing 100083, China
3. Beijing Tunyu Seed Co., Ltd.,Beijing 100193, China |
|
|
Abstract Haploid identification is a very important part of doubled haploid technology in maize. In this reasearch, we studied the near-infrared transmission spectra of a large number seeds of haploids and heterozygous diploids to establish an accurate model for haploid identification. Compared with the average spectrum of all haploids and heterozygous diploids, it was found that the absorption peak position of the two spectra was almost the same, but the haploid absorbance was slightly higher than that of heterozygous diploid, especially at the wavelengths of 940~1 120 and 1 180~1 316 nm which shared larger differences. Based on the near infrared spectra of haploids and heterozygous diploids from three different sourcegermplasm, different machine learning algorithms were called to construct a haploid selection model, accuracy of models developed with different spectral preprocessing methods were compared, and the effects of datasets to model evaluation were also studied. By comparison with several models, the haploid identification accuracy of the partial least squares method and the neural network algorithm reached a high accuracy of 95.42% and 93.26% respectively. The results of the testing set were consistent with the accuracy of the model, indicating that the two algorithms are suitable for large-scale screening of haploids. By using the partial least squares model, the accuracy of the model developed from the spectral preprocessing methods of smoothing was the best. Compared with the modeling results of different data size, it was found that increasing the data set in a certain range could improve the accuracy of the model. And when proportion of haploids was high enough, the recall rate of haploid prediction would reach up to 100%. In addition, haploids and heterozygous diploids which was difficult to be identified by R1-nj color were selected to form a new dataset. The accuracy of the partial least squares method trained by this dataset was 93.39%. This showed the advantages of NIR machine learning method for haploid identification, which could be used to achieve accurate identification in the case independent of R1-nj color expression. The method of NIR haploid identification based on machine learning has high accuracy and efficiency, and the method can be optimized with increasing data. This research paved a way for the intelligent identification of haploid.
|
Received: 2017-05-02
Accepted: 2017-10-11
|
|
Corresponding Authors:
CHEN Shao-jiang
E-mail: chen368@126.com
|
|
[1] Nanda D K, Chase S S. Crop Science, 1966, 6(2): 213.
[2] CHEN Shao-jiang,SONG Tong-ming(陈绍江, 宋同明). Acta Agronomica Sinica(作物学报),2003,(4): 587.
[3] Melchinger A E, Schipprack W, Würschum T, et al. Scientific Reports, 2013, 3: 2129.
[4] LIU Jin,GUO Ting-ting,YANG Pei-qiang,et al(刘 金,郭婷婷,杨培强,等). Transactions of the Chinese Society of Agricultural Engineering(农业工程学报), 2012,(S2): 233.
[5] CHEN Shao-jiang(陈绍江). Crops(作物杂志),2013,(6): 1.
[6] CHEN Shao-jiang, LI Liang, LI Hao-chuan, et al(陈绍江,黎 亮,李浩川,等). Beijing: China Agricultural University Press(北京:中国农业大学出版社),2009.
[7] Jones R W, Reinot T, Frei U K, et al. Applied Spectroscopy, 2012, 66(4): 447.
[8] QIN Hong,MA Jing-yi,CHEN Shao-jiang,et al(覃 鸿,马竞一,陈绍江,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析),2016,36(1): 292.
[9] Team R C. R Foundation for Statistical Computing, Vienna, Austria. 2016.
[10] Kuhn M. Journal of Statistical Software, 2008, 28(5): 1.
[11] Dong X, Xu X, Li L, et al. Molecular Breeding, 2014, 34(3): 1147. |
[1] |
GAO Feng1, 2, XING Ya-ge3, 4, LUO Hua-ping1, 2, ZHANG Yuan-hua3, 4, GUO Ling3, 4*. Nondestructive Identification of Apricot Varieties Based on Visible/Near Infrared Spectroscopy and Chemometrics Methods[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 44-51. |
[2] |
LIU Jia, ZHENG Ya-long, WANG Cheng-bo, YIN Zuo-wei*, PAN Shao-kui. Spectra Characterization of Diaspore-Sapphire From Hotan, Xinjiang[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 176-180. |
[3] |
BAO Hao1, 2,ZHANG Yan1, 2*. Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 148-157. |
[4] |
BAI Xue-bing1, 2, SONG Chang-ze1, ZHANG Qian-wei1, DAI Bin-xiu1, JIN Guo-jie1, 2, LIU Wen-zheng1, TAO Yong-sheng1, 2*. Rapid and Nndestructive Dagnosis Mthod for Posphate Dficiency in “Cabernet Sauvignon” Gape Laves by Vis/NIR Sectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3719-3725. |
[5] |
WANG Qi-biao1, HE Yu-kai1, LUO Yu-shi1, WANG Shu-jun1, XIE Bo2, DENG Chao2*, LIU Yong3, TUO Xian-guo3. Study on Analysis Method of Distiller's Grains Acidity Based on
Convolutional Neural Network and Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3726-3731. |
[6] |
WANG Hong-jian1, YU Hai-ye1, GAO Shan-yun1, LI Jin-quan1, LIU Guo-hong1, YU Yue1, LI Xiao-kai1, ZHANG Lei1, ZHANG Xin1, LU Ri-feng2, SUI Yuan-yuan1*. A Model for Predicting Early Spot Disease of Maize Based on Fluorescence Spectral Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3710-3718. |
[7] |
HE Qing-yuan1, 2, REN Yi1, 2, LIU Jing-hua1, 2, LIU Li1, 2, YANG Hao1, 2, LI Zheng-peng1, 2, ZHAN Qiu-wen1, 2*. Study on Rapid Determination of Qualities of Alfalfa Hay Based on NIRS[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3753-3757. |
[8] |
HU Cai-ping1, HE Cheng-yu2, KONG Li-wei3, ZHU You-you3*, WU Bin4, ZHOU Hao-xiang3, SUN Jun2. Identification of Tea Based on Near-Infrared Spectra and Fuzzy Linear Discriminant QR Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3802-3805. |
[9] |
LIU Xin-peng1, SUN Xiang-hong2, QIN Yu-hua1*, ZHANG Min1, GONG Hui-li3. Research on t-SNE Similarity Measurement Method Based on Wasserstein Divergence[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3806-3812. |
[10] |
HUANG You-ju1, TIAN Yi-chao2, 3*, ZHANG Qiang2, TAO Jin2, ZHANG Ya-li2, YANG Yong-wei2, LIN Jun-liang2. Estimation of Aboveground Biomass of Mangroves in Maowei Sea of Beibu Gulf Based on ZY-1-02D Satellite Hyperspectral Data[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3906-3915. |
[11] |
LUO Li, WANG Jing-yi, XU Zhao-jun, NA Bin*. Geographic Origin Discrimination of Wood Using NIR Spectroscopy
Combined With Machine Learning Techniques[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3372-3379. |
[12] |
ZHANG Shu-fang1, LEI Lei2, LEI Shun-xin2, TAN Xue-cai1, LIU Shao-gang1, YAN Jun1*. Traceability of Geographical Origin of Jasmine Based on Near
Infrared Diffuse Reflectance Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3389-3395. |
[13] |
YANG Qun1, 2, LING Qi-han1, WEI Yong1, NING Qiang1, 2, KONG Fa-ming1, ZHOU Yi-fan1, 2, ZHANG Hai-lin1, WANG Jie1, 2*. Non-Destructive Monitoring Model of Functional Nitrogen Content in
Citrus Leaves Based on Visible-Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3396-3403. |
[14] |
FANG Zheng, WANG Han-bo. Measurement of Plastic Film Thickness Based on X-Ray Absorption
Spectrometry[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3461-3468. |
[15] |
FU Gen-shen1, LÜ Hai-yan1, YAN Li-peng1, HUANG Qing-feng1, CHENG Hai-feng2, WANG Xin-wen3, QIAN Wen-qi1, GAO Xiang4, TANG Xue-hai1*. A C/N Ratio Estimation Model of Camellia Oleifera Leaves Based on
Canopy Hyperspectral Characteristics[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3404-3411. |
|
|
|
|