| 
					
						| 
								
    					| Application of Improved Random Frog Algorithm in Fast Identification of Soybean Varieties |  
						| LI Wei1, TAN Feng2*, ZHANG Wei1, GAO Lu-si3, LI Jin-shan4 |  
						| 1. College of Engineering, Heilongjiang Bayi Agricultural University, Daqing 163319, China
 2. College of Electrical and Information, Heilongjiang Bayi Agricultural University, Daqing 163319, China
 3. Suihua Branch of Heilongjiang Academy of Agricultural Sciences, Suihua 152052, China
 4. Daqing Green Agricultural Products Monitor Center, Daqing 163311, China
 
 |  
						|  |  
					
						| 
								
									| 
											
                        					 
												
													
													    |  |  
														| 
													
													    | Abstract  Rapid and accurate identification of soybean varieties plays an important role in assessing seed quality, purifying the seed market and ensuring food security. Traditional methods for identifying crop varieties suffer from poor accuracy and low efficiency. Therefore, a partial least squares (PLS) identification model was established using Raman spectroscopy combined with characteristic wavelength extraction to rapidly identify four high-oil soybean varieties (Heinong 87, Heinong 89, Suinong 38 and Suinong 77) from Heilongjiang Province. Random frog (RF) is a new characteristic wavelength selection algorithm that determines the importance of variables by iteratively calculating their selection probabilities, which can remove much of the redundant information in the full spectrum. However, the method has three disadvantages: a random initial variable set, a large number of iterations and an uncertain selection threshold. To address these, an improved random frog (MRF) algorithm based on LASSO regression was proposed. To remove the randomness of the initial variable set in the RF algorithm, LASSO was used to extract the characteristic wavelength points most related to the attribute variable as the initial variable set F0. Iterative calculations were then carried out starting from this set, which reduced the number of useless iterations and improved the prediction accuracy of the model. In addition, RF selects variables by setting a threshold, which makes the extracted characteristic wavelengths uncertain. This step was improved as follows. First, the variables with a selection probability of 0 were removed, and the remaining sorted variables were divided into intervals of 10 wavelength points. Then, partial least squares discriminant analysis (PLS-DA) models between the characteristic wavelengths and the soybean varieties were built by adding one interval at a time, and the wavelength subset with the smallest root mean square error of cross-validation (RMSECV) was taken as the selected characteristic wavelengths. 
A PLS-DA model was then established with the characteristic wavelengths selected by MRF as input variables, and its prediction performance was compared with that of models built on the full spectrum and on the characteristic wavelengths selected by the RF, LASSO and ElasticNet algorithms. The results showed that the MRF algorithm selected 300 characteristic wavelength points, only 9.37% of the full spectrum, effectively screening the key characteristic variables and reducing the complexity of the model. The root mean square error of prediction (RMSEP) and the coefficient of determination of prediction (R²p) were 0.2469 and 0.9512, respectively, and the identification accuracy reached 100%, the best among all models. Therefore, Raman spectroscopy combined with the MRF algorithm can rapidly identify soybean varieties and provides a new technique for the rapid identification of other crop varieties. |  
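The threshold-free selection step described in the abstract (drop the variables whose selection probability is 0, then grow the subset one 10-point interval at a time and keep the subset with the smallest RMSECV) might be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the function name `select_by_intervals` and the callback `rmsecv_fn`, which stands in for a cross-validated PLS-DA fit, are hypothetical, and sorting by decreasing selection probability is assumed.

```python
from typing import Callable, List, Sequence, Tuple


def select_by_intervals(
    probabilities: Sequence[float],
    rmsecv_fn: Callable[[List[int]], float],
    step: int = 10,
) -> Tuple[List[int], float]:
    """Return the wavelength subset with the smallest RMSECV and that RMSECV.

    probabilities[i] is the selection probability of wavelength index i.
    rmsecv_fn(subset) is a hypothetical callback that would fit a PLS-DA
    model on `subset` and return its cross-validated RMSE.
    """
    # Remove variables that were never selected (probability 0).
    kept = [i for i, p in enumerate(probabilities) if p > 0]
    # Assumption: rank the remaining variables by decreasing probability.
    kept.sort(key=lambda i: probabilities[i], reverse=True)

    best_subset: List[int] = []
    best_rmsecv = float("inf")
    # Grow the candidate subset by one interval (`step` variables) at a time.
    for end in range(step, len(kept) + step, step):
        subset = kept[:end]
        score = rmsecv_fn(subset)
        if score < best_rmsecv:
            best_subset, best_rmsecv = subset, score
    return best_subset, best_rmsecv


# Toy usage: a made-up score that is smallest for two variables.
toy_probabilities = [0.0, 0.9, 0.5, 0.0, 0.7, 0.1]
toy_subset, toy_score = select_by_intervals(
    toy_probabilities, lambda s: abs(len(s) - 2) + 0.1, step=2
)
```

In practice `rmsecv_fn` would wrap whatever PLS-DA implementation and cross-validation scheme is in use; because the subset is chosen by the minimum RMSECV, no probability threshold has to be set by hand.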
															| Received: 2022-05-11    Accepted: 2022-10-25 |  
															|  |  
															| Corresponding Author: TAN Feng    E-mail: tf1972@163.com |  |  
													
														  
													
														
											 
											 |  |  |