|
|
|
|
|
|
Infrared Spectral Study on the Origin Identification of Boletus Tomentipes Based on the Random Forest Algorithm and Data Fusion Strategy |
HU Yi-ran1, LI Jie-qing1, LIU Hong-gao2, FAN Mao-pan1*, WANG Yuan-zhong3* |
1. College of Resources and Environment, Yunnan Agricultural University, Kunming 650201, China
2. College of Agronomy and Biotechnology, Yunnan Agricultural University, Kunming 650201, China
3. Institute of Medicinal Plants, Yunnan Academy of Agricultural Sciences, Kunming 650200, China |
|
|
Abstract Boletus tomentipes Earleas a kind of healthy food is favored by the majority of consumers. The nutrient accumulation of the fruiting body is affected by the growth environment (altitude, climate, etc. ). There is a significant difference in the content of nutrient between different regionsIt is urgent to establish an accurate, rapid and cheap origin identification technology. In this paper, a data fusion strategy combined with random forest algorithm (RF) was used to identify the origin of B. tomentipes, and the effects of various eigenvalue extraction methods on the classification of RF models were compared. Fourier transform near infrared and Fourier transform mid-infrared spectra of 87 samples from 4 producing areas (north subtropics, north temperate zones, south subtropical zones and middle subtropical zones) were scanned to analyze their spectral characteristics. All the sampleswere divided into two thirds of the training set (58) and a third of the validation set (29) by the kennard-stone algorithm. Based on 4 kinds of infrared spectra ( near-infrared average spectra of stipes (N-b), near-infrared average spectra of caps (N-g), mid-infrared average spectra of stipes (M-b), mid-infrared average spectra of caps (M-g)) and three data fusion strategies (low-level fusion strategies, mid-level fusion strategies, high-level fusion strategies) of data, combining with the RF building identification model, the effects of different characteristic value (variable importance in projection, Boruta, latent variables) on the classification results of the model are compared. Among them, the optimal ntree and mtrywere selected according to oob. The classification performance of the model was evaluated with specificity, sensitivity, training set correctness, and validation set accuracy. Finally, the best method to identify the origin of B. tomentipes was found by multiple evaluation indicators. The results showed that (1) near infrared and middle infrared spectra could identify the origin of B. tomentipes. (2) It is not ideal for establish a discriminant model with a single spectrum combined with RF. (3) All three fusion strategies can improve the origin identification effect of B. tomentipes. Theresults of origin identification from good to bad are in order of high-level fusion, mid-level fusion, low-level fusion. By scanning the near infrared and middle infrared spectra of B. tomentipes, a high-level fusion strategy based on characteristic value LV was adopted, and the identification model of B. tomentipes from different regions was established with RF, which has high verification set accuracy (99.6%), high sensitivity (0.969) and high specificity (0.986). As a reliable method, it can identify the geographical origin of B. tomentipes quickly and accurately.
|
Received: 2019-10-08
Accepted: 2020-01-20
|
|
Corresponding Authors:
FAN Mao-pan, WANG Yuan-zhong
E-mail: boletus@126.com;mpfan@126.com
|
|
[1] Wang X, Zhang J, Wu L, et al. Food Chemistry, 2014, 151: 279.
[2] YANGBAI Qiu-xiu, CHEN Xun, LIU Xiao-fei(杨白秋秀,陈 旭,刘晓飞). Edible Fungi of China(中国食用菌), 2017, 36(5): 13.
[3] LU Yong-xin, TIAN Hou-ming, YANG Hai-shu, et al(鲁永新,田侯明,杨海抒,等). Chinese Journal of Eco-Agriculture(中国生态农业学报), 2015, 23(6): 748.
[4] Falandysz J, Zhang J, Wiejak A, et al. Ecotoxicology and Environmental Safety, 2017, 142: 497.
[5] YANG Tian-wei, CUI Bao-kai, ZHANG Ji, et al(杨天伟,崔宝凯,张 霁,等). Mycosystema(菌物学报), 2014, 33(2): 262.
[6] Chen Y, Yan Y, Xie M, et al. Journal of Pharmaceutical and Biomedical Analysis, 2008, 47(3): 469.
[7] Wang X, Zhang J, Li T, et al. Journal of Analytical Methods in Chemistry, 2015, 2015: http://dx.doi.org/10.1155/2015/165412.
[8] Li Y, Zhang J, Wang Y. Analytical and Bioanalytical Chemistry, 2018, 410(1): 91.
[9] Wang Y, Zuo Z T, Huang H Y, et al. Royal Society Open Science, 2019, 6(5): 190399.
[10] He P, Xu X, Zhang B, et al. Estimation of Leaf Chlorophyll Content in Winter Wheat Using Variable Importance for Projection (VIP) with Hyperspectral Data. Proceedings of SPIE, 2015, 9637: 963708.
[11] CHEN Yi-jie, TANG Jia-shan(陈逸杰,唐加山). Software Guide(软件导刊), 2019, 18(4): 69.
[12] Mellado-Mojica E, López M G. Food Chemistry, 2015, 167: 349. |
[1] |
JIN Cheng-liang1, WANG Yong-jun2*, HUANG He2, LIU Jun-min3. Application of High-Dimensional Infrared Spectral Data Preprocessing in the Origin Identification of Traditional Chinese Medicinal Materials[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(07): 2238-2245. |
[2] |
WU Chao1, QIU Bo1*, PAN Zhi-ren1, LI Xiao-tong1, WANG Lin-qian1, CAO Guan-long1, KONG Xiao2. Application of Spectral and Metering Data Fusion Algorithm in Variable Star Classification[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(06): 1869-1874. |
[3] |
WANG Qiu, LI Bin, HAN Zhao-yang, ZHAN Chao-hui, LIAO Jun, LIU Yan-de*. Research on Anthracnose Grade of Camellia Oleifera Based on the Combined LIBS and Fourier Transform NIR Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(05): 1450-1458. |
[4] |
WU Mu-lan1, SONG Xiao-xiao1*, CUI Wu-wei1, 2, YIN Jun-yi1. The Identification of Peas (Pisum sativum L.) From Nanyang Based on Near-Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(04): 1095-1102. |
[5] |
WANG Zhi-xin, WANG Hui-hui, ZHANG Wen-bo, WANG Zhong, LI Yue-e*. Classification and Recognition of Lilies Based on Raman Spectroscopy and Machine Learning[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(01): 183-189. |
[6] |
WANG Wen-jun1, SHA Yun-fei1, WANG Yang-zhong1, YU Jie1, LIU Tai-ang2, ZHANG Xu-feng3, MENG Xiang-zhou3, GE Jiong1*. Discriminating Flavor Styles via Data Fusion of NIR and EN[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(01): 133-137. |
[7] |
GENG Ying-rui1, SHEN Huan-chao1, NI Hong-fei2, CHEN Yong1, LIU Xue-song1*. Support Vector Machine Optimized by Near-Infrared Spectroscopic
Technique Combined With Grey Wolf Optimizer Algorithm to
Realize Rapid Identification of Tobacco Origin[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(09): 2830-2835. |
[8] |
LI Qing1, 2, XU Li1, 2, PENG Shan-gui1, 2, LUO Xiao1, 2, ZHANG Rong-qin1, 2, YAN Zhu-yun3, WEN Yong-sheng1, 2*. Research on Identification of Danshen Origin Based on Micro-Focused
Raman Spectroscopy Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(06): 1774-1780. |
[9] |
DAI Lu-lu1, YANG Ming-xing1, 2*, WEN Hui-lin1. Study on Chemical Compositions and Origin Discriminations of Hetian Yu From Maxianshan, Gansu Province[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(05): 1451-1458. |
[10] |
CHEN Feng-xia1, YANG Tian-wei2, LI Jie-qing1, LIU Hong-gao3, FAN Mao-pan1*, WANG Yuan-zhong4*. Identification of Boletus Species Based on Discriminant Analysis of Partial Least Squares and Random Forest Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(02): 549-554. |
[11] |
WANG Dong1,2, HAN Ping1,2*, WU Jing-zhu3*, ZHAO Li-li4, XU Heng4. Non-Destructive Identification of the Heat-Damaged Kernels of Waxy Corn Seeds Based on Near-Ultraviolet-Visible-Shortwave and Near-Infrared Multi-Spectral Imaging Data[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(09): 2696-2702. |
[12] |
CHEN Qi1,3, PAN Tian-hong2,4*, LI Yu-qiang4, LIN Hong4. Geographical Origin Discrimination of Taiping Houkui Tea Using Convolutional Neural Network and Near-Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(09): 2776-2781. |
[13] |
ZHANG Jiao1, 2, WANG Yuan-zhong1, YANG Wei-ze1, ZHANG Jin-yu1*. Data Fusion of ATR-FTIR and UV-Vis Spectra to Identify the Origin of Polygonatum Kingianum[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(05): 1410-1416. |
[14] |
LU Shi-yang1, 2, ZHANG Lei-lei1, 2, PAN Jia-rong1, 2, YANG De-hong1, 2, SUI Ya-nan1, 2, ZHU Cheng1, 2*. Study on the Indetification of the Geographical Origin of Cherries Using Raman Spectroscopy and LSTM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(04): 1177-1181. |
[15] |
SHA Yun-fei1, HUANG Wen1, WANG Liang1, LIU Tai-ang2,YUE Bao-hua2, LI Min-jie2, YOU Jing-lin2, GE Jiong1*, XIE Wen-yan1*. Merging MIR and NIR Spectral Data for Flavor Style Determination[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(02): 473-477. |
|
|
|
|