The Origin Identification Study of Boletus Edulis Based on the Infrared Spctrum Data Fusion Strategy
HU Yi-ran1, LI Jie-qing1, LIU Hong-gao2, FAN Mao-pan1*, WANG Yuan-zhong3*
1. College of Resources and Environment, Yunnan Agricultural University, Kunming 650201, China
2. College of Agronomy and Biotechnology, Yunnan Agricultural University, Kunming 650201, China
3. Institute of Medicinal Plants, Yunnan Academy of Agricultural Sciences, Kunming 650200, China
Abstract:In recent years, food safety problems happened frequently, and consumers pay more and more attention to the environmental safety of food origin, which leads to an increase in demand for geographical indication products. As a healthy food, the quality of Boletus edulis is greatly affected by the environment of its origin. In order to protect consumers’ health and prevent fake and inferior products from entering the market, it is urgent to develop an efficient and low-cost identification technology of the origin of delicious Boletus edulis. Data fusion strategy and partial least squares discrimination (PLS-DA) model were used to identify the origin of Boletus edulis. In this paper, Fourier transform near infrared and Fourier transform middle infrared spectra of 141 samples from 8 Origin (Kunming, Chuxiong, Yuxi, Diqing, Dali, Baoshan, Wenshan and Qujing) were scanned. Kennard-stone algorithm was used to divide all samples into 2/3 training set and 1/3 prediction set. Three fusion strategies (low-level, mid-level, high-level) were used to analyze four single spectral matrices spectra: near-infrared average spectra of stipes (N-b), near-infrared average spectra of caps (N-g), mid-infrared average spectra of stipes (M-b), mid-infrared average spectra of caps (M-g) and to establish a partial least squares discriminant (PLS-DA) model. In which root mean square error of cross validation (RMSECV) and the root mean square prediction error (RMSEP) are used to evaluate model stability. The purpose of the non-error ratio (NER), training set classification accuracy and forecast set classification accuracy evaluation model classification performance. It contributes to find out the best way to geographic origin identification of Boletus edulis. The results showed that: (1) near infrared and middle infrared spectra can identify the origin of Boletus edulis; (2) the model established by middle infrared spectrum is better than that in near infrared spectrum; (3) all the three fusion strategies can improve the identification effect of origin of Boletus edulis, and the identification results of producing area from good to bad are in order of mid fusion, high fusion, low fusion and single spectral model. By using PLS-DA intermediate fusion strategy to fuse in near infrared and Mid-infrared spectrum, different origin Boletus edulis identification models are established, with the least number of variables (49), the highest accuracy of training set in producing area (100%), the highest accuracy of prediction set of origin (100%), the lowest RMSEP (0.133). As a reliable method, it can identify the geographical origin of Boletus edulis fast and accurately.
Key words:Boletus edulis; Geographic origin identification; Data fusion; Fourier transform mid-infrared spectrum; Fourier transform near infrared spectrum
胡翼然,李杰庆,刘鸿高,范茂攀,王元忠. 红外光谱数据融合对美味牛肝菌产地鉴别[J]. 光谱学与光谱分析, 2020, 40(04): 1276-1282.
HU Yi-ran, LI Jie-qing, LIU Hong-gao, FAN Mao-pan, WANG Yuan-zhong. The Origin Identification Study of Boletus Edulis Based on the Infrared Spctrum Data Fusion Strategy. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(04): 1276-1282.
[1] Lu Y, Song S, Wang R, et al. Environment International, 2015, 77: 5.
[2] Cei L, Stefani G, Defrancesco E, et al. Land Use Policy, 2018, 75: 620.
[3] Wang X, Zhang J, Wu L, et al. Food Chemistry, 2014, 151: 279.
[4] Falandysz J, Saba M, Liu H, et al. Environmental Science and Pollution Research, 2016, 23(23): 23730.
[5] MA Zhang-zhong, XU Jin-hua, LUO Zhang, et al(马长中, 徐锦华, 罗 章, 等). The Food Industry(食品工业), 2017, 38(3): 311.
[6] GUI Ming-ying, XU Jun, ZHANG Xin, et al(桂明英, 徐 俊, 张 鑫, 等). The Food Industry(食品工业), 2015, 36(1): 196.
[7] Quelal-Vásconez M A, Lerma-García M J, Pérez-Esteve , et al. Food Control, 2019, 99: 68.
[8] YANG Tian-wei, LI Tao, LI Jie-qing, et al(杨天伟, 李 涛, 李杰庆, 等). Spectroscopy and Spectral Analysis (光谱学与光谱分析), 2016, 36(7): 2117.
[9] Li Y, Xiong Y, Min S. Vibrational Spectroscopy, 2019, 101: 20.
[10] Li Y, Zhang J, Wang Y. Analytical and Bioanalytical Chemistry, 2018, 410(1): 91.
[11] Márquez C, López M I, Ruisánchez I, et al. Talanta, 2016, 161: 80.
[12] Ribeiro J S, Ferreira M M C, Salva T J G. Talanta, 2011, 83(5): 1352.
[13] MENG Ling-zhi, GONG Shu-ling (孟令芝, 龚淑玲). Organic Spectral Analysis(有机波谱分析). 2nd ed(第2版). Wuhan:Wuhan University Press(武汉:武汉大学出版社), 2003. 213.
[14] YANG Tian-wei, LIU Hong-gao, ZHANG Ji, et al(杨天伟, 刘鸿高, 张 霁, 等). Hubei Agricultural Science(湖北农业科学), 2016, 55(9): 2362.
[15] YAO Sen, LI Tao, LIU Hong-Gao, et al(姚 森,李 涛,刘鸿高, 等). Food Science(食品科学), 2018, 39(8): 212.