|
|
|
|
|
|
Near-Infrared Spectroscopy Combined With Random Forest Algorithm: A Fast and Effective Strategy for Origin Traceability of Fuzi |
GONG Sheng1, ZHU Ya-ning2, ZENG Chen-juan3, MA Xiu-ying3, PENG Cheng1, GUO Li1* |
1. State Key Laboratory of Southwestern Chinese Medicine Resources, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
2. Yaan Sanjiu Pharmaceutical Co., Ltd., Yaan 625000, China
3. Sichuan Jianengda Panxi Pharmaceuticals Industry Co., Ltd., Butuo 616350, China
|
|
|
Abstract Effective and reliable methods of origin certification are essential for protecting high-value Chinese medicinal materials (e.g geo-authentic Chinese medicinal materials, geographical indication products, etc.) from designated regions. As a famous traditional Chinese medicine and a geo-authentic Chinese medicinal material produced in Sichuan Province, Aconiti Lateralis Radix Praeparata (Fuzi) has a remarkable curative effect and wide clinical application is in great demand in domestic and international markets. The efficacy and price of the Fuzi of different origins vary, and it is difficult for the public to identify them through traditional experience accurately. Mass spectrometry-based on plant metabolomics is a tedious and lengthy test sample preparation process, complicated operation, long detection time, and low reproducibility. Near-infrared (NIR) spectroscopy, a mature, fast and nondestructive detection technique was integrated with machine learning to bring new ways for online quality supervision and control of Chinese medicinal materials. Therefore, a non-destructive identification model based on NIR spectroscopy combined with a random forest (RF) algorithm was developed for different origins of Fuzi. A total of 255 samples of Fuzi were collected from the major cultivation regions of Sichuan, Shaanxi and Yunnan, and the diffuse reflectance spectral information of all samples was obtained using Fourier transform NIR spectroscopy. Single and combined spectral preprocessing methods are used to eliminate multiple interferences in the spectra, and the best preprocessing method is screened and used as an input indicator to build an RF model. The comprehensive performance of the RF model was evaluated using sensitivity, specificity and balanced accuracy. The results showed that Savitzky-Golay 11-point smoothing combined with multivariate scattering correction was the best preprocessing method.Using only the full wavelength data, the prediction accuracy of the RF model for the three groups of provincial samples was also checked over 90%, and the prediction accuracy after preprocessing reached 98.39%. For the city/county level samples, the RF model also had the excellent discriminative ability, greater than 75% accuracy. The RF model achieved 100% recognition rate for samples from cultivation areas around the traditional production areas. The top 100 feature wave numbers were filtered out, and the model was re-optimized, and the recognition accuracy of the model for each city/county level region was over 85%, especially for some samples from the highlands was significantly improved. In this study, an environment-friendly traceability strategy with faster analysis, less sample loss and higher precision was adopted, providing a new model for the rapid and efficient identification of Fuzi of different origins and a reference for the subsequent identification and traceability of Fuzi and its related processed products.
|
Received: 2021-11-14
Accepted: 2022-03-16
|
|
Corresponding Authors:
GUO Li
E-mail: guoli@cdutcm.edu.cn
|
|
[1] PENG Cheng(彭 成). Genuine Chinese Medicinal in China(中华道地药材). Beijing: Chinese Medicine Press(北京:中国中医药出版社), 2011. 1293.
[2] REN Pin-an, LI Xiao-lin, HUANG Jing, et al(任品安,李晓林,黄 晶,等). Chinese Traditional and Herbal Drugs(中草药), 2019, 50(13): 3255.
[3] QIAN Chang-min, SONG Zhao-hui, ZHANG Lan-lan, et al(钱长敏,宋兆辉,张兰兰,等). China Journal of Chinese Materia Medica(中国中药杂志), 2013, 38(17): 2761.
[4] ZHANG Ding-kun, HAN Xue, LI Rui-yu, et al(张定堃,韩 雪,李瑞煜,等). Journal of Chinese Materia Medica(中国中药杂志), 2016, 41(3): 463.
[5] PENG Cheng(彭 成). Safety Evaluation and Application of Toxic Traditional Chinese Medicines Aconiti Laleralis Radix Praeparata, Aconiti Radix, and Aconiti Kusenzoffii Radix(有毒中药附子、川乌、草乌的安全性评价与应用). Chengdu:Sichuan Science and Technology Press(成都: 四川科学技术出版社), 2014. 10.
[6] FAN Lin-hong, HE Lin, TAN Chao-qun, et al(范林宏,何 林,谭超群,等). Chinese Journal of Experimental Traditional Medical Formulae(中国实验方剂杂志), 2022, 28(3): 131.
[7] Dong F, Lin J T, You J H, et al. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2020, 226:117555.
[8] Pan W, Wu M, Zheng Z Z, et al. Journal of Food Science, 2020, 85(7): 2004.
[9] LEI Xiao-qing, LI Geng, WANG Xiu-li, et al(雷晓晴, 李 耿, 王秀丽, 等). Chinese Traditional and Herbal Drugs(中草药), 2018, 49(11): 2653.
[10] Qi L M, Zhong F R, Chen Y, et al. J. Pharm. Anal., 2020, 10(4): 356.
[11] Xue J T, Yang Q W, Jing Y, et al. Pharmacognosy Magazine, 2016, 12(47): 188.
[12] Liu P, Wang J, Li Q, et al. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2019, 206: 23.
[13] Koljonen J, Nordling T E M, Alander J T. Journal of Near Infrared Spectroscopy, 2008, 16(3): 189.
[14] Brusco M J. Computational Statistics & Data Analysis, 2014, 77: 38.
[15] Rahman A, Kondo N, Ogawa Y, et al. Biosystems Engineering, 2016, 141: 12.
[16] Fan W, Shan Y, Li G, et al. Food Analytical Methods, 2012, 5(3): 585.
[17] Zhang B, Li J, Fan S, et al. Computers and Electronics in Agriculture, 2015, 114: 14.
[18] Speiser J L, Miller M E, Tooze J, et al. Expert Systems with Applications, 2019, 134: 93.
[19] Breiman L. Machine Learning, 2001, 45(1): 5.
[20] Ziegler A, Konig I R. Wiley Interdiscip Rev-Data Mining Knowl Discov, 2014, 4(1): 55.
[21] Masetic Z, Subasi A. Comput Methods Programs Biomed, 2016, 130: 54.
[22] Sapir-Pichhadze R, Kaplan B. Transplantation, 2020, 104(5): 905.
[23] Hariharan R. Urban Climate, 2021, 36: 100780.
[24] Bertolini F, Galimberti G, Schiavo G, et al. Animal, 2018, 12(1): 12.
[25] Roguet A, Eren A M, Newton R J, et al. Microbiome, 2018, 6(1): 185.
[26] De Santana F B, Borges Neto W, Poppi R J. Food Chemistry, 2019, 293: 323.
[27] Wu X M, Zhang Q Z, Wang Y Z. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2018, 205: 479.
[28] Qi L, Li J, Liu H, et al. Food & Function, 2018, 9(11): 5903.
[29] Dhanoa M S, Lister S J, Sanderson R, et al. Journal of Near Infrared Spectroscopy, 1994, 2(1): 43.
[30] LIN Jun-zhi, ZHANG Ding-kun, PEI Jin, et al(林俊芝,张定堃,裴 瑾,等). Chinese Journal of Experimental Traditional Medical Formulae(中国实验方剂学杂志), 2017, 23(13): 35.
[31] YU Dai-xin, GUO Sheng, YANG Jian, et al(余代鑫,郭 盛,杨 健,等). China Journal of Chinese Materia Medica(中国中药杂志), 2022, 47(4): 862.
|
[1] |
GAO Feng1, 2, XING Ya-ge3, 4, LUO Hua-ping1, 2, ZHANG Yuan-hua3, 4, GUO Ling3, 4*. Nondestructive Identification of Apricot Varieties Based on Visible/Near Infrared Spectroscopy and Chemometrics Methods[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 44-51. |
[2] |
BAO Hao1, 2,ZHANG Yan1, 2*. Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 148-157. |
[3] |
HU Cai-ping1, HE Cheng-yu2, KONG Li-wei3, ZHU You-you3*, WU Bin4, ZHOU Hao-xiang3, SUN Jun2. Identification of Tea Based on Near-Infrared Spectra and Fuzzy Linear Discriminant QR Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3802-3805. |
[4] |
LIU Xin-peng1, SUN Xiang-hong2, QIN Yu-hua1*, ZHANG Min1, GONG Hui-li3. Research on t-SNE Similarity Measurement Method Based on Wasserstein Divergence[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3806-3812. |
[5] |
HUANG You-ju1, TIAN Yi-chao2, 3*, ZHANG Qiang2, TAO Jin2, ZHANG Ya-li2, YANG Yong-wei2, LIN Jun-liang2. Estimation of Aboveground Biomass of Mangroves in Maowei Sea of Beibu Gulf Based on ZY-1-02D Satellite Hyperspectral Data[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3906-3915. |
[6] |
BAI Xue-bing1, 2, SONG Chang-ze1, ZHANG Qian-wei1, DAI Bin-xiu1, JIN Guo-jie1, 2, LIU Wen-zheng1, TAO Yong-sheng1, 2*. Rapid and Nndestructive Dagnosis Mthod for Posphate Dficiency in “Cabernet Sauvignon” Gape Laves by Vis/NIR Sectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3719-3725. |
[7] |
WANG Qi-biao1, HE Yu-kai1, LUO Yu-shi1, WANG Shu-jun1, XIE Bo2, DENG Chao2*, LIU Yong3, TUO Xian-guo3. Study on Analysis Method of Distiller's Grains Acidity Based on
Convolutional Neural Network and Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3726-3731. |
[8] |
LUO Li, WANG Jing-yi, XU Zhao-jun, NA Bin*. Geographic Origin Discrimination of Wood Using NIR Spectroscopy
Combined With Machine Learning Techniques[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3372-3379. |
[9] |
ZHANG Shu-fang1, LEI Lei2, LEI Shun-xin2, TAN Xue-cai1, LIU Shao-gang1, YAN Jun1*. Traceability of Geographical Origin of Jasmine Based on Near
Infrared Diffuse Reflectance Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3389-3395. |
[10] |
YANG Qun1, 2, LING Qi-han1, WEI Yong1, NING Qiang1, 2, KONG Fa-ming1, ZHOU Yi-fan1, 2, ZHANG Hai-lin1, WANG Jie1, 2*. Non-Destructive Monitoring Model of Functional Nitrogen Content in
Citrus Leaves Based on Visible-Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3396-3403. |
[11] |
FANG Zheng, WANG Han-bo. Measurement of Plastic Film Thickness Based on X-Ray Absorption
Spectrometry[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3461-3468. |
[12] |
FU Gen-shen1, LÜ Hai-yan1, YAN Li-peng1, HUANG Qing-feng1, CHENG Hai-feng2, WANG Xin-wen3, QIAN Wen-qi1, GAO Xiang4, TANG Xue-hai1*. A C/N Ratio Estimation Model of Camellia Oleifera Leaves Based on
Canopy Hyperspectral Characteristics[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3404-3411. |
[13] |
HUANG Meng-qiang1, KUANG Wen-jian2, 3*, LIU Xiang1, HE Liang4. Quantitative Analysis of Cotton/Polyester/Wool Blended Fiber Content by Near-Infrared Spectroscopy Based on 1D-CNN[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3565-3570. |
[14] |
YAN Xing-guang, LI Jing*, YAN Xiao-xiao, MA Tian-yue, SU Yi-ting, SHAO Jia-hao, ZHANG Rui. A Rapid Method for Stripe Chromatic Aberration Correction in
Landsat Images[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3483-3491. |
[15] |
HUANG Zhao-di1, CHEN Zai-liang2, WANG Chen3, TIAN Peng2, ZHANG Hai-liang2, XIE Chao-yong2*, LIU Xue-mei4*. Comparing Different Multivariate Calibration Methods Analyses for Measurement of Soil Properties Using Visible and Short Wave-Near
Infrared Spectroscopy Combined With Machine Learning Algorithms[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3535-3540. |
|
|
|
|