Abstract:Lily bulbs, the underground metamorphic stems composed of thick scales grown by perennial herbaceous bulbous plants of the lily family Liliaceae, a typical medicinal and edible homologous crop. It is rich in nutrients and has anti-tumor, antidepressant, hypoglycemia, and improves immune functions. The market prices of lily bulbs from different origins are quite different. The traditional evaluation methods that rely on artificial experience and sensory are highly subjective and have poor certainty, making it difficult to be widely used in modern production links. Advanced detection methods based on chemical inspection methods are time-consuming and expensive and it are difficult to meet the requirements for origin identification. Raman spectroscopy is a vibration spectrum based on inelastic scattering, which can achieve fast and accurate non-destructive testing. Combining Raman spectroscopy with machine learning algorithms, a classification model of the three most widely distributed lily bulbs in China (Lanzhou lily, Yixing lily and Longya lily) was established. Observing the characteristic peaks of 479, 870, 942 and 1 606 cm-1 on the matrix spectrum, a non-destructive testing method based on the component content of Raman spectroscopy to determine the place of origin and evaluate the quality of lily bulbs is proposed. First, the traditional method is used to collect the spectrum of the lily bulb sample. After the spectral data is preprocessed, the artificial prior method is used to extract the representative substance of the lily bulb and determine the characteristic peaks. Then the principal component analysis and the t-distribution random neighborhood embedding method are used to reduce the dimensionality. Extract spectral data features. The data features obtained above are applied to support vector machines, decision trees and random forest algorithms. The experimental results show that these classification models all show ideal classification accuracy on the same test set. Among them, the model’s accuracy based on principal component analysis and decision tree algorithm reached 91.7%。The model’s accuracy based on t-distribution random neighborhood embedding and support vector machine is 93.7%, and the accuracy rate of the model combining the principal component analysis and random forest algorithm is as high as 95.8%. In summary, this method can provide on-site rapid identification and identification of the origin of lily bulbs, improve the accuracy of the quality assessment in the modern production process, and provide a reference for the identification of the origin of modern production and the quality analysis of lily bulbs.
王志新,王慧荟,张文波,王 忠,李月娥. 基于拉曼光谱和机器学习的百合分类识别[J]. 光谱学与光谱分析, 2023, 43(01): 183-189.
WANG Zhi-xin, WANG Hui-hui, ZHANG Wen-bo, WANG Zhong, LI Yue-e. Classification and Recognition of Lilies Based on Raman Spectroscopy and Machine Learning. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(01): 183-189.
[1] Li H, Wang R, Liu J, et al. International Journal of Biological Macromolecules, 2020, 148: 956.
[2] Lazare S, Bechar D, Fernie A R, et al. The Plant Journal, 2019, 97: 321.
[3] Kurokawa K, Kobayashi J, Nemoto K, et al. Frontiers in Plant Science, 2020, 11: 570915.
[4] Chi X, Wang S, Baloch Z, et al. Biomedicine & Pharmacotherapy, 2019, 112: 108616.
[5] Zhang H, Chi X, Pan W, et al. Genes, Brain and Behavior, 2020, 19(5): e12649.
[6] Li W, Wang Y, Zhang Y, et al. Acta Physiologiae Plantarum, 2020, 42(8): 127.
[7] Huang D, Li W, Dawuda M M, et al. Postharvest Biology and Technology, 2021, 176(11): 111520.
[8] Restaino O F, Rosa M D, Schiraldi C. Electrophoresis, 2020, 41(20): 1740.
[9] Bian Y, Zheng R, Bayer F P, et al. Nature Communications, 2020, 11(1): 157.
[10] Cecchin F D, Cardoso H K, Primieri N K, et al. Food Analytical Methods, 2019, 12(1): 108.
[11] Zhang S, Wang H, Zhu M, et al. Talanta, 2019, 196: 249.
[12] YUAN Zhi-ying, LI Dong-yang, LI Ya-lin, et al(袁志鹰, 李东洋, 李亚林, 等). Chinese Journal of Information on Traditional Chinese Medicine(中国中医药信息杂志), 2020, 27(1): 57.
[13] CHEN Qi, PAN Tian-hong, LI Yu-qiang, et al(陈 琦, 潘天红, 李鱼强, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2021, 41(9): 2776.
[14] Bergholt M S, Serio A, Albro M B. Frontiers in Bioengineering and Biotechnology, 2019, 7: 303.
[15] Ho C S, Jean N, Hogan C A, et al. Nature Communications, 2019, 10: 4927.
[16] Yaseen T, Sun D W, Pu H, et al. Food Analytical Methods, 2018, 11(9): 2518.
[17] Cui L, Li R, Mu T, et al. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2021, 264: 120283.