|
|
|
|
|
|
Origin Discrimination and Soluble Protein Content Prediction of Dried Daylily Based on Near Infrared Spectroscopy |
ZHANG Xue-li1, 2, YANG Hao1, 2, LI Chen-fei1, 2, SUN Yi-le1, 2, LIU Zong-lin1, 2, ZHENG De-cong1, 2, SONG Hai-yan1, 2* |
1. College of Agricultural Engineering, Shanxi Agricultural University, Taigu 030801, China
2. Dryland Farm Machinery Key Technology and Equipment Key Laboratory of Shanxi Province, Taigu 030801, China
|
|
|
Abstract Daylily is rich in nutrients and has high edible, medicinal, and economic value. It has many producing areas in China. The origin discrimination and soluble protein content prediction of daylilies are of great significance to the quality management of daylilies, the establishment of an agricultural product brand, and the development of the local economy. Because fresh daylily contains a variety of alkaloids, it is not suitable to eat in large quantities. Therefore, most of the daylilies on the market are dried daylilies. In this paper, the origin discrimination model and soluble protein content prediction model for dried daylily were established based on near-infrared spectroscopy. To address the issues of low discrimination accuracy and inaccurate content prediction in the original algorithm, the model was enhanced, resulting in a significant improvement in accuracy through the combination of various preprocessing methods and characteristic wavelength screening algorithms. In this study, Partial Least Squares Discriminant Analysis (PLS-DA), Random Forest (RF), and Support Vector Machine (SVM) were combined with Multiplicative Scatter Correction (MSC), Standard Normal Variate (SNV) and Savitzky-Golay smoothing (SG) respectively to establish the origin discrimination models of dried daylily and compare the model discrimination results. The experimental results show that PLS-DA combined with MSC has the best effect on origin discrimination, with an accuracy of 93.33%. The precision and recall of the three origins are all above 85%, with an average precision of 91.9% and an average recall of 91.9%. It demonstrates that the model exhibits good accuracy and stability, and can effectively distinguish the origin of dried daylilies. At the same time, Partial Least Squares Regression (PLSR) was combined with a variety of preprocessing methods and three characteristic wavelength screening algorithms: Unobserved Variable Elimination (UVE), Competitive Adaptive Reweighted Sampling (CARS) and Successive Projections Algorithm (SPA), respectively, to establish the prediction models of soluble protein content of dried daylily and compare the prediction results. The results show that the model established by PLSR, combined with SG and CARS, has the best predictive effect. The determination coefficient R2 reached 0.981 5, and the Root Mean Square Error of Prediction (RMSEP) was 0.021 4 g·kg-1. Compared with the original PLSR, the R2 increased by 0.12, and the RMSEP decreased by 0.033 1 g·kg-1. This prediction model can well predict the soluble protein content of dried daylily.
|
Received: 2024-12-11
Accepted: 2025-04-14
|
|
Corresponding Authors:
SONG Hai-yan
E-mail: yybbao@sxau.edu.cn
|
|
[1] ZHAO Xi-he, LI Hao-quan, GONG Li, et al(赵锡和,李浩权, 龚 丽, 等). Packaging and Food Machinery(包装与食品机械), 2009, 27(5): 4.
[2] LI Yu-jin(李玉浸). Journal of Agricultural Resources and Environment(农业环境与发展), 2007, 24(3): 1.
[3] LIU Wei, ZHANG Qun, LI Zhi-jian, et al(刘 伟, 张 群, 李志坚, 等). Food Science(食品科学), 2019, 40(10): 243.
[4] Eisenstecken D, Stürz B, Robatscher P, et al. Postharvest Biology and Technology, 2019, 147: 123.
[5] ZHANG Ai-wu, SONG Xue-jian, WANG Xin-hui, et al(张爱武, 宋雪健, 王欣卉, 等). Journal of the Chinese Cereals and Oils Association(中国粮油学报), 2021, 36(10): 166.
[6] YANG Qun, LING Qi-han, WEI Yong, et al(杨 群, 凌琪涵, 魏 勇, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2023, 43(11): 3396.
[7] Bartwal A, John R, Padhi S R, et al. Journal of Food Composition and Analysis, 2023, 116: 105087.
[8] XIANG Zhi-yong, MIAO Yu-bin(相志勇, 苗玉彬). Journal of Chinese Agricultural Mechanization(中国农机化学报), 2024, 45(11): 139.
[9] Wold S, Sjöström M, Eriksson L. Chemometrics and Intelligent Laboratory Systems, 2001, 58(2): 109.
[10] Breiman L. Machine Learning, 2001, 45(1): 5.
[11] Cortes C, Vapnik V. Machine Learning, 1995, 20(3): 273.
[12] DENG Li-li, PAN Xiao-qian, SHENG Ji-ping, et al(邓丽莉, 潘晓倩, 生吉萍, 等). Food Science(食品科学), 2012, 33(24): 185.
[13] LIANG Ying, MA Kun, ZHANG Xin-yu, et al(梁 莹, 马 琨, 张馨予, 等). Laser & Optoelectronics Progress(激光与光电子学进展), 2024, 61(15): 1530002.
[14] YANG Kai, HE Yu-ting, LI Wo-lin, et al(杨 凯, 何昱廷, 李沃霖, 等). Journal of Huazhong Agricultural University(华中农业大学学报), 2024, 43(2): 264.
[15] Workman Jr J, Weyer L. Practical Guide to Interpretive Near-Infrared Spectroscopy(近红外光谱解析实用指南). Translated by CHU Xiao-li, XU Yu-peng, TIAN Gao-you(褚小立,许育鹏,田高友,译). Beijing: Chemical Industry Press(北京:化学工业出版社), 2009.
[16] Esteki M, Shahsavari Z, Simal-Gandara J. Food Research International, 2019, 122: 303.
|
[1] |
ZHANG Xin-zhi1, SAIMAITI Patiguli1, ZHANG Lu-wen1, YANG Ya-fei2*, BIAN Xi-hui1, 3*. Research Progress on Geographical Origin Discrimination of Traditional Chinese Medicines Based on Near-Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(09): 2410-2416. |
[2] |
ZHU Li-wei, DU Qian-xi, TANG Guo-hong, LI Hong-you, ZHANG Xiao-na, CHEN Qing-fu, SHI Tao-xiong*. Near-Infrared Modeling for Total Flavonoid and Protein Contents in
Buckwheat Leaves Based on CARS Feature Extraction[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(09): 2585-2589. |
[3] |
ZOU Liang1, KOU Shao-ping1, REN Ke-long1, YUAN Guang-fu2, XU Zhi-bin3, XU Shi-fan1, WU Jing-tao4*. A Collaborative Detection Method for Coal Ash Content and Calorific
Value Based on A-Unet3+ and Portable NIR Spectrometer[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(08): 2210-2217. |
[4] |
HUANG Tian-yu1, YANG Hui-hua1, 2, LI Ling-qiao1*. InvertResNet: A Qualitative Analysis Method for Drug Products Based on Deep Learning and Near-Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(08): 2218-2227. |
[5] |
SHU Qin-da1, ZHANG Jia-hui2*, WANG Qi3, YUE Bao-hua3, LI Qian-qian1*. Construction of a NIR Solid-State Composite Seasoning Freshness AI Model Based on Consumer Sensory Evaluation Ability Assessment[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(08): 2228-2233. |
[6] |
WANG Jia-ying1, ZHU Yu-ting1, BAI Hao1, CHEN Ke-ming1, ZHAO Yan-ru1, 2, 3, WU Ting-ting1, 2, 3, MA Guo-ming4, YU Ke-qiang1, 2, 3*. Detecting the Metal Elements and Soil Organic Matter in Farmland by Dual-Modality Spectral Technologies[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(08): 2317-2325. |
[7] |
ZHANG Gui-yu1, 2, 3, ZHANG Lei1, 2, 3*, TUO Xian-guo1, 3*, WANG Yi-bo1, 3, XIANG Xing-rui1, 3, YAN Jun1, 3. Rapid Near-Infrared Detection of Base Baijiu Using Shapley Additive
Explanation Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(08): 2393-2400. |
[8] |
LAN Xi-hua, WANG Zhi-guo*, LUAN Xiao-li, LIU Fei. Rapid Detection for Xylose Content Using Near-Infrared Spectroscopy Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(07): 1916-1923. |
[9] |
TAN Fang-ping1, LU Tong-suo1, 2*. Study of Near-Infrared Fingerprints of Ganoderma Lucidum in Different Growth Environments[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(07): 1968-1978. |
[10] |
HU Shao-wen, HUANG Lang-xin*, YU Li-hui, WU Zhi-ping, LI Huai-yu, SHI Wei-li, LUO Hong-yu. Intelligent Recognition of Pure Milk Based on Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(07): 1827-1833. |
[11] |
YANG Guang-hui1, 3, ZHANG Yong-li1, 2, 3*, WANG Mei-pan1, 3, LIU Yan-de4, JIANG Xiao-gang4, SUN Jing2, 3, ZHOU Xin-qun2, 3, HAN Tai-lin1*. Determination of Soluble Solids Content in Fresh Corn by Near Infrared
Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(07): 1932-1939. |
[12] |
JU Wei-liang1, YANG Wei1, 2, SONG Ya-mei1, LIU Nan1, LI Hao1, LI Min-zan1, 2*. Prediction of Soil Total Nitrogen Based on NIR Spectroscopy and BiGRU Model With Attention Mechanism[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(07): 2017-2025. |
[13] |
MAO Ya-chun, XIA An-ni*, CAO Wang, LIU Jing, WEN Jie, HE Li-ming, CHEN Xuan-he. Rapid Inversion of Gold Ore Grades Based on Hyperspectral Data and Stacking Ensemble Learning Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(07): 2061-2067. |
[14] |
CHEN Bei, JIANG Si-yuan, ZHENG En-rang. Research on the Wavelength Attention 1DCNN Algorithm for Quantitative Analysis of Near-Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(06): 1598-1604. |
[15] |
LI Xiao-xing1, 2, XIAO Jin-feng1*, ZHANG Hong-ming2*, LÜ Bo2, 3*, YIN Xiang-hui1, ZHAO Ming4, MA Fei5, FU Jia2, HU Yan1, 2, LI Zhi-hao1, 2, WANG Fu-di2, SHEN Yong-cai6, DAI Shu-yu7. Study on Improving Stability of Near-Infrared Spectra by Normal
Distribution Screening Method[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(06): 1566-1577. |
|
|
|
|