Abstract:To meet the requirements of buckwheat quality determination and breeding work, the Competitive Adaptive Re-Weighted Sampling (CARS) algorithm was used in this study to extract the characteristic spectrum and combined with the quantitative partial least squares method to rapidly determine the total flavonoid and protein content in buckwheat leaves. First, the Kennard-Stone (KS) algorithm was used to split the training and test sets. The training set's average, maximum, and minimum total flavonoid contents were 55.8, 92.5 and 28.1 mg·g-1, respectively. The test set's average, maximum, and minimum total flavonoid contents were 71.0, 99.8 and 31.5 mg·g-1, respectively. The training set's average, maximum, and minimum protein contents were 169.6, 331.0 and 121.2 mg·g-1, respectively. The samples' average, maximum, and minimum protein contents in the test set are 158.2, 183.0 and 129.1 mg·g-1, respectively. Then use Normalization, Normalization + Multiplicative Scatter Correction (MSC), Normalization + Standard Normal Variate Transform (SNV), Normalization + First Derivative, Normalization + Second Order Derivative, Normalization + Savitzky-Golay Smoothing Filter (SG) to preprocess the spectrum in the wavelength range from 4 000 to 12 000 cm-1, then use CARS algorithm to extract the characteristic spectrum, and finally use the partial least squares method to build prediction models. Through a comprehensive analysis of the coefficient of determination of the training model (Rc), the coefficient of determination of the test model (Rp), the root mean square error of cross-validation (RMSECV), the root mean square error of the test model (RMSEP) and the residual predictive deviation (RPD), we obtain the best model for the prediction of total flavonoid and protein in buckwheat. Three available prediction models for total flavonoidswere constructed. The best prediction model used 46 characteristic wavenumber points out of 1102 wavenumber points. The preprocessing method used was normalization + first derivative. The model's Rc, Rp, RMSECV, RMSEP, and RPD were 0.997, 0.933, 0.170, 0.829 and 2.893, respectively. Four available protein prediction models were created, the best of which used 42 characteristic wavenumber points, and the preprocessing method used was normalization + second derivative. The model'sRc, Rp, RMSECV, RMSEP, and RPD are 0.998, 0.965, 0.202, 0.353 and 3.849, respectively. The results show that the application of the KS algorithm and CARS algorithm in building the near-infrared spectroscopy model requires fewer samples to build a reliable prediction model, enables the rapid determination of total flavonoids and protein of buckwheat leaves, and provides powerful tools for buckwheat breeding.
Key words:Near infrared spectroscopy; Buckwheat; KS algorithm; CARS algorithm; Total flavonoids; Protein
朱丽伟,杜千禧,唐国红,李洪有,张晓娜,陈庆富,石桃雄. CARS算法提取特征光谱建立荞麦叶片总黄酮与蛋白质近红外模型[J]. 光谱学与光谱分析, 2025, 45(09): 2585-2589.
ZHU Li-wei, DU Qian-xi, TANG Guo-hong, LI Hong-you, ZHANG Xiao-na, CHEN Qing-fu, SHI Tao-xiong. Near-Infrared Modeling for Total Flavonoid and Protein Contents in
Buckwheat Leaves Based on CARS Feature Extraction. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(09): 2585-2589.
[1] Li H Y, Lv Q Y, Liu A K, et al. Food Chemistry, 2022, 371:131125.
[2] REN Chang-zhong, SHAN Fang, WANG Min, et al(任长忠, 陕 方, 王 敏, 等). Journal of the Chinese Cereals and Oils Association(中国粮油学报), 2022, 37(11): 261.
[3] Joshi D C, Zhang K, Wang C, et al. Biotechnology Advances, 2020, 39:107479.
[4] Huan K W, Chen X, Song X Y, et al. Infrared Physics & Technology, 2021, 119:103937.
[5] Yu Y, Chai Y H, Li Z T, et al. Food Chemistry, 2025, 462: 141033.
[6] LIU Xin-peng, QIN Yu-hua, ZHANG Feng-mei, et al(刘鑫鹏, 秦玉华, 张凤梅, 等). Journal of Instrumental Analysis(分析测试学报), 2022, 41(11): 1621.
[7] PAN Xiu-zhen, LIU Lu-lu, REN Xiao-hong, et al(潘秀珍,刘路路,任晓红,等). Journal of Beijing University of Traditional Chinese Medicine(北京中医药大学学报),2023,46(9):1280.
[8] ZHU Li-wei, ZHOU Yan, CAI Fang, et al(朱丽伟, 周 焱, 蔡 芳, 等). Spectroscopy and Spectroscopy Analysis(光谱学与光谱分析), 2020, 40(8): 2421.
[9] Xiao B, Li S Z, Dou S Q, et al. Computers and Electronics in Agriculture, 2024, 217: 108559.
[10] Zhang Z Y, Zeng S C, Ji T K, et al. Computers and Electronics in Agriculture, 2023, 210: 107882.
[11] Deng S L, Wang F, Xiong M, et al. Microchemical Journal, 2024, 207: 111920.
[12] Zhu L W, Du Q X, Shi T X, et al. Agronomy, 2024, 14: 1826.
[13] Yang Y, She X T, Cao X Q, et al. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2022, 277: 121249.
[14] Khamsopha D, Woranitta S, Teerachaichayut S. Food Control, 2021, 123: 107781.
[15] Chai Y H, Yu Y, Zhu H, et al. Current Research in Food Science, 2023, 7: 100573.
[16] Galvao R K H, Araujo M C U, Jose G E, et al. Talanta, 2005, 67(4): 736.
[17] Yu Y, Chai Y H, Li Z T, et al. Food Chemistry, 2025, 462: 141033.
[18] Rinnan Å, van der Berg F W J, Engelsen S B. TrAC Trends in Analytical Chemistry, 2009, 28(10): 1201.
[19] Rabatel G, Marini F, Walczak B, et al. Journal of Chemometrics, 2019, 34(2): e3164.
[20] Hicks J M, Muhammad A, Ferrier J, et al. Journal of AOAC INTERNATIONAL, 2012, 95(5): 1406.