|
|
|
|
|
|
Study on Sugar Content Detection of Kiwifruit Using Near-Infrared
Spectroscopy Combined With Stacking Ensemble Learning |
GUO Zhi-qiang1, ZHANG Bo-tao1, ZENG Yun-liu2* |
1. College of Information Engineering, Wuhan University of Technology, Wuhan 430070, China
2. National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, National R&D Center for Citrus Preservation, Wuhan 430070, China
|
|
|
Abstract In this study, we employ near-infrared spectroscopy with Stacking ensemble learning to perform non-destructive sugar content analysis in kiwifruit. Our research focuses on the “Yunhai No.1” kiwifruit variety from Hubei. Using an infrared analyzer, we gathered spectral data from 280 samples, spanning 1 557 wavelengths in the 4 000~10 000 cm-1 range, and measured sugar content with a refractometer. Outliers were identified and excluded using a singular sample identification algorithm that combines Monte Carlo random sampling with a T-test. The SPXY algorithm was then employed to split the data into training and testing sets in a 4∶1 ratio. Data preprocessing involved multiple scattering corrections (MSC), Savitzky-Golay smoothing (SG), de-trending (DT), vector normalization (VN), and standard normal variable (SNV) transformations. Feature wavelengths were initially selected using uninformative variable elimination (UVE), competitive adaptive reweighted sampling (CARS), and interval variable iterative space shrinkage approach (iVISSA), followed by a secondary selection with the successive projections algorithm (SPA) to remove collinear variables. To address the limitations of single models in generalization, we designed an integrated learning model using the Stacking algorithm. This model incorporated Bayesian ridge regression (BRR), partial least squares regression (PLSR), support vector regression (SVR), and artificial neural networks (ANN) as base learners, with linear regression (LR) serving as the meta-learner. We assessed the performance of various ensemble model combinations and analyzed the influence of base learners on ensemble performance using the Pearson correlation coefficient. Experimental results indicated that vector normalization was the most effective among the five preprocessing methods. The VN-CARS-PLSR model demonstrated superior performance, with R2P of 0.805 and RMSEP of 0.498, identifying 177 feature wavelengths and reducing data volume by 88.6% compared to the original spectrum. Comparisons of different base learner combinations in the Stacking algorithm revealed that the PLS+SVR+ANN integrated model achieved the highest predictive accuracy, with R2P of 0.853 and RMSEP of 0.433. The study concludes that the stacking ensemble model offers more comprehensive modeling capabilities and superior generalization than single models, providing valuable technical support for non-destructive sugar quality detection in kiwifruit.
|
Received: 2023-06-01
Accepted: 2024-03-15
|
|
Corresponding Authors:
ZENG Yun-liu
E-mail: zengyl@mail.hzau.edu.cn
|
|
[1] GUO Lin-lin, PANG Rong-li, WANG Rui-ping, et al(郭琳琳,庞荣丽,王瑞萍,等). Journal of Fruit Science(果树学报), 2022, 39(10): 1864.
[2] LU Yu-dan, LIU Xiao-chi, FENG Xin, et al(路喻丹,刘晓驰,冯 新,等). Southeast Horticulture(东南园艺), 2022, 10(2): 137.
[3] ZHAO Zhi-lei, WANG Xue-mei, LIU Dong-dong, et al(赵志磊,王雪妹,刘冬冬,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2022, 42(9): 2836.
[4] WANG Shu-xian, XIAO Hang, YANG Zhen-fa, et al(王淑贤,肖 航,杨振发,等). Laser & Optoelectronics Progress(激光与光电子学进展), 2022, 57(23): 392.
[5] Tan B, You W, Huang C, et al. Electronics, 2022, 11(21): 3504.
[6] Zhang K, Jiang H, Zhang H, et al. Agriculture, 2022, 12(4): 489.
[7] Chen H, Lin B, Cai K, et al. Infrared Physics & Technology, 2021, 112: 103582.
[8] SU Fu, LUO Hai-bo(苏 赋,罗海波). Computer Engineering & Science(计算机工程与科学), 2022, 44(12): 2153.
[9] DING Lan, LUO Pin-liang(丁 岚,骆品亮). Review of Investment Studies(投资研究), 2017, 36(4): 41.
[10] LI Shuai, CHANG Jin-cai, LI Lü-mu-zhi, et al(李 帅,常锦才,李吕牧之,等). Computer Engineering & Science(计算机工程与科学), 2022, 44(8): 1402.
[11] SUN Zhao, LI Yun, JIANG Yu-wu, et al(孙 昭,李 云,江毓武,等). Marine Forecasts(海洋预报), 2023, 40(1): 39.
[12] SONG Hui-juan, CHEN Yao-deng, OUYANG Lin,et al(宋慧娟,陈耀登,欧阳霖,等). Journal of the Meteorological Sciences(气象科学), 2022, 42(5): 569.
[13] Tan Z, Zhang J, He Y, et al. IEEE Access, 2020, 8: 227719.
[14] SHI Jia-qi, ZHANG Jian-hua(史佳琪,张建华). Proceedings of the CSEE(中国电机工程学报), 2019, 39(14): 4032.
[15] Deng B C, Yun Y H, Ma P, et al. The Analyst, 2015, 140(6): 1876.
|
[1] |
MAO Li-yu1, 2, BIN Bin1*, ZHANG Hong-ming2*, LÜ Bo2, 3*, GONG Xue-yu1, YIN Xiang-hui1, SHEN Yong-cai4, FU Jia2, WANG Fu-di2, HU Kui5, SUN Bo2, FAN Yu2, ZENG Chao2, JI Hua-jian2, 3, LIN Zi-chao2, 3. Development of Wheat Component Detector Based on Near Infrared
Spectrum[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(10): 2768-2777. |
[2] |
JIANG Xiao-gang1, 2, HE Cong1, 2, JIANG Nan3, LI Li-sha1, ZHU Ming-wang1, LIU Yan-de1, 2*. Discrimination of Apple Origin and Prediction of SSC Based on
Multi-Model Decision Fusion[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(10): 2812-2818. |
[3] |
MU Liang-yin1, ZHAO Zhong-gai1*, JIN Sai2, SUN Fu-xin2, LIU Fei1. Near-Infrared Prediction Models for Quality Parameters of Culture Broth in Seed Tank During Citric Acid Fermentation[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(10): 2819-2826. |
[4] |
ZHU Yu-kang1, LU Chang-hua1, ZHANG Yu-jun2, JIANG Wei-wei1*. Quantitative Method to Near-Infrared Spectroscopy With Multi-Feature Fusion Convolutional Neural Network Based on Wavelength Attention[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(09): 2607-2612. |
[5] |
MAO Ya-chun1, WEN Jie1*, CAO Wang1, DING Rui-bo1, WANG Shi-jia2, FU Yan-hua3, XU Meng-yuan1. Fusion Algorithm Research Based on Imaging Spectrum of Anshan Iron Ore[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(09): 2620-2625. |
[6] |
WENG Ding-kang1, FAN Zheng-xin1, KONG Ling-fei1, SUN Tong1*, YU Wei-wu2. Rapid Identification of Shelled Bad Torreya Grandis Seeds Based on
Visible-Near Infrared Spectroscopy and Chemometrics[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(09): 2675-2682. |
[7] |
WU Bin1, XIE Chen-ao2, CHEN Yong2, WU Xiao-hong2, JIA Hong-wen1. Discrimination of Chuzhou Chrysanthemum Tea Grades Using Noise
Discriminant C-Means Clustering[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(08): 2202-2207. |
[8] |
WANG Shu-tao1, WAN Jin-cong1*, LIU Shi-yu2, ZHANG Jin-qing1, WANG Yu-tian1. Qualitative Modeling Method of Mango Species in Near Infrared Based on Attention Mechanism Residual Neural Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(08): 2262-2267. |
[9] |
HU Cai-ping1*, FU Zhao-min2*, XU Hong-jia2, WU Bin3, SUN Jun4. Discrimination of Lettuce Storage Time Based on Near-Infrared Spectroscopy Combined With Fuzzy Uncorrelated QR Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(08): 2268-2272. |
[10] |
XIAO Nan1, LI Han-lin1, WENG Ding-kang1, HU Dong1, SUN Tong1*, XIONG Yong-sen2. Rapid Identification of Apple Moldy Core Disease by Near Infrared
Spectroscopy With Information Fusion of Different Illumination
Patterns[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(08): 2388-2394. |
[11] |
XIAO Huai-chun1, LIU Yang1, WEI Bing-xue1, GAO Jia-rong1, LIU Yan-de2, XIAO Hui1. Identification of Visible and Short Wave Near Infrared Spectra of
Super-Enriched Plants in Uranium Ore Area[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 1813-1819. |
[12] |
HUANG Hua1, LIU Ya2, MA Yi-hang1, XIANG Si-han1, HE Jia-ning1, WANG Shi-ting1, GUO Jun-xian3*. Prediction of Soluble Solid Contents in Apples Using Vis-NIRS and
Functional Linear Regression Model[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 1905-1912. |
[13] |
CUI Hao-fan1, LIU Hong-zhi1, GUO Qin1*, GU Feng-ying1, ZHANG Yu2, WANG Qiang1*. Establishment of High-Throughput Model of Peanut Protein Components and Subunits by Near-Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 1982-1987. |
[14] |
YANG Sen1, WANG Zhen-min1*, SONG Wen-long1, XING Jian1, DAI Jing-min2. Optimization of Polished Rice Varieties Discrimination Based on
Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 1988-1992. |
[15] |
NIU Xiao-ying1, 2, 3, MU Xiao-qing1, 2, 3, SUN Jie1, 2, 3, ZHAO Zhi-lei1, 2, 3*, ZHANG Chun-jiang4. Qualitative and Quantitative Analyses of Cooked Donkey Meat
Adulteration Based on NIR Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 1993-2001. |
|
|
|
|