VIS-NIR Hyperspectral Prediction of Soil Organic Matter Based on
Stacking Generalization Model
ZHANG Xiu-quan1, LI Zhi-wei1*, ZHENG De-cong1*, SONG Hai-yan1, WANG Guo-liang2
1. College of Agricultural Engineering, Shanxi Agricultural University, Taigu 030801, China
2. Millet Research Institute, Shanxi Agricultural University, Changzhi 046000, China
Abstract:Accurate prediction of soil organic matter content is helpful in evaluating farmland fertility and provide a data for precision agriculture. In order to solve the problems of low accuracy and weak Generalization ability of a single model for rapid estimation of organic matter content in farmland surface soil. The surface soil of typical cinnamon farmland in Shanxi Province was studied,a Stacked Generalization Model (SGM) was proposed based on VIS-NIR hyperspectral data for predicting organic matter content. Firstly, the original hyperspectral data are smoothed by wavelet, and the reciprocal derivative and logarithmic reciprocal derivative transform are performed on the smoothed data. The feature bands are extracted by correlation coefficient and recursive feature elimination method. At the same time, Ensemble learning Random Forest (RF), Gradient Boosting Decision Tree (GBDT) and eXtreme Gradient Boosting are introduced in machine learning (XGBoost), and Adaboost are used to predict organic matter content through 5-fold cross-validation. Based on the prediction results of the primary learner, Stochastic gradient Descent (SGD) is used as a meta-learner to establish the SGM stack generalization model. The limitation of low accuracy and instability of a single model is broken through to realize the rapid and stable detection of organic matter content. The results show a good correlation between the spectral information and organic matter content after the penultimate differential transformation, and the maximum correlation is -0.611. Compared with the single model, the decision coefficient (R2) and relative analysis error (RPD) of the stacked generalization prediction model are 0.819 and 2.256, respectively, which are 0.055 and 0.323 higher than the average decision coefficient (R2) and relative analysis error (RPD) of other algorithms, respectively. The mean absolute error (MAE) and root mean square error (RMSE) are 1.742 and 2.308 g·kg-1, respectively, which are 0.406 and 0.389 g·kg-1lower than those of other algorithms. The optimization effect is obvious. It can be used to estimate organic matter content in farmland soil surfaces effectively. The results can provide a basis and reference for the rapid detection of organic matter content in farmland soil surface by hyperspectral method.
张秀全,李志伟,郑德聪,宋海燕,王国梁. 基于近红外-可见光高光谱的堆叠泛化模型褐土有机质预测[J]. 光谱学与光谱分析, 2023, 43(03): 903-910.
ZHANG Xiu-quan, LI Zhi-wei, ZHENG De-cong, SONG Hai-yan, WANG Guo-liang. VIS-NIR Hyperspectral Prediction of Soil Organic Matter Based on
Stacking Generalization Model. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(03): 903-910.
[1] TANG Hai-tao,MENG Xiang-tian,SU Xun-xin,et al(唐海涛,孟祥添,苏循新,等). Transactions of the Chinese Society of Agricultural Engineering(农业工程学报),2021,37(2):105.
[2] ZHAO Ming-song,XIE Yi,LU Long-mei,et al(赵明松,谢 毅,陆龙妹,等). Acta Pedologica Sinica(土壤学报),2021,58(1):42.
[3] ZHANG Zhi-tao,LAO Cong-cong,WANG Hai-feng,et al(张智韬,劳聪聪,王海峰,等). Transactions of the Chinese Society for Agricultural Machinery(农业机械学报),2020,51(1):156.
[4] Sorenson P T,Quideau S A,Rivard B. Geoderma,2018,315:170.
[5] WANG Hai-feng,ZHANG Zhi-tao,Arnon Karnieli,et al(王海峰,张智韬,Arnon Karnieli,等). Transactions of the Chinese Society of Agricultural Engineering(农业工程学报),2018,34(14):124.
[6] Conforti M,Buttafuoco G,Leone A P,et al. Catena,2013,110:44.
[7] Gomez C,Rossel R A V,Mcbratney A B. Geoderma,2008,146(3-4):403.
[8] HONG Yong-sheng,ZHU Ya-xing,SU Xue-ping,et al(洪永胜,朱亚星,苏学平,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析),2017,37(11):3537.
[9] Tan K,Ma W,Wu F,et al. Environmental Monitoring and Assessment,2019,191(7):446.
[10] Guo P T,Wu W,Sheng Q K,et al. Nutrient Cycling in Agroecosystems,2013,95(3):333.
[11] ZHONG Liang,GUO Xi,GUO Jia-xin,et al(钟 亮,郭 熙,国佳欣,等). Transactions of the Chinese Society of Agricultural Engineering(农业工程学报),2021,37(1):203.
[12] LÜ Pin,YU Wen-bing,WANG Xin,et al(吕 品,于文兵,汪 鑫,等). Acta Electronica Sinica(电子学报),2019,47(10):2228.
[13] ZHANG Juan-juan,XI Lei,YANG Xiang-yang,et al(张娟娟,席 磊,杨向阳,等). Transactions of the Chinese Society of Agricultural Engineering(农业工程学报),2020,36(17):135.
[14] LIU Tian-lin,ZHU Xi-cun,BAI Xue-yuan,et al(刘恬琳,朱西存,白雪源,等). Smart Agriculture(智慧农业),2020,2(3):129.
[15] ZHOU Zhi-hua(周志华). Maching Learning(机器学习). Beijing: Tsinghua University Press(北京:清华大学出版社),2016.
[16] Friedman J H, Meulman J J. Statistics in Medicine, 2003,22(9):1365.
[17] Wang Bin,Fan Shi dong,Jiang Pan,et al. Ocean Engineering,2020,217:108001.