Using Different Data Mining Algorithms to Predict Soil Organic Matter Based on Visible-Near Infrared Spectroscopy
JI Wen-jun1, LI Xi1, LI Cheng-xue2, ZHOU Yin1, SHI Zhou1, 3*
1. Institute of Remote Sensing and Information Technology, Zhejiang University, Hangzhou 310058, China 2. College of Resources and Environment, Yunnan Agricultural University, Kunming 650201, China 3. Cyrus Tang Center for Sensor Materials and Applications, Zhejiang University, Hangzhou 310058, China
Abstract:Using visible/near infrared spectroscopy to model soil properties is very important in current soil sensing research. It can be applied to rapidly access soil information and precision management. In the present study, paddy soil in Zhejiang Province is treated as the research samples. The nonlinear models such as random forests (RF), supported vector machines (SVM) and artificial neural networks (ANN) were used respectively to build models to predict soil organic matter based on different selection of calibration and validation datasets. The results show that there is a certain impact on prediction results under the division of different sample modes. Compared to the commonly used linear model PLSR, the nonlinear model RF and SVM have comparable prediction accuracy, especially predictions by SVM using all Vis-NIR wavelengths produced the smallest RMSE values. It shows that the model constructed by SVM method has a good predictive ability. In addition, a combined method, PLSR-ANN (with the introduction of ANN into PLSR), significantly improves the predictive ability of PLSR. Even though ANNs are “black box” systems the combination of PLSR and nonliner modelling helps achieve good predictions and interpretability.
纪文君1, 李 曦1, 李成学2, 周 银1, 史 舟1, 3* . 基于全谱数据挖掘技术的土壤有机质高光谱预测建模研究[J]. 光谱学与光谱分析, 2012, 32(09): 2393-2398.
JI Wen-jun1, LI Xi1, LI Cheng-xue2, ZHOU Yin1, SHI Zhou1, 3* . Using Different Data Mining Algorithms to Predict Soil Organic Matter Based on Visible-Near Infrared Spectroscopy . SPECTROSCOPY AND SPECTRAL ANALYSIS, 2012, 32(09): 2393-2398.
[1] Viscarra Rossel R A, Cattle S R, Ortega A, et al. Geoderma, 2009, 150: 253. [2] JI Wen-jun, SHI Zhou, ZHOU Qing, et al(纪文君, 史 舟, 周 清, 等). Journal of Infrared and Millimeter Waves(红外与毫米波学报), 2012, 31(3): 277. [3] Stenberg B, Viscarra Rossel R A, Mouazen A M, et al. Advances in Agronomy, 2010, 107: 163. [4] LIU Lei, SHEN Run-ping, DING Guo-xiang(刘 磊, 沈润平, 丁国香). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2011, 31(3): 762. [5] SHEN Run-ping, DING Guo-xiang, WEI Guo-shuan, et al(沈润平, 丁国香, 魏国栓, 等). Acta Pedologica Sinica(土壤学报), 2009, 46(3): 391. [6] ZHENG Li-hua, LI Min-zan, SUN Hong, et al(郑立华,李民赞,孙 红, 等). Transactions of the Chinese Society of Agricultural Engineering(农业工程学报), 2010, 26: 81. [7] Joel B S, David J B, Melisa L B, et al. Geoderma, 2008, 148: 149. [8] Volkan Bilgili A, van Es H M, Akbas F, et al. Journal of Arid Environments, 2010,(74): 229. [9] Viscarra Rossel R A, Behrens T. Geoderma, 2010,158: 46. [10] ZHANG Juan-juan, TIAN Yong-chao, ZHU Yan, et al(张娟娟, 田永超, 朱 艳, 等). Scientia Agricultura Sinica(中国农业科学), 2009, 42(9): 3154. [11] Tormod N, tomas I, Tom F, et al. Auser-Friedly Quide to Multivariate Calibration and Classification. Chicester, UK: NIR Publications, 2002. 27.