Study on Inversion Model of Soil Heavy Metal Content Based on NMF-PLS Water Content
WU Xi-jun1, ZHANG Jie1, XIAO Chun-yan2, ZHAO Xue-liang1, 3, LI Kang3, PANG Li-li3, SHI Yan-xin3, LI Shao-hua4
1. Hebei Province Key Laboratory of Test/Measurement Technology and Instrument, School of Electrical Engineering, Yanshan University, Qinhuangdao 066004, China
2. School of Resources and Environment, Henan University of Technology, Jiaozuo 454000, China
3. Center for Hydrogeology and Environmental Geology, China Geological Survey, Geological Environment Monitoring Engineering Technology Innovation Center of The Ministry of Natural Resources, Baoding 071051, China
4. Hebei Sailhero Environmental Protection Hi-Tech Co., Ltd., Shijiazhuang 050000, China
Abstract:The excessively high content of heavy metals in the soil is hugely harmful, not only causing serious environmental pollution, but entering the human body through the food chain poses a serious threat to human health, so it is very important for heavy metal detection. X-ray fluorescence spectroscopy has been widely used because of its short detection time, non-destructive testing, and low testing costs. However, the detection of spectral data is severely disturbed by soil moisture factors, which leads to lower accuracy in estimating the heavy metal content in the soil directly. Taking the soil samples of Mancheng District, Baoding City, Hebei Province as the research object, the collected soil samples were cleaned, screened, dried, and then added with a certain amount of heavy metal solution to prepare samples with different water content and heavy metals for detection. The Mahalanobis distance and NJW clustering were calculated for the abnormal data in the experiment, and the influence of soil moisture content on the heavy metal spectrum was analyzed, the results show that the spectral repeatability of different water content is poor, and the spectral intensity decreases nonlinearly with the increase of soil water content. The Savitzky-Golay convolution smoothing denoising method and linear background method are used to preprocess the spectrum to solve the problems of noise and baseline drift caused by the environment and the instrument itself. A non-negative matrix factorization algorithm was used to deal with the peak signal-to-noise ratio evaluation model to determine the number of end elements. The results show that the peak signal-to-noise ratio tends to increase when the number of end elements increases to 10. The stable fluctuation is very small. After the non-negative matrix decomposition treatment, the spectrum repeatability and similarity are good among the same heavy metal content and different water content. The correlation coefficient between the spectra is calculated to prove the similarity between the spectra further. A partial least squares prediction model was established after removing the water content for spectral interference. In order to verify the accuracy of the prediction model, a partial least squares prediction model with no water content removed was established, and the partial water content was removed by orthogonalization with external parameters The least squares prediction model is evaluated using the evaluation parameter determination coefficient (R2), cross-validated root mean square error (RMSECV), average absolute error (MAE), and relative analysis error (RPD). Validation results show that compared to models built without removing water content, non-negative moments are used partial least squares model established by matrix decomposition and removal of water content R2 and RPD increased by 0.019 7 and 1.029 2, RMSECV and MAE decreased by 2.386 3 and 1.439 6; Compared to the partial least squares model established by the external parameter orthogonalization method, the RPD and RPD increased by 0.009 9 and 0.108 1, and the RMSECV and MAE decreased by 0.244 7 and 0.356 6, it is shown that the model established after denoising by non-negative matrix decomposition can effectively improve the accuracy and robustness of prediction. Non-negative matrix factorization can effectively eliminate the effect of soil water content on the spectrum, and the partial least squares model established on this basis has realized the inversion of soil heavy gold content and provided certain technical support for quantitative detection of heavy metals.
Key words:Soil heavy metals; Energy dispersive X-ray fluorescence spectra; Non-negative matrix factorization; Partial least squares
吴希军,张 杰,肖春艳,赵学亮,李 康,庞丽丽,史彦新,李少华. 基于NMF-PLS对含水量影响下土壤重金属含量反演模型研究[J]. 光谱学与光谱分析, 2021, 41(01): 271-277.
WU Xi-jun, ZHANG Jie, XIAO Chun-yan, ZHAO Xue-liang, LI Kang, PANG Li-li, SHI Yan-xin, LI Shao-hua. Study on Inversion Model of Soil Heavy Metal Content Based on NMF-PLS Water Content. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(01): 271-277.
[1] Martinuzzi Sebastián, Januchowski‐Hartley Stephanie R, Pracheil B M, et al. Global Change Biology, 2014, 20(1): 113.
[2] Zheng R, Zhao J L, Zhou X, et al. Pedosphere, 2016, 26(1): 74.
[3] Ji W, Rossel R A V, Shi Z. European Journal of Soil Science, 2015, 66(3): 555.
[4] HU Xiao-yan, CUI Xu, HAN Xiao-ping, et al(胡晓艳,崔 旭,韩小平,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2019, 39(4): 1059.
[5] Minasny B, Mcbratney A B, Bellon-Maurel V, et al. Geoderma, 2011, 167-168: 118.
[6] Maruyama R, Maeda K, Moroda H, et al. Neural Networks, 2014, 55(55c): 11.
[7] Lee D D, Seung H S. Proceedings of the 13th International Conference on Neural Information Processing Systems. MIT Press, 2000.
[8] Lin C J. IEEE Transactions on Neural Networks, 2007, 18(6): 1589.
[9] Sakhaii P, Bermel W. Journal of Magnetic Resonance, 2014, 242(3): 220.
[10] YU Lei, HONG Yong-sheng, ZHOU Yong, et al(于 雷,洪永胜,周 勇,等). Transactions of the Chinese Society of Agricultural Engineering(农业工程学报), 2016, 32(13): 95.