|
|
|
|
|
|
A Comparative Study of the COD Hyperspectral Inversion Models in
Water Based on the Maching Learning |
WANG Chun-ling1, 2, SHI Kai-yuan1, 2, MING Xing3*, CONG Mao-qin3, LIU Xin-yue3, GUO Wen-ji3 |
1. School of Information Science and Technology,Beijing Forestry University, Beijing 100083,China
2. Engineering Research Center for Forestry-oriented Intelligent Information Processing of National Forestry and Grassland Administration,Beijing 100083,China
3. Nanjing Institute of Software Technology, Institute of Software Chinese Academy of Sciences, Nanjing 210049, China
|
|
|
Abstract Chemical oxygen demand (COD) is an important indicator of organic pollution in water. How to quickly and accurately test the COD content of water is particularly important. The application of machine learning in the field of water quality inversion is increasing, and more research results have been obtained. Hyperspectral remote sensing has the advantages of high spectral-spatial resolution and multiple imaging channels, so it has great potential in retrieving water’s COD. This study uses different hyperspectral pre-processing methods to process the original hyperspectral data. It uses the hyperspectral data before and after processing to compare the inversion performance of different machine learning models and different hyperspectral pre-processing methods on the COD content of water. Firstly, 1 548 groups of COD content and corresponding hyperspectral data (400~1 000 nm) samples were collected by ZK-UVIR-I in-situ spectral water quality on-line monitor in Baodai River. In order to reduce the interference of spectral noise and eliminate the influence of spectral scattering, Savitzky-Golay (SG) smoothing, Multiplicative scatter correction (MSC) and SG smoothing combined with MSC methods were used to pre-process the original spectra. Secondly, the sample set is randomly divided into training set and test set, where the training set accounts for 80% and the test set accounts for 20%. A COD hyperspectral inversion model based on the four machine learning methods of linear regression, random forest (random forest), AdaBoost, and XGBoost was established for the pre-processed training set full-band spectrum. Moreover, three indexes of determination coefficient (R2), root mean square error (RMSE) and relative analysis error (RPD) were selected to evaluate the accuracy of the hyperspectral inversion model. The results show that random forest, AdaBoost and XGboost are all the better than linear regression. The prediction ability of the inversion model established by XGboost is the best whether the spectral data is processed or not, with R2 of 0.92, RMSE of 7.1 mg·L-1, and RPD of 3.4. Considering that the original spectrum may be redundant, the dimensionality reduction of the spectrum after SG smoothing and MSC processing is performed by principal component analysis (PCA), and the top ten principal components with a cumulative contribution rate of 95% are selected as the input variables of the model. XGBoost established the inversion model, and the results show that after PCA, the accuracy of the inversion model is improved, the RPD is 3.8, and the training time of the model is shortened from 72 seconds to 2.9 seconds. The above research can provide new methods and ideas for establishing hyperspectral inversion models of this water area and similar water areas.
|
Received: 2021-06-15
Accepted: 2021-11-02
|
|
Corresponding Authors:
MING Xing
E-mail: mingnix@163.com
|
|
[1] Chander S, Gujrati A, Hakeem K A, et al. Current Science, 2019, 116(7):1172.
[2] Gidudu A, Mugo R, Letaru L, et al. African Journal of Aquatic Science, 2018, 43(2):141.
[3] Usali N, Ismail M H. Journal of Sustainable Development, 2010, 3(3):228.
[4] LI Xin-xing, GUO Wei, BAI Xue-bing, et al(鑫 星, 郭 渭, 白雪冰, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2021, 41(5):1343.
[5] LIU Li-xin, HE Di, LI Meng-zhu, et al(刘立新, 何 迪, 李梦珠, 等). Chinese Journal of Lasers(中国激光), 2020, 47(11):291.
[6] Hu Z T, Zhou Y. Geospatial Information, 2020, 18(7):4.
[7] Liu J X, Zhai W L, Li J F, et al. Contribu-Tions to Geology and Mineral Resources Research, 2020, 35(04):487.
[8] PAN De-lu, MA Rong-hua(潘德炉, 马荣华). Lake Science(湖泊科学), 2008,(2):139.
[9] Cao Y, Ye Y T, Zhao H L. China Environmental Science, 2017, 37(10):3940.
[10] Blix K, Eltoft T. Remote Sensing, 2018, 10(05):775.
[11] Hafeez S, Wong M S, Ho H C,et al. Remote Sensing, 2019, 11(6):617.
[12] Lu H, Ma X. Chemosphere, 2020, 249:126169.
[13] LI Yuan-bo, CAO Han(李远博, 曹 菡). Computer Technology and Development(计算机技术与发展), 2016, 26(2):26.
[14] Biau G, Scornet E. Test, 2016, 25(2):197.
[15] Schapire R E. Explaining Ababoost Empirical Inference, Springer, Berlin, Heidelberg, 2013:37. |
[1] |
LIU Ye-kun, HAO Xiao-jian*, YANG Yan-wei, HAO Wen-yuan, SUN Peng, PAN Bao-wu. Quantitative Analysis of Soil Heavy Metal Elements Based on Cavity
Confinement LIBS Combined With Machine Learning[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2387-2391. |
[2] |
LI Bin, YIN Hai, ZHANG Feng, CUI Hui-zhen, OUYANG Ai-guo*. Research on Protein Powder Adulteration Detection Based on
Hyperspectral Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2380-2386. |
[3] |
WU Ye-lan1, GUAN Hui-ning1, LIAN Xiao-qin1, YU Chong-chong1, LIAO Yu2, GAO Chao1. Study on Detection Method of Leaves With Various Citrus Pests and
Diseases by Hyperspectral Imaging[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2397-2402. |
[4] |
WANG Yan1, 2, 3, WANG Bao-rui1, 2, 3*, WANG Yue1, 2, 3. Study on Radical Characteristics of Methane Laminar Premixed Flame Based on Hyperspectral Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2403-2410. |
[5] |
LI Hong-qiang1, SUN Hong2, LI Min-zan2*. Study on Identification of Common Diseases in Potato Storage Period Based on Spectral Structure[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2471-2476. |
[6] |
GUO Jing-jing1, YU Hai-ye1, LIU Shuang2, XIAO Fei1, ZHAO Xiao-man1, YANG Ya-ping1, TIAN Shao-nan1, ZHANG Lei1*. Study on the Hyperspectral Discrimination Method of Lettuce Leaf
Greenness[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2557-2564. |
[7] |
FENG Tian-shi1, 2, 3, PANG Zhi-guo1, 2, 3*, JIANG Wei1, 2, 3. Remote Sensing Retrieval of Chlorophyll-a Concentration in Lake Chaohu Based on Zhuhai-1 Hyperspectral Satellite[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2642-2648. |
[8] |
ZHOU Ming-rui1, 2, QU Jiang-bei2, LI Peng1, 2*, HE Yi-liang1, 2. The “Cluster-Regression” COD Prediction Model of Distributed Rural Sewage Based on Three-Dimensional Fluorescence Spectrum and
Ultraviolet-Visible Absorption Spectrum[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2113-2119. |
[9] |
YANG Qiao-ling1, 2, CHEN Qin2, NIU Bing2, DENG Xiao-jun3*, MA Jin-ge3, GU Shu-qing3, YU Yong-ai4, GUO De-hua3, ZHANG Feng5. Visualization of Thiourea in Bulk Milk Powder Based on Portable Raman Hyperspectral Imaging Technology On-Site Rapid Detection Method
Research[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2156-2162. |
[10] |
WANG Cai-ling1, WANG Bo2, JI Tong3, XU Jun4, JU Feng5, WANG Hong-wei6*. Simulated Estimation of Nitrite Content in Water Based on
Transmission Spectrum[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2181-2186. |
[11] |
YANG Jie-kai1, GUO Zhi-qiang1, HUANG Yuan2, 3*, GAO Hong-sheng1, JIN Ke1, WU Xiang-shuai2, YANG Jie1. Early Classification and Detection of Melon Graft Healing State Based on Hyperspectral Imaging[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2218-2224. |
[12] |
ZHANG Fu-jie, SHI Lei, LI Li-xia*, ZHAO Hao-ran, ZHU Yin-long. Study on Nondestructive Identification of Panax Notoginseng Powder Quality Grade Based on Hyperspectral Imaging Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2255-2261. |
[13] |
HUANG Qing1, XUE He-ru1*, LIU Jiang-ping1*, LIU Mei-chen1, HU Peng-wei1, SUN De-gang2. Spectral Selection Method Based on Ant Colony-Genetic Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2262-2268. |
[14] |
WANG Zhi-hao, YIN Yong*, YU Hui-chun, YUAN Yun-xia, XUE Shu-ning. Early Warning Method of Apple Spoilage Based on 2D Hyperspectral
Information Representation With Pixel Mean and Variance[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2290-2296. |
[15] |
WANG Jian-wei1, 2, LI Wei-yan1, SUN Jian-ying1, LI Bing1, CHEN Xin-wen1, TAN Zheng1, ZHAO Na1, LIU Yang-yang1, 3, LÜ Qun-bo1, 3*. Fast Spectral Calibration Method of Spectral Imager[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2013-2017. |
|
|
|
|