|
|
|
|
|
|
A Comparative Study of the COD Hyperspectral Inversion Models in
Water Based on the Maching Learning |
WANG Chun-ling1, 2, SHI Kai-yuan1, 2, MING Xing3*, CONG Mao-qin3, LIU Xin-yue3, GUO Wen-ji3 |
1. School of Information Science and Technology,Beijing Forestry University, Beijing 100083,China
2. Engineering Research Center for Forestry-oriented Intelligent Information Processing of National Forestry and Grassland Administration,Beijing 100083,China
3. Nanjing Institute of Software Technology, Institute of Software Chinese Academy of Sciences, Nanjing 210049, China
|
|
|
Abstract Chemical oxygen demand (COD) is an important indicator of organic pollution in water. How to quickly and accurately test the COD content of water is particularly important. The application of machine learning in the field of water quality inversion is increasing, and more research results have been obtained. Hyperspectral remote sensing has the advantages of high spectral-spatial resolution and multiple imaging channels, so it has great potential in retrieving water’s COD. This study uses different hyperspectral pre-processing methods to process the original hyperspectral data. It uses the hyperspectral data before and after processing to compare the inversion performance of different machine learning models and different hyperspectral pre-processing methods on the COD content of water. Firstly, 1 548 groups of COD content and corresponding hyperspectral data (400~1 000 nm) samples were collected by ZK-UVIR-I in-situ spectral water quality on-line monitor in Baodai River. In order to reduce the interference of spectral noise and eliminate the influence of spectral scattering, Savitzky-Golay (SG) smoothing, Multiplicative scatter correction (MSC) and SG smoothing combined with MSC methods were used to pre-process the original spectra. Secondly, the sample set is randomly divided into training set and test set, where the training set accounts for 80% and the test set accounts for 20%. A COD hyperspectral inversion model based on the four machine learning methods of linear regression, random forest (random forest), AdaBoost, and XGBoost was established for the pre-processed training set full-band spectrum. Moreover, three indexes of determination coefficient (R2), root mean square error (RMSE) and relative analysis error (RPD) were selected to evaluate the accuracy of the hyperspectral inversion model. The results show that random forest, AdaBoost and XGboost are all the better than linear regression. The prediction ability of the inversion model established by XGboost is the best whether the spectral data is processed or not, with R2 of 0.92, RMSE of 7.1 mg·L-1, and RPD of 3.4. Considering that the original spectrum may be redundant, the dimensionality reduction of the spectrum after SG smoothing and MSC processing is performed by principal component analysis (PCA), and the top ten principal components with a cumulative contribution rate of 95% are selected as the input variables of the model. XGBoost established the inversion model, and the results show that after PCA, the accuracy of the inversion model is improved, the RPD is 3.8, and the training time of the model is shortened from 72 seconds to 2.9 seconds. The above research can provide new methods and ideas for establishing hyperspectral inversion models of this water area and similar water areas.
|
Received: 2021-06-15
Accepted: 2021-11-02
|
|
Corresponding Authors:
MING Xing
E-mail: mingnix@163.com
|
|
[1] Chander S, Gujrati A, Hakeem K A, et al. Current Science, 2019, 116(7):1172.
[2] Gidudu A, Mugo R, Letaru L, et al. African Journal of Aquatic Science, 2018, 43(2):141.
[3] Usali N, Ismail M H. Journal of Sustainable Development, 2010, 3(3):228.
[4] LI Xin-xing, GUO Wei, BAI Xue-bing, et al(鑫 星, 郭 渭, 白雪冰, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2021, 41(5):1343.
[5] LIU Li-xin, HE Di, LI Meng-zhu, et al(刘立新, 何 迪, 李梦珠, 等). Chinese Journal of Lasers(中国激光), 2020, 47(11):291.
[6] Hu Z T, Zhou Y. Geospatial Information, 2020, 18(7):4.
[7] Liu J X, Zhai W L, Li J F, et al. Contribu-Tions to Geology and Mineral Resources Research, 2020, 35(04):487.
[8] PAN De-lu, MA Rong-hua(潘德炉, 马荣华). Lake Science(湖泊科学), 2008,(2):139.
[9] Cao Y, Ye Y T, Zhao H L. China Environmental Science, 2017, 37(10):3940.
[10] Blix K, Eltoft T. Remote Sensing, 2018, 10(05):775.
[11] Hafeez S, Wong M S, Ho H C,et al. Remote Sensing, 2019, 11(6):617.
[12] Lu H, Ma X. Chemosphere, 2020, 249:126169.
[13] LI Yuan-bo, CAO Han(李远博, 曹 菡). Computer Technology and Development(计算机技术与发展), 2016, 26(2):26.
[14] Biau G, Scornet E. Test, 2016, 25(2):197.
[15] Schapire R E. Explaining Ababoost Empirical Inference, Springer, Berlin, Heidelberg, 2013:37. |
[1] |
WANG Cai-ling1,ZHANG Jing1,WANG Hong-wei2*, SONG Xiao-nan1, JI Tong3. A Hyperspectral Image Classification Model Based on Band Clustering and Multi-Scale Structure Feature Fusion[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 258-265. |
[2] |
GAO Hong-sheng1, GUO Zhi-qiang1*, ZENG Yun-liu2, DING Gang2, WANG Xiao-yao2, LI Li3. Early Classification and Detection of Kiwifruit Soft Rot Based on
Hyperspectral Image Band Fusion[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 241-249. |
[3] |
WU Hu-lin1, DENG Xian-ming1*, ZHANG Tian-cai1, LI Zhong-sheng1, CEN Yi2, WANG Jia-hui1, XIONG Jie1, CHEN Zhi-hua1, LIN Mu-chun1. A Revised Target Detection Algorithm Based on Feature Separation Model of Target and Background for Hyperspectral Imagery[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 283-291. |
[4] |
CHU Bing-quan1, 2, LI Cheng-feng1, DING Li3, GUO Zheng-yan1, WANG Shi-yu1, SUN Wei-jie1, JIN Wei-yi1, HE Yong2*. Nondestructive and Rapid Determination of Carbohydrate and Protein in T. obliquus Based on Hyperspectral Imaging Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3732-3741. |
[5] |
HUANG You-ju1, TIAN Yi-chao2, 3*, ZHANG Qiang2, TAO Jin2, ZHANG Ya-li2, YANG Yong-wei2, LIN Jun-liang2. Estimation of Aboveground Biomass of Mangroves in Maowei Sea of Beibu Gulf Based on ZY-1-02D Satellite Hyperspectral Data[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3906-3915. |
[6] |
ZHOU Bei-bei1, LI Heng-kai1*, LONG Bei-ping2. Variation Analysis of Spectral Characteristics of Reclaimed Vegetation in an Ionic Rare Earth Mining Area[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3946-3954. |
[7] |
LUO Li, WANG Jing-yi, XU Zhao-jun, NA Bin*. Geographic Origin Discrimination of Wood Using NIR Spectroscopy
Combined With Machine Learning Techniques[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3372-3379. |
[8] |
YUAN Wei-dong1, 2, JU Hao2, JIANG Hong-zhe1, 2, LI Xing-peng2, ZHOU Hong-ping1, 2*, SUN Meng-meng1, 2. Classification of Different Maturity Stages of Camellia Oleifera Fruit
Using Hyperspectral Imaging Technique[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3419-3426. |
[9] |
FANG Zheng, WANG Han-bo. Measurement of Plastic Film Thickness Based on X-Ray Absorption
Spectrometry[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3461-3468. |
[10] |
FU Gen-shen1, LÜ Hai-yan1, YAN Li-peng1, HUANG Qing-feng1, CHENG Hai-feng2, WANG Xin-wen3, QIAN Wen-qi1, GAO Xiang4, TANG Xue-hai1*. A C/N Ratio Estimation Model of Camellia Oleifera Leaves Based on
Canopy Hyperspectral Characteristics[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3404-3411. |
[11] |
SHEN Ying, WU Pan, HUANG Feng*, GUO Cui-xia. Identification of Species and Concentration Measurement of Microalgae Based on Hyperspectral Imaging[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3629-3636. |
[12] |
XIE Peng, WANG Zheng-hai*, XIAO Bei, CAO Hai-ling, HUANG Yi, SU Wen-lin. Hyperspectral Quantitative Inversion of Soil Selenium Content Based on sCARS-PSO-SVM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3599-3606. |
[13] |
QIAN Rui1, XU Wei-heng2, 3 , 4*, HUANG Shao-dong2, WANG Lei-guang2, 3, 4, LU Ning2, OU Guang-long1. Tea Plantations Extraction Based on GF-5 Hyperspectral Remote Sensing
Imagery in the Mountainous Area[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3591-3598. |
[14] |
ZHU Zhi-cheng1, WU Yong-feng2*, MA Jun-cheng2, JI Lin2, LIU Bin-hui3*, JIN Hai-liang1*. Response of Winter Wheat Canopy Spectra to Chlorophyll Changes Under Water Stress Based on Unmanned Aerial Vehicle Remote Sensing[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3524-3534. |
[15] |
KANG Ming-yue1, 3, WANG Cheng1, SUN Hong-yan3, LI Zuo-lin2, LUO Bin1*. Research on Internal Quality Detection Method of Cherry Tomatoes Based on Improved WOA-LSSVM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3541-3550. |
|
|
|
|