|
|
|
|
|
|
Classification of Organic Contaminants in Water Distribution Systems Developed by SPA and Multi-Classification SVM Using UV-Vis Spectroscopy |
HUANG Ping-jie, LI Yu-han, YU Qiao-jun, WANG Ke, YIN Hang, HOU Di-bo*, ZHANG Guang-xin |
State Key Laboratory of Industrial Control Technology, College of Control Science and Engineering, Zhejiang University, Hangzhou 310027, China |
|
|
Abstract Quickly and effectively identifying the water contaminants is vital for reducing the impact of sudden drinking water pollution incidents. PCA is mostly used to extract the feature of different contaminants in drinking water with UV-Vis spectra. However, for the organic contaminants with high similarity in UV-Vis spectra, the identification result is ineffective when only extracting the feature of the largest variance direction from the data-driven point of view. This paper studies the classification of organic contaminants in water distribution systems developed by SPA and multi-classification SVM using UV-Vis spectroscopy. Firstly, the original spectral data of phenol, hydroquinone, resorcinol and m-phenylenediamine are measured by UV spectrometer and pretreated. The correlation between wavelength and concentration of four contaminants was compared. The peaks between phenol and resorcinol, hydroquinone and m-phenylenediamine are overlapped seriously, the classification results can interfere easily. In feature extraction, the SPA is introduced to select the organic contaminants’ characteristic wavelengths of UV-Vis spectra. Then, multiple linear regression analysis is carried out to choose the optimal parameter combination, which corresponds to the minimum prediction standard deviation. Based on this, the multi-classification support vector machine is used to form an identification model for drinking water organic contaminants. Finally, the classification results of spectral data based on full spectrum, PCA and SPA under different classification methods and different concentrations are compared and analyzed, and the applicability and stability of SPA are further explained. Experimental results demonstrate that SPA-based feature extraction method eliminates the interference of multi-collinearity and amplifies the difference among the UV-Vis spectra of different organic contaminants, thereby improving the accuracy of the classification model. This method has certain reference value for solving the problem of identifying the types of pollutants with overlapped peaks in the drinking water.
|
Received: 2019-04-18
Accepted: 2019-08-09
|
|
Corresponding Authors:
HOU Di-bo
E-mail: houdb@zju.edu.cn
|
|
[1] Miao X, Tang Y, Wong C W Y, et al. Environmental Pollution, 2015, 196: 473.
[2] Assaad A, Pontvianne S, Marie-Noëlle Pons. Environmental Monitoring and Assessment, 2017, 189(5): 229.
[3] Zhang H, Tian J, Huang J, et al. RSC Adv.,2016, 6: 110356.
[4] Huang P, Wang K, Hou D, et al. Applied Optics, 2017, 56(22): 6317.
[5] Yu Q,Yin H,Wang K, et al. Water, 2018, 10(11): 1566.
[6] ZHAO Ming-fu, TANG Ping, TANG Bin, et al(赵明富, 唐 平, 汤 斌, 等). Journal of Atmospheric and Environmental Optics(大气与环境光学学报), 2018, 13(6): 436.
[7] Hou D, Zhang J, Yang Z, et al. Optics Express, 2015, 23(13): 17487.
[8] GUO Bing-bing, HOU Di-bo, JIN Yu, et al(郭冰冰, 侯迪波, 金 宇,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2017,37(5):1460.
[9] Zhu H, Chu B, Zhang C, et al. Scientific Reports, 2017, 7(1): 4125.
[10] Gomes A A, Schenone A V, Goicoechea H C, et al. Analytical & Bioanalytical Chemistry, 2015, 407(19): 5649.
[11] Terrile, Amélia E, Marcheafave, et al. Journal of the Brazilian Chemical Society, 2016, 27(7): 1254.
[12] Vapnik V. The Nature of Statistical Learning Theory. Springer, 1995.
[13] Huang P, Jin Y, Hou D, et al. Sensors, 2017, 17(3): 581.
[14] Daengduang S, Vateekul P . IEEE 2017 9th International Conference on Knowledge and Smart Technology (KST)-Chonburi, Thailand (2017.2.1—2017.2.4). 2017 9th International Conference on Knowledge and Smart Technology (KST)-Applying One-Versus-One SVMs to Classify Multi-Label Data with Large Labels Using Spark,2017. 72.
[15] CHU Xiao-li(褚小立). Molecular Spectroscopy Analytical Technology Combined with Chemometrics and Its Applications(化学计量学方法与分子光谱分析技术). Beijing: Chemical Industry Press(北京:化学工业出版社), 2011.
[16] Ting K M. Confusion Matrix. Encyclopedia of Machine Learning and Data Mining. Springer US, 2017. |
[1] |
ZHENG Pei-chao, YIN Yi-tong, WANG Jin-mei*, ZHOU Chun-yan, ZHANG Li, ZENG Jin-rui, LÜ Qiang. Study on the Method of Detecting Phosphate Ions in Water Based on
Ultraviolet Absorption Spectrum Combined With SPA-ELM Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 82-87. |
[2] |
JIA Zong-chao1, WANG Zi-jian1, LI Xue-ying1, 2*, QIU Hui-min1, HOU Guang-li1, FAN Ping-ping1*. Marine Sediment Particle Size Classification Based on the Fusion of
Principal Component Analysis and Continuous Projection Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3075-3080. |
[3] |
AN Bai-song1, 2, WANG Xue-mei1, 2*, HUANG Xiao-yu1, 2, KAWUQIATI Bai-shan1, 2. Hyperspectral Estimation of Soil Lead Content Based on Random Frog Band Selection Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3302-3309. |
[4] |
CHEN Chao-yang1, 2, LIU Cui-hong1, 2, LI Zhi-bin3, Andy Hsitien Shen1, 2*. Alexandrite Effect Origin of Gem Grade Diaspore[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(08): 2557-2562. |
[5] |
ZHANG Hai-liang1, XIE Chao-yong1, TIAN Peng1, ZHAN Bai-shao1, CHEN Zai-liang1, LUO Wei1*, LIU Xue-mei2*. Measurement of Soil Organic Matter and Total Nitrogen Based on Visible/Near Infrared Spectroscopy and Data-Driven Machine Learning Method[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(07): 2226-2231. |
[6] |
WANG Zi-min1, MAO Xiao-tian1, YIN Zuo-wei1*, CHEN Chang2, CHENG Tian-jia1. Study on the Spectral Characteristics and the Color-Change Effect of Spinel[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(11): 3541-3545. |
[7] |
YANG Dong-feng1, LI Ai-chuan1, LIU Jin-ming1, CHEN Zheng-guang1, SHI Chuang1, HU Jun2*. Optimization of Seed Vigor Near-Infrared Detection by Coupling Mean Impact Value With Successive Projection Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(10): 3135-3142. |
[8] |
ZHONG Xiang-jun1,2, YANG Li1,2*, ZHANG Dong-xing1,2, CUI Tao1,2, HE Xian-tao1,2, DU Zhao-hui1,2. Prediction of Organic Matter Content in Sandy Fluvo-Aquic Soil by
Visible-Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(09): 2924-2930. |
[9] |
ZHU Wen-qiong, ZHOU Mu-chun*, ZHAO Qi, LIAO Jun. End-Point Prediction of BOF Steelmaking Based on Flame Spectral Feature Selection Using WCARS-ISPA[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(08): 2332-2336. |
[10] |
ZOU Jin-ping1, ZHANG Shuai2, DONG Wen-tao2, ZHANG Hai-liang2*. Application of Hyperspectral Image to Detect the Content of Total Nitrogen in Fish Meat Volatile Base[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(08): 2586-2590. |
[11] |
REN Qian-qian1*, YUAN Yi-chai2. Study on Spectroscopy of Sphene From Pakistan[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(07): 2263-2268. |
[12] |
CHEN Bei1, ZHENG En-rang1*, MA Jin-fang2, GE Fa-huan3, XIAO Huan-xian4. Prediction Method for Production Year of Antai Pills Based on Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(08): 2592-2597. |
[13] |
MENG Shi-yu1, HUANG Ying-lai1*, ZHAO Peng1, LI Chao1, LIU Zhen-bo2, LIU Yi-xing2, XU Yan3. Wood Quality of Chinese Zither Panels Based on Convolutional Neural Network and Near-Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(01): 284-289. |
[14] |
HAN Jian, LI Yu-zhao, CAO Zhi-min*, LIU Qiang, MOU Hai-wei. Water Content Prediction for High Water-Cut Crude Oil Based on SPA-PLS Using Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39(11): 3452-3458. |
[15] |
GAO Rui, LI Ze-dong, MA Zheng, KONG Qing-ming, Muhammad Rizwan, SU Zhong-bin*. Research on Crude Protein of Pasture Based on Hyperspectral Imaging[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39(10): 3245-3250. |
|
|
|
|