|
|
|
|
|
|
Combination Weight COD Concentration Prediction Model Based on BiPLS and SiPLS |
CHEN Ying1, DI Yuan-jian1, TANG Xin-liang2*, CUI Xing-ning1, GAO Xin-bei1, CAO Jing-gang1, LI Shao-hua3 |
1. Hebei Province Key Laboratory of Test/Measurement Technology and Instrument, School of Electrical Engineering, Yanshan University, Qinhuangdao 066004, China
2. School of Information Science and Engineering, Hebei University of Science and Technology, Shijiazhuang 050018, China
3. Hebei Sailhero Environmental Protection Hi-tech Co., Ltd., Shijiazhuang 050000, China |
|
|
Abstract The excessively high concentration of organic matter in water poses a great harm, which not only causes serious environmental pollution, but also harms human health. The traditional chemical method for detecting COD(Chemical oxygen denmand, COD) in water usually takes a long time, which is not conducive to rapid quantitative detection of COD in water. In order to solve these problems, a rapid and quantitative detection of COD using a combination of UV spectroscopy and combined weight models is proposed in this paper, the prediction model is based on the backward interval partial least squares (BiPLS) and synergy interval partial least squares (SiPLS) algorithm for screening the characteristic Intervals of UV spectra, and then based on the weights of the characteristic Intervals, a combination weight concentration prediction model is established. In this paper, 45 samples of COD standard solution are experimented; The first derivative and S-G screening of COD UV spect rum date are preprocessed to eliminate baseline drift and environmental noise; The SPXY algorithm is used to divide the experimental data sets into calibration sets and prediction sets. Then, the wavelength of the whole spectral range is screened based on the BiPLS algorithm. In the process of BiPLS screening, the selection of the number of target interval division will have a great influence on the model, so the number of Interval divisions is optimized, subintervals are divided into 15 to 25, and PLS modeling is performed under different interval numbers. The optimal subinterval number is selected by cross-validating root mean square error (RMSECV). When the number of intervals is 18, the effect of the model is the best. 6 characteristic wavelengths are selected from 18 wavelengths. The selected Intervals are 2, 1, 3, 11, 7, 6, and the corresponding wavelengths are 234~240, 262~268, 269~275, 290~296, 297~303, 304~310 nm, respectively. These 6 characteristic wavelength ranges cover a large amount of spectral information and contribute greatly to the final prediction model. Then, these 6 regions are further screened and combined through the SiPLS algorithm, PLS models with different characteristic intervals are constructed using different combinations under the same combination number, the optimal results of an interval combination number are screened out, and the error and correlation of the prediction models under different combinations are compared, the 6 interval are combined into 3 characteristic wavelength intervals, which are 234~240, 262~275 and 290~310 nm respectively. The optimal factor of the optimal PLS model for these three characteristic intervals is 4, 4 and 3, respectively. The characteristic interval combination method of the traditional SiPLS is improved, and the three characteristic intervals are linearly combined based on the weight value instead of the previous direct combination of characteristic intervals. The weights of these three characteristic intervals are calculated by the weight formula as 0.509, 0.318 and 0.173 respectively. Finally, a linear combination weight COD concentration prediction model is established. In order to verify the accuracy of the combined weight prediction model, a PLS prediction model over the full wavelength range, a PLS prediction model with a single characteristic wavelength interval, and a PLS model directly combining characteristic wavelength intervals are established, and the square of the correlation coefficient of the evaluation parameter (R2), the root mean square error of the predicted value and the true concentration value (RMSEC) as well as the Predicted recovery (T) are used to evaluate the model. Compared with other predictive models, the verification results show that the square of the correlation coefficient of the combined weight model reaches 0.999 7, which is obviously higher than the 0.968 0 of the direct combined characteristic interval model, the prediction root mean square error is 0.532, which is more than the prediction of the direct combination characteristic intervals. The model error is reduced by 29.3%, the predicted recovery rate is 96.4%~103.1%, which significantly improves the prediction accuracy. The method is simple and feasible without generating twice pollution, which can provide some technical support for on-line monitoring of COD concentration in water.
|
Received: 2018-06-08
Accepted: 2018-10-27
|
|
Corresponding Authors:
TANG Xin-liang
E-mail: tangxinliang@hebust.edu.cn
|
|
[1] MAI Wei, ZHAO Xiao-ming, ZHANG Jian-fei, et al(买 巍, 赵晓明, 张健飞, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2017, 37(7): 2105.
[2] TANG Bin, ZHAO Jing-xiao, WEI Biao, et al(汤 斌, 赵敬晓, 魏 彪, 等). China Environmental Science(中国环境科学), 2015, 35(2): 478.
[3] Hu Yingtian, Wang X. Sensors & Actuators B Chemical, 2017, 239: 718.
[4] Lepot M, Torres A, Hofer T, et al. Water Research, 2016, 101: 519.
[5] Leardi R, Nørgaard L. Journal of Chemometrics, 2010, 18(11): 486.
[6] Suzuki Y, Kikuta Y, Yamada K, et al. Bunseki Kagaku, 2014, 63(11): 895.
[7] Brito R S, Pinheiro H M, Ferreira F, et al. Urban Water Journal, 2014, 11(4): 261.
[8] ZHAO You-quan, WANG Hui-min, LIU Zi-yu, et al(赵友全, 王慧敏, 刘子毓, 等). Chinese Journal of Scientific Instrument(仪器仪表学报), 2010, 31(9): 1927.
[9] Lepot M, Torres A, Hofer T, et al. Water Research, 2016, 101: 519.
[10] Chen Y, Luo P, Zhao Z Y, et al. Physics Letters A, 2017, 381(40): 3472.
[11] Uusheimo S, Tulonen T, Arvola L, et al. Environmental Monitoring & Assessment, 2017, 189(7): 357.
[12] Agustsson J, Akermann O, Barry D A, et al. Environmental Science Processes & Impacts, 2014, 16(8): 1897.
[13] YUAN Yong-qiang, FU Jia, CHENG Quan-guo(袁永强, 付 佳, 程全国). Mathematics in Practice and Theory(数学的实践与认识), 2015, 45(16): 107.
[14] Hu Y, Wen Y, Wang X. et al. Sensors & Actuators B Chemical, 2016, 227: 393. |
[1] |
ZHANG Xue-fei1, DUAN Ning1, 2*, JIANG Lin-hua1, 2*, CHENG Wen2, YU Zhao-sheng3, LI Wei-dong2, ZHU Guang-bin4, XU Yan-li2. Study on Stability and Sensitivity of Deep Ultraviolet Spectrophotometry Detection System[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(12): 3802-3810. |
[2] |
CHEN Feng-xia1, YANG Tian-wei2, LI Jie-qing1, LIU Hong-gao3, FAN Mao-pan1*, WANG Yuan-zhong4*. Identification of Boletus Species Based on Discriminant Analysis of Partial Least Squares and Random Forest Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(02): 549-554. |
[3] |
LU Li-min, SHI Bin, TANG Tian-yu, ZHAO Xian-hao, WEI Xiao-nan, TANG Yan-lin*. Spectral Analysis of Epinephrine Molecule Based on Density Functional Theory[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(01): 248-252. |
[4] |
LIN Yan1, SU Jun-hong1*, TANG Yan-lin2, YANG Dan3. Ultraviolet Spectrum and Excitation Properties Calculations of Vitamin C Based on Density Functional Theory[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(01): 304-309. |
[5] |
XU Hui-hua, SHI Dong-po*, WU Hao, YIN Xian-qing, ZHENG Yan-cheng, CHEN Wu, LI Geng. Influence of AEO-9 on Ultraviolet Absorbance Spectrum of TDBAC Reduced by β-CD[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(12): 3931-3935. |
[6] |
YANG Lu-ze, LIU Miao*. Construction of a 3D-QSAR Model With Dual Spectral Effects and Its Application in Molecular Modification of Environmentally Friendly PBBs[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(02): 430-434. |
[7] |
TONG Ang-xin, TANG Xiao-jun*, ZHANG Feng, WANG Bin. Species Identification of NaCl, NaOH and β-Phenylethylamine Based on Ultraviolet Spectrophotometry and Supervised Pattern Recognition Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(02): 448-453. |
[8] |
CHEN Ying1,XU Yang-mei1, DI Yuan-jian1,CUI Xing-ning1,ZHANG Jie1,ZHOU Xin-de1,XIAO Chun-yan2, LI Shao-hua3. COD Concentration Prediction Model Based on Multi-Spectral Data Fusion and GANs Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(01): 188-193. |
[9] |
YANG Hui-qin1, 2, ZHANG Bo1, 2, MA Ling1, 2, SHANG Yi1, 2, GAO Dong-li1, 2*. Extraction and Spectroscopic Analysis of Chlorogenic Acid in Diploid Potato[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(12): 3860-3864. |
[10] |
LI Xin1, SU Cheng-zhi1,2*, YU Dan-yang1, SHENG Yu-bo1, CHANG Chuan1, SHI Lei1, JIANG Ji-guang1. Study on the Influence of Wavelength and Low Temperature on COD Detection by Ultraviolet Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(08): 2403-2408. |
[11] |
HE Zhi-heng1, XU Rong2, LIN Jun-feng2, YAN Ning1, CHEN Chun-xia3, CHEN Run-quan3, CHAI Xin-sheng1, 3*. Tri-Wavelength UV Spectroscopy Method by Figuring out the Isobestic Points Shift for the Determination of Fluorescent Whitening Agents in Paper Products[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(06): 1758-1762. |
[12] |
QIU Jia-chu1,4, RUAN Ping2,4*, YONG Jun-guang3, FENG Bo-hua2, 4, HUANG Dai-zheng5, SHEN Hong-tao6. UV-Visible Absorption Spectra and FTIR of Hemoglobin of Healthy People and It Spectroscopic Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(05): 1425-1430. |
[13] |
XIONG Yuan-hui1, 2,LUO Zhong-jie1,CHEN Zhen-wei2, YU Guang-bao1, 2,DUAN Wei-min2, LIU Lin-mei2,LI Fa-quan2,WU Kui-jun2*. Study on Ultraviolet Imaging Remote Sensing Monitoring Technology for SO2 Gas Emission[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(04): 1289-1296. |
[14] |
CHEN Ying1, HE Lei1, CUI Xing-ning1, XIAO Chun-yan2, ZHANG Jie1, ZHANG Can1, YANG Hui1, ZHOU Xin-de1, LI Shao-hua3. Study on Turbidity Compensation of Nitrate Nitrogen in Water Based on Ultraviolet Spectrum[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(02): 472-477. |
[15] |
LI Xin-xing1, ZHOU Jing1, TANG Hong2, SUN Long-qing1, CAO Xia-min3, ZHANG Xiao-shuan4*. Rapid Determination of Total Nitrogen in Aquaculture Water Based on Ultraviolet Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(01): 195-201. |
|
|
|
|