1. 安徽理工大学电气与信息工程学院,安徽 淮南 232000
2. 阜阳师范学院计算机与信息工程学院,安徽 阜阳 236000
3. School of Electronic and Electrical Engineering, University of Leeds, Woodhouse Lane, Leeds L859JT, UK
Application of Unsupervised Learning AE and MVO-DBSCAN Combined with LIF in Mine Water Inrush Recognition
LAI Wen-hao1, ZHOU Meng-ran1*, LI Da-tong1, WANG Ya2, HU Feng1, ZHAO Shun3, GU Yu-lin1
1. School of Electrical and Information Engineering, Anhui University of Science and Technology, Huainan 232000, China
2. School of Computer and Information Engineering, Fuyang Normal University, Fuyang 236000, China
3. School of Electronic and Electrical Engineering, University of Leeds, Woodhouse Lane, Leeds L859JT, UK
Abstract:Quick and accurate identification of water inrush types and sources of water inrush is of great significance for safe mining of coal mines. Laser-induced fluorescence (LIF) technology is rapid and sensitive in detection, which applies LIF to the detection of water inrush in coal mines and uses pattern recognition algorithm to quickly identify the source of water inrush. The current algorithms for identifying water samples are too dependent on pre-established water sample spectral databases When the water source is not in the library, it is easy to cause misidentification. The unsupervised learning algorithm DBSCAN does not require the label and category information of the sample set when clustering, which can reduce the misidentification of unknown categories. Therefore, the DBSCAN algorithm is used to identify the laser-induced fluorescence spectra in water inrush, and MVO is used for the parameter optimization of DBSCAN, which can eliminate the cumbersome manual parameter optimization process. In the experiment, four water samples were taken from the water intake point of Xieqiao Coal Mine, and 30 sets of spectral data were collected for each water sample. The fluorescence spectra of the water samples were collected using a USB2000+ spectrometer with a pixel of 2 048. First, the unsupervised learning algorithm automatic encoder (AE) reduces the dimension of the original spectral data to reduce the influence of redundant information in the spectral data on the clustering. The structure of the AE designed in this paper is a multi-layer network model between shallow and deep layers, which can reduce the original spectral data to 2 dimensions. In order to make the dimensionality reduction model sparse, the author adds a Dropout layer to the traditional AE algorithm. It can be seen from the experiment that the dimensionality reduction model after adding the Dropout layer has a faster convergence speed. Then, using the multivariate optimization (MVO) algorithm to optimize the DBSCAN parameters. In the parameter optimization process, the spectral recognition rate of the water sample after DBSCAN is up to 97.5%, and the corresponding range of the parameter Eps is [0.023 66 0.040 65]. The normalized unscaled spectral data is used for DBSCAN cluster identification to verify the effectiveness of AE on the dimensionality reduction of water sample spectral data. The recognition rate of the original water sample spectrum by DBSCAN is up to 95%, which is 2.5% lower than that of the post-dimensional water sample. The results show that using AE dimensionality reduction data can improve the recognition rate of DBSCAN for different spectra. Finally, the supervised learning algorithm K nearest neighbor (KNN) is used to identify the water sample spectrum after dimension reduction, and the recognition result and the unsupervised learning algorithm DBSCAN are compared. The training set uses three water samples, and the test set uses four water samples. For the test set data, the supervised learning algorithm can only accurately identify the water sample categories contained in the training set, but all the categories that are not in the training set are identified incorrectly. On the contrary, DBSCAN can accurately identify the water sample spectrum not in the training set. The nonlinear dimensionality reduction algorithm AE can achieve dimensionality reduction on high-dimensional water spectral data. The use of MVO-DBSCAN for LIF spectral identification of coal mine water inrush can effectively reduce the misidentification caused by the incompleteness of the mine water source spectrum database.
Key words:Mine water inrush; Laser induced fluorescence; Spectral recognition; Density-based special clustering of applications with noise(DBSCAN); Multi-verse optimizer; Auto encoder; Dropout
来文豪,周孟然,李大同,王 亚,胡 锋,赵 舜,顾煜林. 无监督学习AE和MVO-DBSCAN结合LIF在煤矿突水识别中的应用[J]. 光谱学与光谱分析, 2019, 39(08): 2437-2442.
LAI Wen-hao, ZHOU Meng-ran, LI Da-tong, WANG Ya, HU Feng, ZHAO Shun, GU Yu-lin. Application of Unsupervised Learning AE and MVO-DBSCAN Combined with LIF in Mine Water Inrush Recognition. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39(08): 2437-2442.
[1] WANG Wen-tan, ZHANG Meng-xue, ZHAO Shu-fang(王文坦, 张梦雪, 赵述芳). Journal of Chemical Industry and Engineering(化工学报), 2013, 64(3): 771.
[2] ZHANG Hui-shu, YUAN Xi-gang, Ali K M(张会书, 袁希钢, Ali K M). Journal of Chemical Industry and Engineering(化工学报), 2014(9):3331.
[3] LI Jia-ming, CHU Ying-bo, ZHAO Nan(李嘉铭, 褚应波, 赵 楠). Analytical Chemistry(分析化学), 2016, 44(7): 1042.
[4] WANG Ya, ZHOU Meng-ran, YAN Peng-cheng(王 亚, 周孟然, 闫鹏程). Journal of China Coal Society(煤炭学报), 2017, 42(9): 2427.
[5] ZHOU Meng-ran, LAI Wen-hao, WANG Ya(周孟然, 来文豪, 王 亚). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2018, 38(7): 2262.
[6] Ester M, Kriegel H P, Xu X. International Conference on Knowledge Discovery and Data Mining. The Association for the Advancement of Artificial Intelligence Press, 1996. 226.
[7] Mirjalili S, Mirjalili S M, Hatamlou A. Neural Computing and Applications, 2015, 27(2): 495.
[8] Faris H, Aljarah I, Mirjalili S. Applied Intelligence, 2016, 45(2): 1.
[9] Rotter F, Scholz J, Grimsel J. Applied Physics B, 2010, 101(4): 909.
[10] WU Yu-hong(伍育红). Computer Science(计算机科学), 2015, 42(z6): 491.
[11] QU Jian-ling, DU Chen-fei, DI Ya-zhou(曲建岭, 杜辰飞, 邸亚洲). Computer and Modernization, 2014(8): 128.
[12] Goodfellow I J, Pouget-Abadie J, Mirza M. Advances in Neural Information Processing Systems, 2014, 3: 2672.
[13] Hinton G E, Srivastava N, Krizhevsky A. Computer Science, 2012, 3(4): 212.