|
|
|
|
|
|
A Classification Method of Coal and Gangue Based on XGBoost and
Visible-Near Infrared Spectroscopy |
LI Rui1, LI Bo1*, WANG Xue-wen1, LIU Tao1, LI Lian-jie1,2, FAN Shu-xiang2 |
1. College of Mechanical and Vehicle Engineering, Taiyuan University of Technology, Taiyuan 030024, China
2. Beijing Research Center of Intelligent Equipment for Agriculture, Beijing 100097, China
|
|
|
Abstract Intelligent recognition of coal and gangue is a new technology that needs to be developed urgently to realize the intelligentization of fully mechanized caving mining. Visible-near infrared spectroscopy technology has many advantages such as environmental friendly and real-time, which meets the requirements of intelligent separation of coal and gangue. The Extreme Gradient Boosting Tree (XGBoost) algorithm which performed well in data science competitions, was introducedto achieve the recognition of coal and gangue based on visible-near infrared spectroscopy. Firstly, a visible-near infrared spectroscopy experimental platform was built to collect the reflectance spectra of lump coal and gangue samples from Shanxi Ximing, Shaanxi Shenmu, and Inner Mongolia Balongtu coal mines in the range of 370~1 049 nm. The collected original spectra were preprocessed through black and white correction, method of removing the start and end bands, Savitzky-Golay (SG) smoothing and standard normal variable transformation (SNV) to reduce the effects of uneven illumination, noise and optical path difference. Secondly, the experimental group and test group were divided according to the difference of reflection spectrum of samples from different mines. The experimental group had a minor difference, which was used to compare the performance of different models and select the best algorithm; the difference of test groups was obvious, which was used to test the performance of the best algorithm in other coal mines and verified the applicability of the algorithm to different coal mines. In the experiment of the experimental group, the coal and gangue classification model was established based on the XGBoost algorithm, and the commonly used machine learning classification algorithms k-nearest neighbor method (KNN), random forest (RF), support vector machine (SVM), which were introduced for comparison. The results showed XGBoost performed best. The average accuracy of 10-fold cross-validation (ACC10), classification accuracy (ACC), and AUC values respectively reached 0.957 2, 0.970 5, and 0.971 6, showing strong stability and classification capabilities. Then in order to reduce the data dimension and calculations, recursive feature elimination (RFE), successive projections algorithm (SPA) and competitive adaptive reweighted sampling (CARS) were used to select the characteristic wavelength and combined with the above four classification algorithms to construct a simplified classification model, respectively. The simplified model of the combination of RFE and XGBoost(RFE-XGB) performed best in the test. The ACC10, ACC, AUC was 0.965 7, 0.980 3, 0.980 3, respectively, and the data dimension reduced to 9. Simplified model improved the stability and classification ability of the model while reducing the data dimension. In the experiment of the test group, the model based on XGBoost and RFE-XGB algorithms can also achieve stable and accurate recognition of coal and gangue in other coal mines, and the simplified model performed better, which was consistent with the results of the experimental group.
|
Received: 2021-10-19
Accepted: 2022-04-04
|
|
Corresponding Authors:
LI Bo
E-mail: libo@tyut.edu.cn
|
|
[1] YU Bin, XU Gang, HUANG Zhi-zeng, et al(于 斌,徐 刚,黄志增,等). Journal of China Coal Society(煤炭学报), 2019, 44(1): 42.
[2] Zhang N, Liu C. Scientific Reports, 2018, 8(1): 190.
[3] LI Bo, WANG Xue-wen, GAO Xin-yu, et al(李 博,王学文,高新宇,等). China Powder Science and Technology(中国粉体技术), 2021, 27(4): 77.
[4] Wang S H, Zhao Y, Hu R, et al. Chinese Journal of Analytical Chemistry, 2019, 47 (4): E19034.
[5] SONG Liang, LIU Shan-jun, MAO Ya-chun, et al(宋 亮,刘善军,毛亚纯,等). Journal of Northeastern University·Natural Science(东北大学学报·自然科学版), 2017, 38(10): 1473.
[6] YANG En, WANG Shi-bo, GE Shi-rong, et al(杨 恩,王世博,葛世荣,等). Industry and Mine Automation(工矿自动化), 2017, 45(3): 46.
[7] Yang E, Ge S, Wang S, et al. Journal of Spectroscopy, 2018, 2018: 2754908.
[8] Mao Y, Le B T, Xiao D, et al. Optics and Laser Technology, 2019, 114: 10.
[9] Xiao D, Li H, Sun X. ACS Omega, 2020, 5(40): 25772.
[10] Hu F, Zhou M, Yan P, et al. IEEE Access, 2019, 7: 169697.
[11] Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2016, 785.
[12] HUANG Qing, XIE He-liang(黄 卿,谢合亮). Mathematics in Practice and Theory(数学的实践与认识), 2018, 48(8): 297.
[13] TAO Meng-qi, LIU Jia-xiang, WU Yue, et al(陶孟琪,刘家祥,吴 越,等). Acta Optica Sinica(光学学报), 2020, 40 (7): 0730002.
[14] JI Hui-jie, NI Feng, LIU Jiang, et al(冀慧杰,倪 枫,刘 姜,等). Computer Technology and Development(计算机技术与发展), 2021, 31(5): 21.
|
[1] |
GAO Hong-sheng1, GUO Zhi-qiang1*, ZENG Yun-liu2, DING Gang2, WANG Xiao-yao2, LI Li3. Early Classification and Detection of Kiwifruit Soft Rot Based on
Hyperspectral Image Band Fusion[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 241-249. |
[2] |
YANG Qun1, 2, LING Qi-han1, WEI Yong1, NING Qiang1, 2, KONG Fa-ming1, ZHOU Yi-fan1, 2, ZHANG Hai-lin1, WANG Jie1, 2*. Non-Destructive Monitoring Model of Functional Nitrogen Content in
Citrus Leaves Based on Visible-Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3396-3403. |
[3] |
MU Da1, 2, WANG Qi-shu1, 2*, CUI Zong-yu1, 2, REN Jiao-jiao1, 2, ZHANG Dan-dan1, 2, LI Li-juan1, 2, XIN Yin-jie1, 2, ZHOU Tong-yu3. Study on Interference Phenomenon in Terahertz Time Domain
Spectroscopy Nondestructive Testing of Glass Fiber Composites[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3031-3040. |
[4] |
CAI Jian-rong1, 2, HUANG Chu-jun1, MA Li-xin1, ZHAI Li-xiang1, GUO Zhi-ming1, 3*. Hand-Held Visible/Near Infrared Nondestructive Detection System for Soluble Solid Content in Mandarin by 1D-CNN Model[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2792-2798. |
[5] |
TANG Ruo-han1, 2, LI Xiu-hua1, 2*, LÜ Xue-gang1, 2, ZHANG Mu-qing2, 3, YAO Wei2, 3. Transmittance Vis-NIR Spectroscopy for Detecting Fibre Content of
Living Sugarcane[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(08): 2419-2425. |
[6] |
LUO Zheng-fei1, GONG Zheng-li1, 2, YANG Jian1, 2*, YANG Chong-shan2, 3, DONG Chun-wang3*. Rapid Non-Destructive Detection Method for Black Tea With Exogenous Sucrose Based on Near-Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(08): 2649-2656. |
[7] |
ZHANG Yue1, 2, LI Yang1, 2, SONG Yue-peng1, 2*. Nondestructive Detection of Slight Mechanical Damage of Apple by Hyperspectral Spectroscopy Based on Stacking Model[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(07): 2272-2277. |
[8] |
ZHANG Mei-zhi1, ZHANG Ning1, 2, QIAO Cong1, XU Huang-rong2, GAO Bo2, MENG Qing-yang2, YU Wei-xing2*. High-Efficient and Accurate Testing of Egg Freshness Based on
IPLS-XGBoost Algorithm and VIS-NIR Spectrum[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(06): 1711-1718. |
[9] |
LIN Jing-tao, XIN Chen-xing, LI Yan*. Spectral Characteristics of “Trapiche-Like Sapphire” From ChangLe, Shandong Province[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(04): 1199-1204. |
[10] |
MAO Xiao-tian1, CHEN Chang2, YIN Zuo-wei1*, WANG Zi-min1. Spectra Characterization of Cr-Grossular (Tsavorite) With “Frogspawn” Color Zoning From Canada[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(02): 520-525. |
[11] |
SHENG Qiang1, 2, ZHENG Jian-ming1*, LIU Jiang-shan2, SHI Wei-chao1, LI Hai-tao2. Advances and Prospects in Inner Surface Defect Detection Based on Cite Space[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(01): 9-15. |
[12] |
BAO Pei-jin1, CHEN Quan-li1, 2*, WU Yan-han1, LI Xuan1, ZHAO An-di1. Spectroscopy Characteristics of Emerald From Swat Valley, Pakistan[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(01): 213-219. |
[13] |
ZHU Jin-yan, ZHU Yu-jie*, FENG Guo-hong*, ZENG Ming-fei, LIU Si-qi. Optimization of Near-Infrared Detection Model of Blueberry Sugar Content Based on Deep Belief Network and Hybrid Wavelength Selection Method[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(12): 3775-3782. |
[14] |
WANG Wei, LI Yong-yu*, PENG Yan-kun, YANG Yan-ming, YAN Shuai, MA Shao-jin. Design and Experiment of a Handheld Multi-Channel Discrete Spectrum Detection Device for Potato Processing Quality[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(12): 3889-3895. |
[15] |
LI Ming1*, HAN Dong-hai2*, LU Ding-qiang1, LU Xiao-xiang1, CHAI Chun-xiang1, LIU Wen3, SUN Ke-xuan1. Research Progress of Universal Model of Near-Infrared Spectroscopy in Agricultural Products and Foods Detection[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(11): 3355-3360. |
|
|
|
|