|
|
|
|
|
|
A Classification Method of Coal and Gangue Based on XGBoost and
Visible-Near Infrared Spectroscopy |
LI Rui1, LI Bo1*, WANG Xue-wen1, LIU Tao1, LI Lian-jie1,2, FAN Shu-xiang2 |
1. College of Mechanical and Vehicle Engineering, Taiyuan University of Technology, Taiyuan 030024, China
2. Beijing Research Center of Intelligent Equipment for Agriculture, Beijing 100097, China
|
|
|
Abstract Intelligent recognition of coal and gangue is a new technology that needs to be developed urgently to realize the intelligentization of fully mechanized caving mining. Visible-near infrared spectroscopy technology has many advantages such as environmental friendly and real-time, which meets the requirements of intelligent separation of coal and gangue. The Extreme Gradient Boosting Tree (XGBoost) algorithm which performed well in data science competitions, was introducedto achieve the recognition of coal and gangue based on visible-near infrared spectroscopy. Firstly, a visible-near infrared spectroscopy experimental platform was built to collect the reflectance spectra of lump coal and gangue samples from Shanxi Ximing, Shaanxi Shenmu, and Inner Mongolia Balongtu coal mines in the range of 370~1 049 nm. The collected original spectra were preprocessed through black and white correction, method of removing the start and end bands, Savitzky-Golay (SG) smoothing and standard normal variable transformation (SNV) to reduce the effects of uneven illumination, noise and optical path difference. Secondly, the experimental group and test group were divided according to the difference of reflection spectrum of samples from different mines. The experimental group had a minor difference, which was used to compare the performance of different models and select the best algorithm; the difference of test groups was obvious, which was used to test the performance of the best algorithm in other coal mines and verified the applicability of the algorithm to different coal mines. In the experiment of the experimental group, the coal and gangue classification model was established based on the XGBoost algorithm, and the commonly used machine learning classification algorithms k-nearest neighbor method (KNN), random forest (RF), support vector machine (SVM), which were introduced for comparison. The results showed XGBoost performed best. The average accuracy of 10-fold cross-validation (ACC10), classification accuracy (ACC), and AUC values respectively reached 0.957 2, 0.970 5, and 0.971 6, showing strong stability and classification capabilities. Then in order to reduce the data dimension and calculations, recursive feature elimination (RFE), successive projections algorithm (SPA) and competitive adaptive reweighted sampling (CARS) were used to select the characteristic wavelength and combined with the above four classification algorithms to construct a simplified classification model, respectively. The simplified model of the combination of RFE and XGBoost(RFE-XGB) performed best in the test. The ACC10, ACC, AUC was 0.965 7, 0.980 3, 0.980 3, respectively, and the data dimension reduced to 9. Simplified model improved the stability and classification ability of the model while reducing the data dimension. In the experiment of the test group, the model based on XGBoost and RFE-XGB algorithms can also achieve stable and accurate recognition of coal and gangue in other coal mines, and the simplified model performed better, which was consistent with the results of the experimental group.
|
Received: 2021-10-19
Accepted: 2022-04-04
|
|
Corresponding Authors:
LI Bo
E-mail: libo@tyut.edu.cn
|
|
[1] YU Bin, XU Gang, HUANG Zhi-zeng, et al(于 斌,徐 刚,黄志增,等). Journal of China Coal Society(煤炭学报), 2019, 44(1): 42.
[2] Zhang N, Liu C. Scientific Reports, 2018, 8(1): 190.
[3] LI Bo, WANG Xue-wen, GAO Xin-yu, et al(李 博,王学文,高新宇,等). China Powder Science and Technology(中国粉体技术), 2021, 27(4): 77.
[4] Wang S H, Zhao Y, Hu R, et al. Chinese Journal of Analytical Chemistry, 2019, 47 (4): E19034.
[5] SONG Liang, LIU Shan-jun, MAO Ya-chun, et al(宋 亮,刘善军,毛亚纯,等). Journal of Northeastern University·Natural Science(东北大学学报·自然科学版), 2017, 38(10): 1473.
[6] YANG En, WANG Shi-bo, GE Shi-rong, et al(杨 恩,王世博,葛世荣,等). Industry and Mine Automation(工矿自动化), 2017, 45(3): 46.
[7] Yang E, Ge S, Wang S, et al. Journal of Spectroscopy, 2018, 2018: 2754908.
[8] Mao Y, Le B T, Xiao D, et al. Optics and Laser Technology, 2019, 114: 10.
[9] Xiao D, Li H, Sun X. ACS Omega, 2020, 5(40): 25772.
[10] Hu F, Zhou M, Yan P, et al. IEEE Access, 2019, 7: 169697.
[11] Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2016, 785.
[12] HUANG Qing, XIE He-liang(黄 卿,谢合亮). Mathematics in Practice and Theory(数学的实践与认识), 2018, 48(8): 297.
[13] TAO Meng-qi, LIU Jia-xiang, WU Yue, et al(陶孟琪,刘家祥,吴 越,等). Acta Optica Sinica(光学学报), 2020, 40 (7): 0730002.
[14] JI Hui-jie, NI Feng, LIU Jiang, et al(冀慧杰,倪 枫,刘 姜,等). Computer Technology and Development(计算机技术与发展), 2021, 31(5): 21.
|
[1] |
ZHOU Tong-tong1, SUN Xiao-lin1, SUN Zhi-zhong2, PENG He-huan1, SUN Tong1, HU Dong1*. Current Status and Future Perspective of Spectroscopy and Imaging
Technique Applications in Bruise Detection of Fruits and Vegetables:
A Review[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(09): 2657-2665. |
[2] |
WU Ye-lan1, GUAN Hui-ning1, LIAN Xiao-qin1, YU Chong-chong1, LIAO Yu2, GAO Chao1. Study on Detection Method of Leaves With Various Citrus Pests and
Diseases by Hyperspectral Imaging[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2397-2402. |
[3] |
ZHONG Xiang-jun1, 2, YANG Li1, 2*, ZHANG Dong-xing1, 2, CUI Tao1, 2, HE Xian-tao1, 2, DU Zhao-hui1, 2. Effect of Different Particle Sizes on the Prediction of Soil Organic Matter Content by Visible-Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2542-2550. |
[4] |
LIAN Xiao-qin1, 2, CHEN Qun1, 2, TANG Shen-miao1, 2, WU Jing-zhu1, 2, WU Ye-lan1, 2, GAO Chao1, 2. Quantitative Analysis Method of Key Nutrients in Lanzhou Lily Based on NIR and SOM-RBF[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2025-2032. |
[5] |
YANG Qiao-ling1, 2, CHEN Qin2, NIU Bing2, DENG Xiao-jun3*, MA Jin-ge3, GU Shu-qing3, YU Yong-ai4, GUO De-hua3, ZHANG Feng5. Visualization of Thiourea in Bulk Milk Powder Based on Portable Raman Hyperspectral Imaging Technology On-Site Rapid Detection Method
Research[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2156-2162. |
[6] |
BAI Xue-bing, MA Dian-kun, ZHANG Meng-jie, MA Rui-qin*. Hyperspectral Non-Destructive Analysis of Red Meat Quality: A Review[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 1993-1998. |
[7] |
LIU Yan-de, WANG Shun. Research on Non-Destructive Testing of Navel Orange Shelf Life Imaging Based on Hyperspectral Image and Spectrum Fusion[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(06): 1792-1797. |
[8] |
WANG Ming-xuan, WANG Qiao-yun*, PIAN Fei-fei, SHAN Peng, LI Zhi-gang, MA Zhen-he. Quantitative Analysis of Diabetic Blood Raman Spectroscopy Based on XGBoost[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(06): 1721-1727. |
[9] |
LI Xue-ying1, 2, LI Zong-min3*, CHEN Guang-yuan4, QIU Hui-min2, HOU Guang-li2, FAN Ping-ping2*. Prediction of Tidal Flat Sediment Moisture Content Based on Wavelet Transform[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(04): 1156-1161. |
[10] |
ZHANG Xiao-hong1, JIANG Xue-song1*, SHEN Fei2*, JIANG Hong-zhe1, ZHOU Hong-ping1, HE Xue-ming2, JIANG Dian-cheng1, ZHANG Yi3. Design of Portable Flour Quality Safety Detector Based on Diffuse
Transmission Near-Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(04): 1235-1242. |
[11] |
XIAO Shi-jie1, WANG Qiao-hua1, 2*, LI Chun-fang3, 4, DU Chao3, ZHOU Zeng-po4, LIANG Sheng-chao4, ZHANG Shu-jun3*. Nondestructive Testing and Grading of Milk Quality Based on Fourier Transform Mid-Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(04): 1243-1249. |
[12] |
LI Lian-jie1, 2, FAN Shu-xiang2, WANG Xue-wen1, LI Rui1, WEN Xiao1, WANG Lu-yao1, LI Bo1*. Classification Method of Coal and Gangue Based on Hyperspectral
Imaging Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(04): 1250-1256. |
[13] |
WANG Xiao-bin1, 2, 3, ZHANG Xi1, GUAN Chen-zhi1, HONG Hua-xiu1, HUANG Shuang-gen2*, ZHAO Chun-jiang3. Quantitative Detection of Ascorbic Acid Additive in Flour Based on Raman Imaging Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(12): 3765-3770. |
[14] |
XIAO Shi-jie1, WANG Qiao-hua1, 2*, FAN Yi-kai3, LIU Rui3, RUAN Jian3, WEN Wan4, LI Ji-qi4, SHAO Huai-feng4, LIU Wei-hua5, ZHANG Shu-jun3*. Rapid Determination of αs1-Casein and κ-Casein in Milk Based on Fourier Transform Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(12): 3688-3694. |
[15] |
YU Guo-wei1, MA Ben-xue1,2*, CHEN Jin-cheng1,3, DANG Fu-min4,5, LI Xiao-zhan1, LI Cong1, WANG Gang1. Vis-NIR Spectra Discriminant of Pesticide Residues on the Hami Melon Surface by GADF and Multi-Scale CNN[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(12): 3701-3707. |
|
|
|
|