Abstract:Rock spectrum is the comprehensive embodiment of rock's physical and chemical properties, composition and structure. Now, it has been widely used in rock classification research. Due to the difficulty of collecting the data on the rock spectrum, it often needs to be collected manually, which not only causes great labor cost but also leads to the limited data on the rock spectrum collected. When the rock spectral classification model is trained with a limited number of samples, the dimensional disaster phenomenon will generally occur. That is, the accuracy of classification will decrease with the rise of the feature dimension, and the rock spectral data coincides with this feature, with a high dimensional number of features. Therefore, to achieve good classification results, a large number of training samples are needed to be used in the training of traditional rock spectral classification models, usually more times than the feature dimension. If the number of samples is small, we must reduce the features to obtain the ideal classification accuracy. Therefore, when the number of samples is small, obtaining a more accurate classification effect on rock spectral data has become a hot research topic. This paper collects the spectral data of typical rocks in Xingcheng, Liaoning Province. Based on the Python programming language, the Siamese Network classification model is constructed with few training samples, and the Triplet Loss is used as the loss function to realize the 3-way-1-shot classification model, and the prediction accuracy of 97.8% is achieved in the verification set. At the same time, four traditional machine learning methods, which include Decision Tree, Random Forest, Support Vector Machine and K-Nearest Neighbor, were used to establish the classification model under the same training samples and compared with them. By drawing the learning curve, it is verified that these four traditional machine learning methods do not have good classification functions in the case of small samples. Since converting the original spectral data into image data will not affect the classification effect of the Siamese Network classification model, the rock spectral classification problem can be transformed into the problem of image classification. Then the image classification methods and means can be used. The experimental results show that the Siamese Network classification model in the case of fewer rock spectral samples can still achieve excellent classification effect, which effectively makes up for the shortcomings of the traditional machine learning model in the case of small samples. Because the data input is paired, it can effectively reduce the overfitting problem caused by too few training samples.
肖志强,贺金鑫,陈德博,战 晔,逯燕乐. 基于孪生网络模型的岩石光谱自动分类方法[J]. 光谱学与光谱分析, 2024, 44(02): 558-562.
XIAO Zhi-qiang, HE Jin-xin, CHEN De-bo, ZHAN Ye, LU Yan-le. Automatic Classification of Rock Spectral Features Based on Siamese
Network Model. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(02): 558-562.
[1] Pane S A, Sihombing F M H. IOP Conference Series: Earth and Environmental Science, 2021, 830: 012042.
[2] LIU Ye, CHENG Guo-jian, MA Wei, et al(刘 烨,程国建,马 微,等). Journal of Central South University(Science and Technology)[中南大学学报(自然科学版)], 2016, 47(7): 2375.
[3] WEI Ren, XU Liang-ji, MENG Xue-ying, et al(韦 任,徐良骥,孟雪莹,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2021, 41(6): 1942.
[4] Crowley J K. Remote Sensing of Environment, 1993, 44(2-3): 337.
[5] HE Jin-xin, REN Xiao-yu, CHEN Sheng-bo, et al(贺金鑫,任小玉,陈圣波,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2021, 41(1): 141.
[6] Tanabe K, Uesaka H, Inoue T, et al. Bunseki Kagaku, 1994, 43(10): 765.
[7] Ishikawa S T, Gulick V C. Computers & Geosciences, 2013, 54: 259.
[8] CHANG Xiang-ping(常项平). Master Dissertation(硕士论文). Zhengzhou University of Aeronautics(郑州航空工业管理学院), 2020.
[9] Xu L, Xie J, Cai F, et al. Electronics, 2021, 10(16): 1892.
[10] MAN Wen-jing, CHEN Wen-long, XU Yuan-jin(满文婧,陈文龙,徐元进). Geography and Geo-information Science(地理与地理信息科学), 2021, 37(3): 29.
[11] Liu J, Osadchy M, Ashton L, et al. Analyst, 2017, 142(21): 4067.
[12] Ji Z, Chai X, Yu Y, et al. Pattern Recognition Letters, 2020, 140: 81.