|
|
|
|
|
|
Near-Infrared Spectral Feature Selection of Water-Bearing Rocks Based on Mutual Information |
ZHANG Xiu-lian1, 2, ZHANG Fang1, 2*, ZHOU Nuan1, 2, ZHANG Jing-jie1,2, LIU Wen-fang3, ZHANG Shuai1, 2, YANG Xiao-jie1, 2 |
1. State Key Laboratory for Geomechanics and Deep Underground Engineering, China University of Mining and Technology, Beijing 100083, China
2. School of Mechanics and Civil Engineering, China University of Mining and Technology, Beijing 100083, China
3. College of Resources and Civil Engineering, Northeastern University, Shenyang 110004, China |
|
|
Abstract The relationship between near-infrared spectroscopic measurements of rock and its water content does not follow simple linear correlations, preventing the direct use of classical correlation analysis. In the present paper, an experiment on the water migration in cliff conglomerates from the Mogao Grottoes was performed,and collected 51 pieces of near-infrared spectra from three different positions sample. These spectra cover the whole process of the conglomerate from the initial dry state to the saturated state; then we selected a combined N point smooth and baseline correction method (NPS+B-corr) to preprocess the original near-infrared spectrum. According to the spectral curve features at 1 450 and 1 950 nm of the strong absorption spectrum, six initial feature variables, namely Height, Full Width at Half Maximum (FWHM), Area, Left Half Width (LHW), Right Half Width (RHW), and (LHW/RHW), were extracted to establish the initial feature set. Simultaneously, the extracted spectral characteristic variables were normalized, and the curve of each spectral characteristic parameter and the change of water content were drawn according to the result of the processing, determine the water content level. Then,the correlation among the feature variables of the initial feature set should be screened to remove redundant features. The initial feature set is simplified to three characteristic variables: Height, LHW, RHW. Finally, based on mutual information, the Best Individual Feature and Maximal Information Coefficient methods were used to evaluate the relationship between samples’ spectral characteristic parameters and water content. We found that: (1) at wavelengths between approximately 1 450 and 1 930 nm, the near-infrared spectrum of the conglomerate has obvious absorption peaks, and the absorption peaks show a strong correlation with the change of water content, which indicates that spectral reflectance was significantly correlated with water content; (2) the relationship of primary spectral characteristic parameters with total water content can be described by an S-shaped function,water content can be divided into three states of dry, water-absorbing, and saturated; (3) The near-infrared spectral characteristics selected by the two information methods are not completely consistent. Based on the BIF method, the correlation between the characteristic variable at 1 450 nm and the rock moisture content ranks from right to left as right shoulder width, peak height, and left shoulder width; at 1 900 nm, the peak height, right shoulder width, and left shoulder width. Based on the MIC method, the correlation between the characteristic variables at 1 450 and 1 900 nm and the rock water content level from the highest to the lowest in the left shoulder width, peak height, and right shoulder width; (4) Decision tree analysis suggests that the MIC method achieves higher accuracy in identifying water content level than the BIF method.
|
Received: 2020-05-26
Accepted: 2020-09-12
|
|
Corresponding Authors:
ZHANG Fang
E-mail: zhangf76@163.com
|
|
[1] Cen H, He Y. Trends in Food Science & Technology, 2007. 18(2): 83.
[2] David B L, Gregory P A. Soil Science Society of America Journal, 2002, 66: 722.
[3] Mouazen A M, Karoui R, De Baerdemaeker J, et al. Soil Science Society of America Journal, 2006,70(4): 1295.
[4] CHU Xiao-li(褚小立). Molecular Spectroscopy Analytical Technology Combined With Chemometrics and Its Applications(结合化学计量学的分子光谱分析技术及其应用). Beijing: Chemical Industry Press(北京: 化学工业出版社), 2011. 264.
[5] Balabin R M, Smirnov S V. Analytica Chimica Acta, 2011, 692(1-2): 63.
[6] Vinh L T, Lee S, Park Y T, et al. Applied Intelligence, 2012, 37(1): 100.
[7] YAO Xu, WANG Xiao-dan, ZHANG Yu-xi, et al(姚 旭, 王晓丹, 张玉玺, 等). Control and Decision(控制与决策), 2012, 27(2): 161.
[8] Reshef D N, Reshef Y A, Finucane H K, et al. Science, 2011,334(6062): 1518.
[9] LIANG Ji-ye, FENG Chen-jiao, SONG Peng(梁吉业,冯晨娇,宋 鹏). Chinese Journal of Computers(计算机学报), 2016,39(1): 1.
[10] HE Man-chao,ZHANG Guo-feng,ZHAO Jian(何满潮,张国锋,赵 健). Chinese Patent(中国专利):CN 102253181, 2011.
[11] ZHANG Fang, HU Zuo-le, HOU Xin-li, et al(张 芳, 户佐乐, 侯欣莉, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2019, 39(11): 3395. |
[1] |
GAO Feng1, 2, XING Ya-ge3, 4, LUO Hua-ping1, 2, ZHANG Yuan-hua3, 4, GUO Ling3, 4*. Nondestructive Identification of Apricot Varieties Based on Visible/Near Infrared Spectroscopy and Chemometrics Methods[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 44-51. |
[2] |
BAO Hao1, 2,ZHANG Yan1, 2*. Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 148-157. |
[3] |
HU Cai-ping1, HE Cheng-yu2, KONG Li-wei3, ZHU You-you3*, WU Bin4, ZHOU Hao-xiang3, SUN Jun2. Identification of Tea Based on Near-Infrared Spectra and Fuzzy Linear Discriminant QR Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3802-3805. |
[4] |
LIU Xin-peng1, SUN Xiang-hong2, QIN Yu-hua1*, ZHANG Min1, GONG Hui-li3. Research on t-SNE Similarity Measurement Method Based on Wasserstein Divergence[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3806-3812. |
[5] |
BAI Xue-bing1, 2, SONG Chang-ze1, ZHANG Qian-wei1, DAI Bin-xiu1, JIN Guo-jie1, 2, LIU Wen-zheng1, TAO Yong-sheng1, 2*. Rapid and Nndestructive Dagnosis Mthod for Posphate Dficiency in “Cabernet Sauvignon” Gape Laves by Vis/NIR Sectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3719-3725. |
[6] |
WANG Qi-biao1, HE Yu-kai1, LUO Yu-shi1, WANG Shu-jun1, XIE Bo2, DENG Chao2*, LIU Yong3, TUO Xian-guo3. Study on Analysis Method of Distiller's Grains Acidity Based on
Convolutional Neural Network and Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3726-3731. |
[7] |
LUO Li, WANG Jing-yi, XU Zhao-jun, NA Bin*. Geographic Origin Discrimination of Wood Using NIR Spectroscopy
Combined With Machine Learning Techniques[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3372-3379. |
[8] |
ZHANG Shu-fang1, LEI Lei2, LEI Shun-xin2, TAN Xue-cai1, LIU Shao-gang1, YAN Jun1*. Traceability of Geographical Origin of Jasmine Based on Near
Infrared Diffuse Reflectance Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3389-3395. |
[9] |
YANG Qun1, 2, LING Qi-han1, WEI Yong1, NING Qiang1, 2, KONG Fa-ming1, ZHOU Yi-fan1, 2, ZHANG Hai-lin1, WANG Jie1, 2*. Non-Destructive Monitoring Model of Functional Nitrogen Content in
Citrus Leaves Based on Visible-Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3396-3403. |
[10] |
HUANG Meng-qiang1, KUANG Wen-jian2, 3*, LIU Xiang1, HE Liang4. Quantitative Analysis of Cotton/Polyester/Wool Blended Fiber Content by Near-Infrared Spectroscopy Based on 1D-CNN[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3565-3570. |
[11] |
HUANG Zhao-di1, CHEN Zai-liang2, WANG Chen3, TIAN Peng2, ZHANG Hai-liang2, XIE Chao-yong2*, LIU Xue-mei4*. Comparing Different Multivariate Calibration Methods Analyses for Measurement of Soil Properties Using Visible and Short Wave-Near
Infrared Spectroscopy Combined With Machine Learning Algorithms[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3535-3540. |
[12] |
KANG Ming-yue1, 3, WANG Cheng1, SUN Hong-yan3, LI Zuo-lin2, LUO Bin1*. Research on Internal Quality Detection Method of Cherry Tomatoes Based on Improved WOA-LSSVM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3541-3550. |
[13] |
HUANG Hua1, LIU Ya2, KUERBANGULI·Dulikun1, ZENG Fan-lin1, MAYIRAN·Maimaiti1, AWAGULI·Maimaiti1, MAIDINUERHAN·Aizezi1, GUO Jun-xian3*. Ensemble Learning Model Incorporating Fractional Differential and
PIMP-RF Algorithm to Predict Soluble Solids Content of Apples
During Maturing Period[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3059-3066. |
[14] |
CHEN Jia-wei1, 2, ZHOU De-qiang1, 2*, CUI Chen-hao3, REN Zhi-jun1, ZUO Wen-juan1. Prediction Model of Farinograph Characteristics of Wheat Flour Based on Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3089-3097. |
[15] |
GUO Ge1, 3, 4, ZHANG Meng-ling3, 4, GONG Zhi-jie3, 4, ZHANG Shi-zhuang3, 4, WANG Xiao-yu2, 5, 6*, ZHOU Zhong-hua1*, YANG Yu2, 5, 6, XIE Guang-hui3, 4. Construction of Biomass Ash Content Model Based on Near-Infrared
Spectroscopy and Complex Sample Set Partitioning[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3143-3149. |
|
|
|
|