Abstract:Terahertz time-domain spectroscopy (THz-TDS) is an important method for rapid and nondestructive material identification due to its spectral fingerprint properties, which has broad application exploitation in the nondestructive inspection of drugs and explosives. Spectral identification is one of the most important aspects of the applied research of THz-TDS. Most existing spectral identification methods are machine-learning based classification of manually selected features or thresholding classification of absorption spectral peak. Those methods are not adapt well to low signal-to-noise ratio, because some materials have few or no spectral absorption peaks features in the terahertz waveband and spectra are affected easily by concentrations of samples, air humidity and noises. Meanwhile computational cost increases with data quantity and category. In recent years, with the rise of deep learning technology, the methods represented by CNN (Convolutional Neural Network) and RNN (Recurrent Neural Network) have been widely applied to fields such as computer vision and natural language processing where they have been shown to produce better results than traditional machine learning methods. Due to the strong nonlinear classification capability of deep learning technology, two networks respectively were designed based on RNN and CNN for spectral identification in this paper: one-dimensional spectral line classifier based on RNN and two-dimensional spectral image classifier based on CNN. To simulate the practical application scenario, over 20 000 terahertz time-domain spectra of 12 materials were measured in a non-vacuum environment as training-set and test-set. After analyzing the effects of concentrations of samples and air humidity on spectra, S-G(Savitzky-Golay) filter was introduced to reduce noises of spectra. Experimental results show that S-G filter could improve the identification accuracy, because processed spectra have more obvious feature compared with the unprocessed spectra; the proposed methods based on RNN and CNN are more accurate and faster on the test-set, compared with traditional machine learning algorithm k-NN (k-Nearest Neighbor); CNN demonstrated better robustness to noises than RNN on spectral identification task. Therefore, deep learning technology could be utilized for quick and effective identification terahertz time-domain spectra, which provide a theoretical and experimental basis for new nondestructive safety inspection techniques.
胡其枫,蔡 健. 基于深度学习的太赫兹时域光谱识别研究[J]. 光谱学与光谱分析, 2021, 41(01): 94-99.
HU Qi-feng, CAI Jian. Research of Terahertz Time-Domain Spectral Identification Based on Deep Learning. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(01): 94-99.
[1] Trofimov V A, Varentsova S A. PLOS ONE, 2018, 13(8): e0201297.
[2] Liu J, Li Z, Hu F, et al. Optical and Quantum Electronics, 2015, 47(2): 313.
[3] Yin X, Mo W, Wang Q, et al. Advances in Condensed Matter Physics, 2018, 2018: 1618750.
[4] Liang J, Guo Q, Chang T, et al. Optik, 2018, 174: 7.
[5] Yin M, Tang S, Tong M. Analytical Methods, 2016, 8(13): 2794.
[6] Mumtaz M, Mahmood A, Khan S D, et al. Applied Spectroscopy, 2017, 71(3): 456.
[7] XIE Qi, YANG Hong-ru, LI Hong-guang, et al(解 琪, 杨鸿儒, 李宏光, 等). Optics and Precision Engineering(光学精密工程), 2016,24(10): 2392.
[8] Naftaly M. IEEE Sensors Journal, 2013, 13(1): 8.
[9] LI Jin, LIU Quan-cheng, XIONG Liang(李 进, 刘泉澄, 熊 亮). Laser & Optoelectronics Progress(激光与光电子学进展), 2018, 55(9): 70.
[10] Liu H, Zhang Z, Yang Y, et al. Optik, 2018, 172: 668.
[11] Yu S, Jia S, Xu C. Neurocomputing, 2017, 219: 88.
[12] Maggiori E, Tarabalka Y, Charpiat G, et al. IEEE Transactions on Geoscience and Remote Sensing, 2017, 55(2): 645.
[13] Karim F, Majumdar S, Darabi H, et al. IEEE Access, 2018, 6: 1662.