|
|
|
|
|
|
UMAP-Assisted Fuzzy C-Clustering Method for Recognition of
Terahertz Spectrum |
YI Can-can1, 2, 3, 5*, TUO Shuai1, 2, 3, TU Shan1, 2, 3, 4, ZHANG Wen-tao5 |
1. Key Laboratory of Metallurgical Equipment and Control Technology of Ministry of Education,Wuhan University of Science and Technology, Wuhan 430081, China
2. Hubei Provincial Key Laboratory of Mechanical Transmission and Manufacturing Engineering,Wuhan University of Science and Technology, Wuhan 430081, China
3. Precision Manufacturing Institute,Wuhan University of Science and Technology, Wuhan 430081, China
4. School of Physical Science and Technology,Guangxi Normal University, Guilin 541004, China
5. School of Electronic Engineering and Automation,Guilin University of Electronic Technology, Guilin 541004, China
|
|
|
Abstract Terahertz (THz) waves characterized by low energy, instantaneity and proficiency in spectral analysis have a promising futures in material identification. Although the existing substance identification methods based on THz have achieved certain effects, they are prone to fall into local optimization, resulting in low identification accuracy. Uniform manifold Approximation and Projection (UMAP), as a nonlinear dimensionality reduction method, assume that the data are uniformly distributed on Riemannian manifolds, which can be used to model manifolds with fuzzy topology. UMAP dimension reduction is to optimize the layout of data representation in low-dimensional space by minimizing the cross-entropy between two topological representations. The initial clustering centre is often given randomly in the traditional fuzzy C-clustering method (FCM). When the initial clustering center is not selected correctly, it is easy to fall into the local optimum, leading to wrong clustering. To this end, this paper proposes a Uniform Manifold Approximation and Projection (UMAP) assisted fuzzy C-clustering algorithm. Firstly, UMAP is used to reduce the dimensionality of the input THz sample matrix. And then, based on the principle of maximizing the distance between categories, the appropriate initial clustering center is selected. Finally, the fuzzy C-means method is employed to perform the clustering analysis. This proposed algorithm can solve the overcrowding problem between categories in the clustering process and reflect the distance information between categories to facilitate the selection of appropriate initial clustering centers. In order to verify the reliability of the algorithm proposed in this paper, four different types of genetically modified cotton seeds of Lu Mianyan28, Lu Mianyan29, Lu Mianyan36, and Zhongmian28 were detected by using THz time-domain spectroscopy technology. Then, the UMAP-assisted fuzzy C-clustering method was used to cluster the absorbance spectral data of four different types of genetically modified cotton seeds. The different cotton seeds are successfully well separated, and the clustering effect with a total correct rate of 0.9833 is obtained. The result fully demonstrates that the fuzzy C-clustering method based on UMAP-assisted proposed in this paper has a good application prospect in identifying material THz spectrum.
|
Received: 2021-08-02
Accepted: 2021-11-03
|
|
Corresponding Authors:
YI Can-can
|
|
[1] Hu Xiaohua, Lang Wenhui, Liu Wei, et al. Journal of Infrared, Millimeter and Terahertz Waves, 2017, 38(8): 980.
[2] You Chengwu, Wang Tianyi, Qian Shunrong, et al. Applied Optics, 2018, 57(17): 4884.
[3] SHI Wei, WANG Hai-qing, HOU Lei, et al(施 卫, 王海青, 侯 磊, 等). SCIENTIA SINICA Physica, Mechanica & Astronomica(中国科学: 物理学、力学、天文学), 2021, 51(5): 054210.
[4] RAO Jin-qiu, ZHAO Qi-duo, QIU Feng(饶近秋, 赵启铎, 邱 峰). China Journal of Chinese Materia Medica(中国中药杂志), 2020, 45(4): 825.
[5] YANG Fan, YU Xiao, LIU Li, et al(杨 帆, 余 晓, 刘 黎, 等). Transactions of China Electrotechnical Society(电工技术学报), 2021, 36(4): 777.
[6] MA Qing-xiao, LI Chun, LI Tian-ying, et al(马卿效, 李 春, 李天莹, 等). Progress in Laser and Optoelectronics(激光与光电子学进展), 2020, 57(13): 130006.
[7] Cao Yunqi, Chen Jiani, Zhang Guangxin, et al. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2021, 256: 119713.
[8] Qin Binyi, Li Zhi, Chen Tao, et al. Optik-International Journal for Light and Electron Optics, 2017, 142: 576.
[9] CHEN Tao, LI Zhi, HU Fang-rong, et al(陈 涛, 李 智, 胡放荣, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2017, 37(2): 618.
[10] Liu Jianjun, Li Zhi, Hu Fangrong, et al. Optik-International Journal for Light and Electron Optics, 2014, 125(23): 6914.
[11] Sivertsen Edvard, Thyholt Kari, Rustad Turid, et al. Food Control, 2021, 131: 108385.
[12] SHAN Zhi-lei, ZHANG Wang-fei, ZHAO Xi-lin, et al(单治磊, 张王菲, 赵熙临, 等). Journal of Southwest Forestry University·Natural Science(西南林业大学学报·自然科学), 2017, 142(6): 188.
[13] Tang Qiu, Chai Yi, Qiu Jianfeng, et al. Journal of Process Control, 2019, 81: 76.
[14] Mcinnes L, Healy J. Journal of Open Source Software, 2018, 3(29): 861.
[15] TANG Zheng-hua(汤正华). Acta Metrologica Sinica(计量学报), 2020, 41(4): 505.
|
[1] |
WAN Mei, ZHANG Jia-le, FANG Ji-yuan, LIU Jian-jun, HONG Zhi, DU Yong*. Terahertz Spectroscopy and DFT Calculations of Isonicotinamide-Glutaric Acid-Pyrazinamide Ternary Cocrystal[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3781-3787. |
[2] |
LIU Xin-peng1, SUN Xiang-hong2, QIN Yu-hua1*, ZHANG Min1, GONG Hui-li3. Research on t-SNE Similarity Measurement Method Based on Wasserstein Divergence[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3806-3812. |
[3] |
LI Yang1, LI Xiao-qi1, YANG Jia-ying1, SUN Li-juan2, CHEN Yuan-yuan1, YU Le1, WU Jing-zhu1*. Visualisation of Starch Distribution in Corn Seeds Based on Terahertz Time-Domain Spectral Reflection Imaging Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2722-2728. |
[4] |
CAO Qian, MA Xiang-cai, BAI Chun-yan, SU Na, CUI Qing-bin. Research on Multispectral Dimension Reduction Method Based on Weight Function Composed of Spectral Color Difference[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2679-2686. |
[5] |
JIN Chun-bai1, YANG Guang1*, LU Shan2*, LIU Wen-jing1, LI De-jun1, ZHENG Nan1. Band Selection Method Based on Target Saliency Analysis in Spatial Domain[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2952-2959. |
[6] |
ZHENG Zhi-jie1, LIN Zhen-heng1, 2*, XIE Hai-he2, NIE Yong-zhong3. The Method of Terahertz Spectral Classification and Identification for Engineering Plastics Based on Convolutional Neural Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(05): 1387-1393. |
[7] |
WANG Yu-ye1, 2, LI Hai-bin1, 2, JIANG Bo-zhou1, 2, GE Mei-lan1, 2, CHEN Tu-nan3, FENG Hua3, WU Bin4ZHU Jun-feng4, XU De-gang1, 2, YAO Jian-quan1, 2. Terahertz Spectroscopic Early Diagnosis of Cerebral Ischemia in Rats[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(03): 788-794. |
[8] |
YUAN Zhuang1, DONG Da-ming2*. Near-Infrared Spectroscopy Measurement of Contrastive Variational Autoencoder and Its Application in the Detection of Liquid Sample[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(11): 3637-3641. |
[9] |
CAO Yu-qi2, KANG Xu-sheng1, 2*, CHEN Piao-yun2, XIE Chen2, YU Jie2*, HUANG Ping-jie2, HOU Di-bo2, ZHANG Guang-xin2. Research on Discrimination Method of Absorption Peak in Terahertz
Regime[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(10): 3058-3062. |
[10] |
BAI Zi-jin1, PENG Jie1*, LUO De-fang1, CAI Hai-hui1, JI Wen-jun2, SHI Zhou3, LIU Wei-yang1, YIN Cai-yun1. A Mid-Infrared Spectral Inversion Model for Total Nitrogen Content of Farmland Soil in Southern Xinjiang[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(09): 2768-2773. |
[11] |
LI Yan1, LIU Qi-hang2, 3, HUANG Wei1, DUAN Tao1, CHEN Zhao-xia1, HE Ming-xia2, 3, XIONG Yu1*. Terahertz Imaging Study of Dentin Caries[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2374-2379. |
[12] |
MIAO Shu-guang1, SHAO Dan1*, LIU Zhong-yu2, 3, FAN Qiang1, LI Su-wen1, DING En-jie2, 3. Study on Coal-Rock Identification Method Based on Terahertz
Time-Domain Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(06): 1755-1760. |
[13] |
CAO Yao-yao1, 2, 4, LI Xia1, BAI Jun-peng2, 4, XU Wei2, 4, NI Ying3*, DONG Chuang2, 4, ZHONG Hong-li5, LI Bin2, 4*. Study on Qualitative and Quantitative Detection of Pefloxacin and
Fleroxacin Veterinary Drugs Based on THz-TDS Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(06): 1798-1803. |
[14] |
ZHANG Liu, ZHANG Jia-kun, LÜ Xue-ying, SONG Hong-zhen, WANG Wen-hua*. Research on Tunable Spectrum Reconstruction[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(05): 1378-1384. |
[15] |
PAN Zhao1, LI Zong-liang1, ZHANG Zhen-wei2, WEN Yin-tang1, ZHANG Peng-yang1. Defect Detection and Analysis of Ceramic Fiber Composites Based on
THz-TDS Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(05): 1547-1552. |
|
|
|
|