|
|
|
|
|
|
Automatic Classification Method of Star Spectra Based on
Semi-Supervised Mode |
JIANG Xia*, QIU Bo, WANG Lin-qian, GUO Xiao-yu |
School of Electronics and Information Engineering,Hebei University of Technology,Tianjin 300401,China
|
|
|
Abstract With the implementation of more and more large-scale spectral surveys, many star spectral data are generated, which is of great significance to the study of star evolution theory, but also brings great challenges to the traditional spectral classification and processing. In the LAMOST DR7 (v2.0 version) spectral data set released in 2021, the total spectrum of stars has reached the order of millions, but the number of O-type is only 129, and the data set is seriously unbalanced. Traditional machine learning classification methods cannot achieve good results for such a large amount of data and a serious imbalance of data sets. Therefore, they are mostly used to classify the spectra of adjacent stars of two types, partial types or subclasses. An automatic star spectrum classification method based on semi-supervised mode of one dimensional convolutional neural network (CNN) and one dimensional generative adversarial network (GAN) is used to solve this problem. Firstly, each spectrum is tailored and denoised, and the wavelength range of the spectrum is 370.00~867.16 nm. Then, uniform sampling and normalization are carried out to generate 1×3 700 data set samples, which are sent to one-dimensional CNN for training. In order to avoid overfitting and improve model’s prediction ability for unknown data, dropout is applied between the full connection layer and the pooling layer. This one-dimensional CNN network is used to classify six kinds of spectra except for O-type, and the average classification accuracy is 98.08%. This experiment uses one-dimensional GAN network to solve the problem that the number of O-type is seriously small. The input of GAN is a noise signal whose size is 1×900.Through the fully connected three-layer convolution operation in the generator, the output size is 1×3 700. The separate alternating iterative training of the generator and discriminator finally outputs the required number of O-type star samples to expand the data set. Compared with oversampling, using data sets expanded by GAN to classify the star spectra can improve the classification accuracy of O-type stars from 72.92% to 97.92%, and the accuracy of the classifier can reach 96.28%. The experimental results show that the automatic star spectra classification method using this semi-supervised mode can realize the rapid and accurate classification of seven types of star spectra and can also be used to mine the unclassified spectra marked as “unknown”.
|
Received: 2022-03-18
Accepted: 2022-06-06
|
|
Corresponding Authors:
JIANG Xia
E-mail: xunjiang@hebut.edu.cn
|
|
[1] Li Xiangru,Pan Ruyang,Duan Fuqing. Research in Astronomy and Astrophysics,2017,17(4):36.
[2] Morgan W, Keenan P. Annual Review of Astronomy and Astrophysics, 1973, 11: 29.
[3] Du Bing,Luo A-li,Zhang Shuo,et al. Research in Astronomy and Astrophysics,2021,21(8):202.
[4] Fan J,Han F,Liu H. Nationanl Science Review,2014,1(2):293.
[5] Schierscher F,Paunzen E. Astronomische Nachrichten,2011,332(6):597.
[6] Liu C,Cui W Y,Zhang B,et al. Research in Astronomy and Astrophysics,2015,15(8):1137.
[7] Kheirdastan S,Bazarghan M. Astrophysics and Space Science,2016,361(9):304.
[8] SHI Chao-jun,QIU Bo,ZHOU Ya-tong,et al(石超君,邱 波,周亚同,等). Spectroscopy and Spectral Analysis (光谱学与光谱分析),2019,39(4):1312.
[9] Wen Xiaoqing,Yang Jinmeng. Chinese Journal of Physics,2021,69:303.
[10] XU Ting-ting,MA Chen-ye,ZHANG Jing-min,et al(许婷婷,马晨晔,张静敏,等). Acta Astronomica Sinica(天文学报),2019,60(2):11.
[11] Du Changde,Luo Ali, Yang Haifeng. New Astronomy,2017,51:51.
[12] Zheng Zipeng,Qiu Bo,Luo A-li,et al. Publications of the Astronomical Society of the Pacific,2020,132:024504.
[13] Si Jianmin,Li Yinbi,Luo A-li,et al. Research in Astronomy and Astrophysics,2015,15(10):1671.
|
[1] |
YU Shui1, HUAN Ke-wei1*, LIU Xiao-xi2, WANG Lei1. Quantitative Analysis Modeling of Near Infrared Spectroscopy With
Parallel Convolution Neural Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(06): 1627-1635. |
[2] |
LI Hao1, ZHAO Qing1*, CUI Chen-zhou2, FAN Dong-wei2, ZHANG Cheng-kui1, SHI Yan-cui1, WANG Yuan1. A Stellar Spectrum Classification Algorithm Based on CNN and LSTM Composite Deep Learning Model[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(06): 1668-1675. |
[3] |
WANG Zi-xuan1, YANG Liang2, 3, 4*, HUANG Ling-xia2, HE Yong4, ZHAO Li-hua3, ZHAN Peng-fei3. Nondestructive Determination of TSS Content in Postharvest Mulberry Fruits Using Hyperspectral Imaging and Deep Learning[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(06): 1724-1730. |
[4] |
HU Jin-quan1, 2, YANG Hui-hua1, ZHAO Guo-liang3, ZHOU Rui-zhi4, LI Ling-qiao5. Prediction Method of Wool Content in Waste Spinning Samples Based on Semi Supervised Regression of Generative Adversarial Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(05): 1417-1424. |
[5] |
LIU Zong-yi1, 2, ZHANG Cai-hong1, 2, JIANG Jian-kang1, 2, SHEN Bin-guo3, DING Yan-fei1, 2, ZHANG Lei-lei1, 2*, ZHU Cheng1, 2*. Rapid Determination of Total Flavonoids in Dendrobium Officinale Based on Raman Spectroscopy and CNN-LSTM Deep Learning Method[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(04): 1018-1024. |
[6] |
YANG Wan-qi1, 2, LI Zhi-qi1, 3, LI Fu-sheng1, 2*, LÜ Shu-bin1, 2, FAN Jia-jing1, 2. A Combined CARS and 1D-CNN Method for the Analysis of Heavy Metals Exceedances in Soil by XRF Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(03): 670-674. |
[7] |
TANG Jie1, LUO Yan-bo2, LI Xiang-yu2, CHEN Yun-can1, WANG Peng1, LU Tian3, JI Xiao-bo4, PANG Yong-qiang2*, ZHU Li-jun1*. Study on One-Dimensional Convolutional Neural Network Model Based on Near-Infrared Spectroscopy Data[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(03): 731-736. |
[8] |
SUN He-yang1, ZHOU Yue1, 2, LI Si-jia1, 2, LI Li1, YAN Ling-tong1, FENG Xiang-qian1*. Identification of Ancient Ceramic by Convolution Neural Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(02): 354-358. |
[9] |
ZHANG Hai-liang1, ZHOU Xiao-wen1, LIU Xue-mei2*, LUO Wei2, ZHAN Bai-shao2, PAN Fan3. Freshness Identification of Turbot Based on Convolutional Neural
Network and Hyperspectral Imaging Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(02): 367-371. |
[10] |
MENG Xing-zhi, LIU Ya-qiu*, LIU Li-na. Raman Spectroscopy Combined With the WGANGP-ResNet Algorithm to Identify Pathogenic Species[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(02): 542-547. |
[11] |
ZHANG Fan1, WANG Wen-xiu1, WANG Chun-shan2, ZHOU Ji2, PAN Yang3, SUN Jian-feng1*. Study on Hyperspectral Detection of Potato Dry Rot in Gley Stage Based on Convolutional Neural Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(02): 480-489. |
[12] |
LI Xin-ting, ZHANG Feng, FENG Jie*. Convolutional Neural Network Combined With Improved Spectral
Processing Method for Potato Disease Detection[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 215-224. |
[13] |
LAN Yan1,WANG Wu1,XU Wen2,CHAI Qin-qin1*,LI Yu-rong1,ZHANG Xun2. Discrimination of Planting and Tissue-Cultured Anoectochilus Roxburghii Based on SMOTE and Inception-CNN[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 158-163. |
[14] |
WANG Qi-biao1, HE Yu-kai1, LUO Yu-shi1, WANG Shu-jun1, XIE Bo2, DENG Chao2*, LIU Yong3, TUO Xian-guo3. Study on Analysis Method of Distiller's Grains Acidity Based on
Convolutional Neural Network and Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3726-3731. |
[15] |
HUANG Meng-qiang1, KUANG Wen-jian2, 3*, LIU Xiang1, HE Liang4. Quantitative Analysis of Cotton/Polyester/Wool Blended Fiber Content by Near-Infrared Spectroscopy Based on 1D-CNN[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3565-3570. |
|
|
|
|