|
|
|
|
|
|
Stellar Spectra Classification by Support Vector Machine with Unlabeled Data |
LIU Zhong-bao1,2, LEI Yu-fei1, SONG Wen-ai2, ZHANG Jing2, WANG Jie3, TU Liang-ping4 |
1. School of Software, Quanzhou University of Information Engineering, Quanzhou 362000,China
2. School of Software, North University of China, Taiyuan 030051, China
3. Xinjiang Astronomical Observatory, Chinese Academy of Sciences, Urumqi 830011, China
4. School of Science, University of Science and Technology Liaoning, Anshan 114051, China |
|
|
Abstract Stellar spectra classification is one of hot spots in astronomical techniques and methods. With continuous operation and improvement of observation apparatus, hundreds and thousands of spectra were obtained by researchers, which presented challenges to process them manually. In view of this, data mining algorithms have attracted more attentions, and have been utilized to deal with the spectra. Neural networks, self organization mapping, association rules and other data mining algorithms have been utilized to classify the stellar spectra in recent years. In these algorithms, Support Vector Machine (SVM) is much more popular due to its good learning capability and excellent classification performance. The basic idea of standard SVM is to find an optimal separating hyper-plane between the positive and negative samples. SVM as a convex programming problem has a unique optimal solution, which can be posed as a quadratic programming (QP) problem. In order to further improve the classification efficiency, Twin Support Vector Machine (TSVM) has been proposed. It aims at generating two non-parallel hyper-planes such that each plane is close to one class and as far as possible from the other one. The learning speed of TSVM is approximately four times faster than that of the classical SVM. TSVM receives many attentions since it shows low computational complexity, and many variants of TSVM have been proposed in literatures. During the process of stellar spectra classification, the classification model is built based on the observation data. The key step is to manually label the spectra, which is time-consuming and painstaking. Therefore, how to construct the spectra classification model based on the labeled and unlabeled spectra is a problem deserving study. In order to effectively classify the stellar spectra, Twin Support Vector Machine with Unlabeled Data (TSVMUD) is proposed in this paper. In TSVMUD, the stellar spectra are firstly divided into two parts, one is for training, and the other is for test. Then, the proposed method TSVMUD is utilized on the training data and the classification model is obtained. At last, the spectra in the test dataset are verified by the classification model. TSVMUD not only preserve the advantage of low computational complexity, but also improve the classification efficiency by taking both the labeled and unlabeled data into consideration. The comparative experiments on the SDSS datasets verify that TSVMUD performs better than the traditional classifiers, such as SVM, TSVM, KNN (K Nearest Neighbor). However, some limitations exist in TSVMUD, for example, how to deal with the mass spectra is quite difficult to solve. Inspired by random sampling, we will research the adaptability of our proposed method in the big data environment based on big data technologies.
|
Received: 2018-02-11
Accepted: 2018-06-19
|
|
|
[1] Bazarghan M. Astrophysics and Space Science, 2012, 337(1): 93.
[2] Navarro S G, Corradi R L M, Mampaso A. Astronomy and Astrophysics, 2012, 538(1): 143.
[3] Bolton A S, Schlegel D J, Aubourg E, et al. The Astronomical Journal, 2012, 144(5): 507.
[4] Hernandez R D, Barreto H P, Robles L A, et al. Experimental Astronomy, 2014, 38(1): 193.
[5] Gray R O, Corbally C J. The Astronomical Journal, 2014, 147(4): 80.
[6] Fuentes O, Gulati R K. Proceedings of the 7th Texas-Mexico Conference on Astrophysics: Flows, Blows and Glows, 2001. 209.
[7] JIANG Bin, LI Zi-xuan, QU Mei-xia, et al(姜 斌, 李紫宣, 曲美霞, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2016, 36(7): 2275.
[8] Bu Y D, Chen F Q, Pan J C. New Astronomy, 2014, 28: 35.
[9] Bu Y D, Pan J C, Jiang B, et al. Publications of the Astronomical Society of Japan, 2013, 65(4): 173.
[10] Cai J H, Zhao X J, Sun S W, et al. Research in Astronomy and Astrophysics, 2013, 13(3): 334.
[11] Liu Z B, Song L P, Zhao W J. Monthly Notices of the Royal Astronomical Society, 2016, 455(4): 4289.
[12] Liu Z B. Journal of Astrophysics and Astronomy, 2016, 37(2): 1. |
[1] |
FAN Ya-wen, LIU Yan-ping*, QIU Bo, JIANG Xia, WANG Lin-qian, WANG Kun. Research on Spectral Classification of Stellar Subtypes Based on
SSTransformer[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(08): 2523-2528. |
[2] |
JIANG Xia*, QIU Bo, WANG Lin-qian, GUO Xiao-yu. Automatic Classification Method of Star Spectra Based on
Semi-Supervised Mode[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(06): 1875-1880. |
[3] |
LU Ya-kun1, QIU Bo1*, LUO A-li2, GUO Xiao-yu1, WANG Lin-qian1, CAO Guan-long1, BAI Zhong-rui2, CHEN Jian-jun2. Classification of 2D Stellar Spectra Based on FFCNN[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(06): 1881-1885. |
[4] |
WANG Tian-xiang1, 2, FAN Yu-feng1*, WANG Xiao-li1, LONG Qian1, WANG Chuan-jun1. Prediction of Stellar Spectrum Categories Based on Deep Residual Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(05): 1602-1606. |
[5] |
JIANG Bin, ZHAO Zi-liang, WANG Shu-ting, WEI Ji-yu, QU Mei-xia*. Decomposition and Classification of Stellar Spectra Based on t-SNE[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(09): 2913-2917. |
[6] |
WANG Nan-nan1, QIU Bo1*, MA Jie1*, SHI Chao-jun1, SONG Tao1, GUO Ping2*. Fast Classification Method of Star Spectra Data Based on Convolutional Neural Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39(10): 3297-3301. |
[7] |
XUE Ren-zheng1, CHEN Shu-xin1*, HUANG Hong-ben2. Line Index of A-Type Stellar Astronomical Spectrum Predict Effective Temperature by Ridge Regression Model[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39(08): 2624-2629. |
[8] |
LIU Zhong-bao1, 2, QIN Zhen-tao1, LUO Xue-gang1, ZHOU Fang-xiao1, ZHANG Jing1. Stellar Spectra Classification by Support Vector Machine with Spectral Distribution Properties[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39(04): 1307-1311. |
[9] |
SHI Chao-jun1, QIU Bo1*, ZHOU Ya-tong1, DUAN Fu-qing2*. Automatic Classification Method of Star Spectra Data Based on Convolutional Neural Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39(04): 1312-1316. |
[10] |
ZHANG Jing, LIU Zhong-bao*, SONG Wen-ai, FU Li-zhen, ZHANG Yong-lai. Stellar Spectra Classification Method Based on Multi-Class Support Vector Machine[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2018, 38(07): 2307-2310. |
[11] |
YU Hui-ling1, MEN Hong-sheng2, LIANG Hao2, ZHANG Yi-zhuo2*. Near Infrared Spectroscopy Identification Method of Wood Surface Defects Based on SA-PBT-SVM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2018, 38(06): 1724-1728. |
[12] |
CHEN Shu-xin1, 2,SUN Wei-min1*,KONG Xiao3. Similarity Measurement Among Classification Templates for LAMOST Stellar Spectra[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2018, 38(06): 1922-1925. |
[13] |
LIU Zhong-bao1, REN Juan-juan2, SONG Wen-ai1*, ZHANG Jing1, KONG Xiao2, FU Li-zhen1. Stellar Spectra Classification with Entropy-Based Learning Machine[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2018, 38(02): 660-664. |
[14] |
WANG Guang-pei1, PAN Jing-chang1*, YI Zhen-ping1, WEI Peng2, JIANG Bin1 . Outlier Data Mining and Analysis of LAMOST Stellar Spectra in Line Index Feature Space[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2016, 36(10): 3364-3368. |
[15] |
WANG Guang-pei1, PAN Jing-chang1*, YI Zhen-ping1, WEI Peng2, JIANG Bin1 . Research on the Clustering of Massive Stellar Spectra Based on Line Index [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2016, 36(08): 2646-2650. |
|
|
|
|