Automatic Classification Method of Star Spectra Data Based on Convolutional Neural Network
SHI Chao-jun1, QIU Bo1*, ZHOU Ya-tong1, DUAN Fu-qing2*
1. School of Electronics and Information Engineering, Hebei University of Technology, Tianjin 300401, China
2. College of Information Science and Technology, Beijing Normal University, Beijing 100875, China

Abstract Automatic classification of stellar spectra is the basis of stellar spectral analysis. Fast and accurate automatic identification and classification of stellar spectra speeds up the search for special celestial objects, which is of great significance to astronomy. At present, LAMOST, a large-scale telescope project in China, releases millions of spectra every year, and fast, accurate automatic identification and classification of massive stellar spectral data has become a hot topic in astronomical data analysis and processing. To address the problem of automatic stellar spectral classification, a new method for classifying the spectra of K and F stars based on a convolutional neural network (CNN) is proposed. Support vector machine (SVM) and back propagation (BP) neural network algorithms serve as comparison algorithms, and cross-validation is used to verify the performance of the classifier. Compared with traditional methods, a CNN shares weights, which reduces the number of parameters the model has to learn, and it extracts features from the training data automatically. The experiments use the TensorFlow deep learning framework in a Python 3.5 programming environment. The K and F stellar spectral dataset is drawn from the LAMOST DR3 data provided by the National Astronomical Observatories of the Chinese Academy of Sciences. Spectra are sampled evenly over the wavelength range 3 500 to 7 500 Å to generate the datasets, which are normalized with the min-max method. The CNN consists of an input layer, a convolution layer C1, a pooling layer S1, a convolution layer C2, a pooling layer S2, a convolution layer C3, a pooling layer S3, a fully connected layer and an output layer. The input layer takes the flux values at 3 700 wavelength points of the K and F star spectra. The C1 layer has 10 convolution kernels of size 1×3 with a stride of 1. The S1 layer uses max pooling.
The S1 sampling window is 1×2 with no overlap. Pooling produces 10 feature maps, the same number as C1, each half the size of a C1 feature map. The C2 layer has 20 convolution kernels of size 1×2 with a stride of 1 and outputs 20 feature maps; the S2 layer outputs 20 feature maps. The C3 layer has 30 convolution kernels of size 1×3 with a stride of 1 and outputs 30 feature maps; the S3 layer outputs 30 feature maps. The fully connected layer has 50 neurons, each connected to all neurons in the S3 layer. The output layer has 2 neurons and yields the classification result. The convolution layers use the ReLU activation function, and the output layer uses the softmax function. The SVM used for comparison is of type C-SVC with a radial basis function kernel. The BP network has three hidden layers with 20, 40 and 20 neurons, respectively. The dataset is divided into training data and test data. Subsets comprising 40%, 60%, 80% and 100% of the training data are used as training sets, and the test data serve as the test set. Each training set is fed to the model, which is trained for 8 000 iterations, and each trained model is then validated on the test set. For the comparative experiments, 100% of the training data is used as the training set and the test data as the test set. Precision, recall, F-score and accuracy are used to evaluate model performance. The experimental results are analyzed in detail. The analysis shows that the CNN algorithm can quickly and automatically classify and screen K and F star spectra, and that the larger the training set, the stronger the generalization ability of the model and the higher the classification accuracy.
Comparative experiments show that the CNN algorithm significantly outperforms the SVM and BP algorithms in the automatic classification of K and F star spectra.
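
The preprocessing and layer dimensions described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' TensorFlow implementation. The helper names (`min_max_normalize`, `conv_len`, `pool_len`, `trace_shapes`) are hypothetical, and two points the abstract does not state are assumed here: convolutions use no padding, and S2 and S3 use the same non-overlapping 1×2 window as S1.

```python
def min_max_normalize(flux):
    """Linearly rescale a list of flux values into the range [0, 1]."""
    lo, hi = min(flux), max(flux)
    if hi == lo:
        # A constant spectrum has no dynamic range; map it to zeros.
        return [0.0 for _ in flux]
    return [(v - lo) / (hi - lo) for v in flux]


def conv_len(n, kernel, stride=1):
    """Output length of a 1-D convolution without padding (assumption)."""
    return (n - kernel) // stride + 1


def pool_len(n, window):
    """Output length of non-overlapping max pooling (stride equals window)."""
    return n // window


# (layer name, kind, kernel or window width, number of feature maps).
# Kernel sizes and map counts follow the abstract; the S2 and S3 window
# width of 2 is an assumption carried over from S1.
LAYERS = [
    ("C1", "conv", 3, 10),
    ("S1", "pool", 2, 10),
    ("C2", "conv", 2, 20),
    ("S2", "pool", 2, 20),
    ("C3", "conv", 3, 30),
    ("S3", "pool", 2, 30),
]


def trace_shapes(n_points=3700):
    """Return (layer, feature maps, map length) for each layer in order."""
    shapes = []
    length = n_points
    for name, kind, size, maps in LAYERS:
        if kind == "conv":
            length = conv_len(length, size)
        else:
            length = pool_len(length, size)
        shapes.append((name, maps, length))
    return shapes


if __name__ == "__main__":
    for name, maps, length in trace_shapes():
        print(f"{name}: {maps} feature maps of length {length}")
```

Under these assumptions each S1 feature map is exactly half the length of a C1 feature map, which matches the description above. The flattened S3 output then feeds the 50-neuron fully connected layer and the 2-neuron softmax output layer.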
Received: 2018-02-07
Accepted: 2018-06-12
Corresponding Authors:
QIU Bo, DUAN Fu-qing
E-mail: qiubo@hebut.edu.cn; fqduan@hnu.edu.cn