光谱学与光谱分析 |
|
|
|
|
|
A Method of Stellar Spectral Classification Based on Map/Reduce Distributed Computing |
PAN Jing-chang1, WANG Jie1, JIANG Bin1, LUO A-li1, 2, WEI Peng2, ZHENG Qiang3 |
1. School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, Weihai 264209, China 2. Key Laboratory of Optical Astronomy, National Astronomical Observatories, Chinese Academy of Sciences, Beijing 100012, China 3. College of Computer and Control Engineering, Yantai University, Yantai 264005, China |
|
|
Abstract Celestial spectrum contains a great deal of astrophysical information. Through the analysis of spectra, people can get the physical information of celestial bodies, as well as their chemical composition and atmospheric parameters. With the implementation of LAMOST, SDSS telescopes and other large-scale surveys, massive spectral data will be produced, especially along with the formal operation of LAMOST, 2 000 to 4 000 spectral data will be generated each observation night. It requires more efficient processing technology to cope with such massive spectra. Automatic classification of stellar spectra is a basic content of spectral processing. The main purpose of this paper is to research the automatic classification of massive stellar spectra. The Lick index is a set of standard indices defined in astronomical spectra to describe the spectral intensity of spectral lines, which represent the physical characteristics of spectra. Lick index is a relatively wide spectral characteristics, each line index is named after the most prominent absorption line. In this paper, the Bayesian method is used to classify stellar spectra based on Lick line index, which divides stellar spectra to three subtypes: F, G, K. First of all, Lick line index of spectra is calculated as the characteristic vector of spectra, and then Bayesian method is used to classify these spectra. For massive spectra, the computation of Lick indices and the spectral classification using Bayesian decision method are implemented on Hadoop. With use of the high throughput and good fault tolerance of HDFS, combined with the advantages of MapReduce parallel programming model, the efficiency of analysis and processing for massive spectral data have been improved significantly. The main innovative contributions of this thesis are as follows. (1) Using Lick indices as the characteristic to classify stellar spectra based on Bayesian decision method. (2) Implementing parallel computation of Lick indices and parallel classification of stellar spectra using Bayesian based on Hadoop MapReduce distributed computing framework.
|
Received: 2015-03-02
Accepted: 2015-08-15
|
|
Corresponding Authors:
PAN Jing-chang
E-mail: pjc@sdu.edu.cn
|
|
[1] Sloan Digital Sky Survey: http://www.sdss.org/. [2] Jianmin Si, et al. Science China-Physics Mechanics & Astronomy, 2014, 57(1): 176. [3] LAMOST Experiment for Galactic Understanding and Exploration(LEGUE)—The Survey’s Science Plan. Research in Astronomy and Astrophysics, 2012, 12(7): 735. [4] Cui X, et al. Research in Astronomy and Astrophysics, 2012, 12(9): 1197. [5] Bu Yude, Chen Fuqiang, Pan Jingchang. New Astronomy, 2014, 28: 35. [6] Navarro S G, Corradi R L M, Mampaso A. Astronomy & Astrophysics, 2012, 538. [7] Daniel Thomas, Claudia Maraston, Jonas Johansson. Monthly Notices of the Royal Astronomical Society, 2011, 412(4): 2183. [8] Jonas Johansson, Daniel Thomas, Claudia Maraston. Monthly Notices of the Royal Astronomical Society, 2010, 406(1): 165. [9] Franchini M, et al. Astrophysical Journal, 2011, 730(2): 117. |
[1] |
JIANG Bin, ZHAO Zi-liang, WANG Shu-ting, WEI Ji-yu, QU Mei-xia*. Decomposition and Classification of Stellar Spectra Based on t-SNE[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(09): 2913-2917. |
[2] |
XUE Ren-zheng1, CHEN Shu-xin1*, HUANG Hong-ben2. Line Index of A-Type Stellar Astronomical Spectrum Predict Effective Temperature by Ridge Regression Model[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39(08): 2624-2629. |
[3] |
LIU Zhong-bao1, REN Juan-juan2, SONG Wen-ai1*, ZHANG Jing1, KONG Xiao2, FU Li-zhen1. Stellar Spectra Classification with Entropy-Based Learning Machine[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2018, 38(02): 660-664. |
[4] |
YU Jing-jing1, PANG Jing-chang1*, MENG Fan-long1, WEI Peng2. The Automatic Detection of the Continuum Problem in the Stellar Spectra Based on Distance Metric[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2017, 37(07): 2246-2249. |
[5] |
WANG Guang-pei1, PAN Jing-chang1*, YI Zhen-ping1, WEI Peng2, JIANG Bin1 . Outlier Data Mining and Analysis of LAMOST Stellar Spectra in Line Index Feature Space[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2016, 36(10): 3364-3368. |
[6] |
WANG Guang-pei1, PAN Jing-chang1*, YI Zhen-ping1, WEI Peng2, JIANG Bin1 . Research on the Clustering of Massive Stellar Spectra Based on Line Index [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2016, 36(08): 2646-2650. |
[7] |
WANG Jie, PAN Jing-chang*, TAN Xin . Application of KPLSR Based on Line Index in Stellar Atmospheric Physical Parameter Measurement[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2014, 34(03): 833-837. |
[8] |
TAN Xin1, PAN Jing-chang1*, WANG Jie1, LUO A-li2, TU Liang-ping3 . Line Index Stellar Atmospheric Physical Parameter Measurement Based on Artificial Neural Network [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2013, 33(06): 1701-1705. |
[9] |
TAN Xin1, PAN Jing-chang1*, WANG Jie1 , LUO A-li2 , TU Liang-ping3 . Stellar Spectrum Parameter Measurement Based on Line Index by Linear Regression [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2013, 33(05): 1397-1400. |
[10] |
YANG Hui-hua1, DU Ling-ling2, LI Ling-qiao2, TANG Tian-biao2, GUO Tuo2, LIANG Qiong-lin3, WANG Yi-ming3, LUO Guo-an3. Parallel PLS Aigorithm Using MapReduce and Its Aplication in Spectral Modeling [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2012, 32(09): 2399-2404. |
[11] |
ZHANG Jian-nan1, ZHAO Yong-heng1, LIU Rong2 . An Automated Stellar Spectra Classification System Based on Non-Parameter Regression and Nearest Neighbor Method[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2009, 29(12): 3424-3428. |
|
|
|
|