Research of Clustering for LAMOST Early M Type Spectra
LIU Jie1, PANG Jing-chang1*, WU Ming-lei1, 3, LIU Cong1, WEI Peng2, YI Zhen-ping1, LIU Meng1
1. School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, Weihai 264209, China
2. Key Laboratory of Optical Astronomy, National Astronomical Observatories, Chinese Academy of Sciences, Beijing 100012, China
3. Harbin University of Science and Technology Rongcheng Campus, Weihai 264209, China
Abstract:Large-scale spectral survey projects such as LAMOST produce a great deal of valuable research data, and how to effectively analyze the data of this magnitude is a current research hotspot. Clustering algorithm is a kind of unsupervised machine learning algorithm, which makes the clustering algorithm deal with the data without knowledge of the domain, and internal law and structure will be found out. Stellar spectral clustering is a very important work in astronomical data processing. It mainly classifies the mass spectral survey data according to its physical and chemical properties. In this paper, we use a variety of clustering algorithms such as K-Means, Bisecting K-Means and OPTICS to do clustering analysis for the early M-type stellar data in LAMOST survey. The performance of these algorithms on the early M-type stellar data is also discussed. In this paper, the performance of the Euclidean distance, the Manhattan distance, the residual distribution distance for the three clustering algorithms are studied, and the clustering algorithm depends on the distance measurement algorithm. The experimental results show that: (1) The clustering algorithm can well analyze the spectral data of the early M-type dwarf star, and the cluster data produced by clustering is very good with the MK classification. (2) The performance of the three different clustering algorithms is different, and Bisecting K-Means has more advantages in stellar spectral subdivision. (3) In the cluster at the same time it will produce some small number of clusters, and some rare celestial bodies can be found from these clusters. OPTICS is relatively suitable for finding rare objects.
刘 杰,潘景昌,吴明磊,刘 聪,韦 鹏,衣振萍,刘 猛. 早M型矮恒星光谱聚类方法与分析[J]. 光谱学与光谱分析, 2017, 37(12): 3904-3907.
LIU Jie, PANG Jing-chang, WU Ming-lei, LIU Cong, WEI Peng, YI Zhen-ping, LIU Meng. Research of Clustering for LAMOST Early M Type Spectra. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2017, 37(12): 3904-3907.
[1] Luo A L, Zhang H T, Zhao Y H, et al. Research in Astronomy and Astrophysics, 2012, 12(9): 1243.
[2] Luo A L, Zhao Y H, Zhao G, et al. Research in Astronomy and Astrophysics, 2015, 15(8): 1095.
[3] Hartigan J A, Wong M A. Journal of the Royal Statistical Society, Series(Applied Statistics), 1979,28(1): 100.
[4] Kanungo T, Mount D M, Netanyahu N S, et al. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 2002, 24(7): 881.
[5] Ankerst M, Breunig M M, Kriegel H P, et al. OPTICS: Ordering Points to Identify the Clustering Structure. ACM Sigmod Record. ACM, 1999, 28(2): 49.
[6] Almeida J S, Prieto C A. The Astrophysical Journal, 2013, 763(1): 50.
[7] LIU Jie, PAN Jing-chang, LUO A-li, et al(刘 杰, 潘景昌, 罗阿理, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2015, 35(12): 3524.
[8] Kong X, Cheng F Z. Monthly Notices of the Royal Astronomical Society, 2001, 323(4): 1035.
[9] Wei P, Luo A, Li Y, et al. The Astronomical Journal, 2014, 147(5): 101.