LAMOST Unknown Spectral Analysis Based on Influence Space and Data Field
YANG Yu-qing1, CAI Jiang-hui1, 2*, YANG Hai-feng1*, ZHAO Xu-jun1, YIN Xiao-na1
1. School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030024, China
2. School of Computer Science and Technology, North University of China, Taiyuan 030051, China
Abstract: This paper extracts the characteristics of low-quality spectra and clusters them, using the spectra classified as Unknown by the LAMOST DR5 pipeline. The main work includes: (1) Feature extraction based on influence space and the data field. First, a large number of small clusters are extracted from the low-SNR spectra based on influence space; second, the data field of each small cluster is calculated and the spectra are sorted by it; finally, the sorted spectra are visited in order, together with the members of their small clusters, to obtain the characteristic spectra. (2) K-means clustering of these characteristic spectra, with statistics for each class of targets on sky area, seeing at the time of observation, signal-to-noise ratio in each band, brightness, and spectrograph/fiber distribution. (3) Analysis of the clustering results for the low-SNR spectra. Cluster analysis divides the low-quality spectra into five categories: A. The SNR is low, or the spectrum differs from the traditional classification templates, but its category can still be determined by feature analysis (2.7%); B. Suspected characteristic lines or molecular bands that do not match the line table appear at the blue or red end of the spectrum (23.6%); C. The SNR at the blue end is very low and the noise in this wavelength region is strong, while the continuum and line features are weak in the other wavelength regions (48%); D. Because of a splicing problem, a local bump appears between 5 700 and 5 900 Å, and the continuum and line features are poor at other wavelengths (24.2%); E. Many default (missing) values make it impossible to determine the category of the spectrum (1.5%).
The experimental results show that this method can effectively extract the characteristic spectra of low-SNR spectra and cluster them in a way that reveals why the spectra are of low quality, providing a reference for formulating spectral observation plans and for analyzing and processing low-SNR spectra.
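The feature-extraction and clustering steps in (1) and (2) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract gives no formulas, so the influence space is assumed here to be the set of mutual k-nearest neighbours, the data field is assumed to be the usual Gaussian-like potential exp(-(d/σ)²) summed over small-cluster members, and a characteristic spectrum is assumed to be the mean of a small cluster. The function names and the parameters `k`, `sigma`, and `n_clusters` are hypothetical.

```python
import numpy as np

def pairwise_dist(X):
    """Euclidean distance matrix between the rows (spectra) of X."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def influence_space(D, k):
    """Boolean matrix: True where i and j are mutual k-nearest neighbours.
    (Assumed definition of influence space: kNN intersected with reverse kNN.)"""
    n = D.shape[0]
    D_no_self = D.copy()
    np.fill_diagonal(D_no_self, np.inf)          # exclude each point from its own kNN
    nearest = np.argsort(D_no_self, axis=1)[:, :k]
    knn = np.zeros((n, n), dtype=bool)
    knn[np.repeat(np.arange(n), k), nearest.ravel()] = True
    return knn & knn.T

def characteristic_spectra(X, k=5, sigma=1.0):
    """Visit spectra in order of decreasing potential and average each one
    with the not-yet-visited members of its small cluster."""
    D = pairwise_dist(X)
    members = influence_space(D, k)
    np.fill_diagonal(members, True)              # a spectrum belongs to its own cluster
    # Assumed data-field potential: Gaussian-like kernel over cluster members.
    phi = (np.exp(-(D / sigma) ** 2) * members).sum(axis=1)
    visited = np.zeros(len(X), dtype=bool)
    feats = []
    for i in np.argsort(-phi):
        if visited[i]:
            continue
        group = members[i] & ~visited
        feats.append(X[group].mean(axis=0))      # assumed: mean as characteristic spectrum
        visited |= group
    return np.array(feats)

def kmeans(F, n_clusters, n_iter=50, seed=0):
    """Plain Lloyd's K-means on the characteristic spectra F (float array)."""
    rng = np.random.default_rng(seed)
    centers = F[rng.choice(len(F), n_clusters, replace=False)]
    for _ in range(n_iter):
        d = ((F[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        labels = d.argmin(axis=1)
        for c in range(n_clusters):
            if np.any(labels == c):              # leave an empty cluster's centre unchanged
                centers[c] = F[labels == c].mean(axis=0)
    return labels, centers
```

With real spectra, each row of `X` would first need to be resampled to a common wavelength grid and flux-normalized, and `sigma` chosen on the scale of typical inter-spectrum distances; `kmeans(characteristic_spectra(X), 5)` would then mirror the five-category split reported above.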
YANG Yu-qing, CAI Jiang-hui, YANG Hai-feng, ZHAO Xu-jun, YIN Xiao-na. LAMOST Unknown Spectral Analysis Based on Influence Space and Data Field [J]. Spectroscopy and Spectral Analysis (光谱学与光谱分析), 2022, 42(04): 1186-1191.