|
|
|
|
|
|
LAMOST Unknown Spectral Analysis Based on Influence Space and Data Field |
YANG Yu-qing1, CAI Jiang-hui1, 2*, YANG Hai-feng1*, ZHAO Xu-jun1, YIN Xiao-na1 |
1. School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030024, China
2. School of Computer Science and Technology, North University of China, Taiyuan 030051, China
|
|
|
Abstract Based on the spectral data classified as Unknown by LAMOST DR5 Pipeline, the characteristics of low-quality spectra are extracted, and clustering analysis is conducted in this paper. The main work includes: (1) Feature extraction based on influence space and the data field. Firstly, a large number of small clusters are extracted from the low SNR spectrum based on influence space; secondly, each small cluster’s data field is calculated, and the spectrum is sorted using the above field; and then, access the sorted spectrum and the members in its small cluster to obtain the characteristic spectrum. (2) Carry out K-means clustering with the above characteristic spectrum and statistics on the sky area, observed visual ninety, the signal-to-noise ratio in each band, brightness, and spectrometer/fiber distribution for each class of targets. (3) Analysis of clustering results of the low SNR spectra. All low-quality spectra are divided into five categories through cluster analysis: A. The spectral SNR is low, or the spectrum is different from the traditional classification template, but its category can be determined by feature analysis (accounting for 2.7%); B. Suspected characteristic lines or molecular bands that do not match the line table appear at the blue or red end of the spectrum (accounting for 23.6%); C. The SNR at the spectrum’s blue end is very low, and the noise value in this wavelength region is strong. While in other wavelength regions, the features of continuous spectrum and line are weak (accounting for 48%); D. Due to the splicing problem, a protrusion can be seen in the local spectrum between 5 700 and 5 900 Å, and the continuum and line characteristics are poor at other wavelengths (accounting for 24.2%); E. Many default values make it impossible to determine the category of the spectrum (accounting for 1.5%). The experimental results show that this method can not only effectively extract the characteristic spectrum of low SNR spectrum, but also effectively carry out clustering analysis on the characteristic spectrum to reveal their causes, to provide a reference for the formulation of spectrum observation plan and the analysis and processing of low SNR spectrum.
|
Received: 2021-07-23
Accepted: 2022-01-19
|
|
Corresponding Authors:
CAI Jiang-hui, YANG Hai-feng
E-mail: Jianghui@tyust.edu.cn; hfyang@tyust.edu.cn
|
|
[1] Luo A L, Zhang Y X, Zhao Y H. Advanced Software, Control, and Communication Systems for Astronomy, 2004, 5496: 756.
[2] Luo A L, Wu Y, Zhao J K, et al. Advanced Software and Control for Astronomy II, 2008, 7019: 701935.
[3] QU Cai-xia, YANG Hai-feng, CAI Jiang-hui, et al(屈彩霞,杨海峰,蔡江辉,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2020, 40(4): 1304.
[4] Luo A L, Zhang J N, Chen J J, et al. Setting the Scene for Gaia and LAMOST, 2014, 9(298): 428.
[5] Xie L. Chinese Astronomy and Astrophysics, 2019, 43(4) : 579.
[6] Pan J. Research in Astronomy and Astrophysics, 2020, 20(9): 146.
[7] Chakrabarti D, Maji T, Mondal C, et al. Physical Review D, 2017, 95(7): 074028.
[8] Robnik J, Seljak U. Monthly Notices of the Royal Astronomical Society, 2021, 504(4): 5829.
[9] Yang Y Q, Cai J H, Yang H F, et al. Expert Systems with Application, 2020, 139: 112846.
[10] Thébault E, Lesur V, Kauristie K, et al. Space Science Reviews, 2017, 206(1-4): 191.
|
[1] |
MIAO Shu-guang1, SHAO Dan1*, LIU Zhong-yu2, 3, FAN Qiang1, LI Su-wen1, DING En-jie2, 3. Study on Coal-Rock Identification Method Based on Terahertz
Time-Domain Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(06): 1755-1760. |
[2] |
WANG Ling-ling1, 2, 3, WANG Bo1, 2, 3, XIONG Feng1, 2, 3, YANG Lu-cun1, 2, LI Jing-jing4, XIAO Yuan-ming1, 2, 3, ZHOU Guo-ying1, 2*. A Comparative Study of Inorganic Elements in Cultivativing Astragalus Membranaceus From Different Habitats[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(05): 1407-1412. |
[3] |
ZHANG Wei-fang1, 2, FAN Ke-feng3, LEI Jing-wei1, 2*, JI Liang1, 2. Infrared Fingerprint and Multivariate Statistical Analysis of Rehmannia Glutinosa[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(11): 3392-3398. |
[4] |
LIU Ting-yue1, DAI Jing-jing2*, TIAN Shu-fang1. A Neural Network Recognition Method for Garnets Subclass Based on Hyper Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(06): 1758-1763. |
[5] |
LI Meng-li, YIN Yong*, YUAN Yun-xia, LI Xin, LIU Xue-ru. Feature Selection of 3D Fluorescence Data Based on Storage Room Gas and Preliminary Early Warning of Banana Spoilage[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41(02): 558-564. |
[6] |
YU Ying-tao1, WANG Ji-feng1, SUN Yu-ye1, LI Fu-juan2, WAN Chao3. Identification of Adulterated Olive Oil by Two-Dimensional Raman Correlation Spectroscopy With Cooming as a Perturbation Factor[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(12): 3727-3731. |
[7] |
JIANG Bin, QU Mei-xia, LI Qing-wei, CAO Shu-hua, ZHONG Yun-peng*. Study on Spectra Decomposition of White Dwarf Main-Sequence Binary Stars Based on Generation Antagonistic Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(10): 3298-3302. |
[8] |
ZHAO Shang-yong1, ZHOU Zhi-ming1, SONG Chao2*, SUN Chang-kai1, LEI Jun-jie3, GAO Xun1*. Classification Analysis and Heavy Metal Detection of Panax Ginseng Sample by Using LIBS Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(08): 2629-2633. |
[9] |
DING Lu1, LI Meng-ting2, LIU Yang1, ZHU Wen-bi1, LIU Dong-mei3, MOU Mei-rui1, LIU Hai-xue1*. Rapid Screening of Maize F1 Hybrids Based on Near-Infrared Spectrum and Two-Dimensional Correlation Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(07): 2235-2239. |
[10] |
CHEN Yao1, 2, HUANG Chang-ping1*, ZHANG Li-fu1, QIAO Na1, 2. Spectral Characteristics Analysis and Remote Sensing Retrieval of COD Concentration[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(03): 824-830. |
[11] |
HAN Xiu1, 2, SONG Yong-hui1, 2*, ZHANG Guang-cai2, YAN Zong-cheng2, JIN Fang-yuan2, YU Hui-bin2. Application of Solid Surface EEM Fluorescence Spectroscopy for Analyzing Organic Matter Structural Composition of Lake Sediment[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(02): 483-488. |
[12] |
HUANG Min, GUO Chun-li, HE Rui-li, XI Yong-hui. Classification of Color Matching Functions with the Method of Cluster Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(02): 454-460. |
[13] |
BAI Xue-bing, YU Jian-shu, FU Ze-tian, ZHANG Ling-xian, LI Xin-xing*. Application of Spectral Imaging Technology for Detecting Crop Disease Information: A Review[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(02): 350-355. |
[14] |
FU Hai-jun1, 2, ZHOU Shu-bin1, 3, WU Xiao-hong1, 2*, WU Bin4, SUN Jun1, 2, DAI Chun-xia1, 5. Mixed Fuzzy Maximum Entropy Clustering Analysis of FT-NIR Spectra of Tea[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39(11): 3465-3469. |
[15] |
LIU Jun1, 2, FAN Wen-quan3, HU Yong-qing3, LIU Song1, 2, LI Qing-hui1, 2*. Scientific Analysis of Materials and Technology of Jades Unearthed from Cemeteries Dated to Eastern Zhou Dynasty in Xiyasi, Xinzheng[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39(11): 3637-3645. |
|
|
|
|