The “Unknown” Spectral Classification Study of LAMOST:
ODS-YOLOv7 Model
WANG Xiao-min1, GAO Jun-ping1*, PU Yuan2*, QIU Bo1*, ZHANG Jian-nan3, YAN Jing1, LI Rong1
1. Hebei University of Technology, Tianjin 300400,China
2. Guangdong Baiyun University, Guangzhou 510450,China
3. National Astronomical Observatories, Chinese Academy of Sciences, Beijing 100012, China
Abstract:Identifying celestial spectra is essential for making new astronomical discoveries and conducting detailed studies of celestial objects. The LAMOST DR8 v1.0 release of low-resolution spectral data contains approximately 530 000 spectra named “Unknown”. The reason is that they have no category labels. And 88.56% of these spectra have signal-to-noise ratios between 0 and 10. Therefore, the effective output of LAMOST will increase if we analyze these spectra. In this paper, we propose an ODS-YOLOv7 model to deal with the problem of the “Unknown” spectral classification. It is an end-to-end category prediction model and is suitable for one-dimensional spectra. We also add a one-dimensional convolutional attention module to improve the accuracy of spectra recognition. After training on a set of known category spectra with signal-to-noise ratios between 0 and 10, the ODS-YOLOv7 model can learn the effective features of the low signal-to-noise spectra. Thus, it can enable us to predict “Unknown” spectra. Experiments show that the model has an F1-score of 0.98, 0.95, and 0.95 for the spectral identification of low signal-to-noise stars, galaxies, and quasars spectra with known labels. In the meantime, ODS-YOLOv7 obtains the best results in comparison experiments with traditional algorithms KNN, RF, DT, SVM, and deep learning algorithms 1D CNN, 1DSSCNN, ResNet, DenseNet, and VIT. The experimental results also give confidence in the predictions of the ODS-YOLOv7 model for the “Unknown” spectra in DR8 v1.0, with 92% of the confidence levels above 60%. To ensure the quality of the model output, only spectral categories with a prediction confidence level greater than 99% are selected as output in this paper. Ultimately, 37.19% and 47.03% of the “Unknown” spectra released in DR8 v1.0 and DR9 v0, respectively, are predicted by this model.In addition, the paper tests the accuracy of the model's predictions using manual authentication. To improve the interpretability of the model,the paper takes the Grad-CAM method for two-dimensional image visualisation. It improves it into an algorithm suitable for visualising one-dimensional spectral data to predict output features. Experiments show that the model focuses on different features in the visualisation of different classes of astronomical features and that the model is good at predicting low signal-to-noise “unknown” spectral classes.
王晓敏,高军萍,蒲 源,邱 波,张健楠,闫 静,李 荣. LAMOST的“Unknown”光谱分类研究:ODS-YOLOv7模型[J]. 光谱学与光谱分析, 2024, 44(07): 1960-1967.
WANG Xiao-min, GAO Jun-ping, PU Yuan, QIU Bo, ZHANG Jian-nan, YAN Jing, LI Rong. The “Unknown” Spectral Classification Study of LAMOST:
ODS-YOLOv7 Model. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 1960-1967.
[1] Corral L J, Fierro-Santillán C R. arXiv preprint arXiv: 2105. 07110, 2021.
[2] Wang L L, Shen S Y, Luo A L, et al. The Astrophysical Journal Supplement Series, 2022, 258(1): 9.
[3] Weaver W B, Torres-Dodgen A V. The Astrophysical Journal, 1995, 446: 300.
[4] Sharma K, Kembhavi A, Kembhavi A, et al. Monthly Notices of the Royal Astronomical Society, 2020, 491(2): 2280.
[5] Wen Xiaoqing, Yang Jinmeng. Chinese Journal of Physics, 2021, 69: 303.
[6] HE Dong-yuan, LIU Wei, CAO Shuo, et al(何东远, 刘 伟, 曹 硕, 等). Journal of Beijing Normal University(北京师范大学学报), 2020, 56(1): 37.
[7] WANG Qi-xun, ZHAO Gang, FAN Zhou(王奇勋, 赵 刚, 范 舟). Astronomical Research and Technology(天文研究与技术), 2020, 17(1): 85.
[8] HONG Shu-xin, ZOU Zhi-qiang, XU Ling-zhe(洪舒欣, 邹志强, 徐灵哲). Acta Astronomica Sinica(天文学报), 2021, 62(5): 48.
[9] Zheng Z P, Qiu B, Luo A L, et al. Publications of the Astronomical Society of the Pacific, 2020, 132(1008): 024504.
[10] Guo Y X, Luo A L, Zhang S, et al. Monthly Notices of the Royal Astronomical Society, 2019, 485(2): 2167.
[11] YANG Yu-qing, CAI Jiang-hui, YANG Hai-feng, et al(杨雨晴, 蔡江辉, 杨海峰, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2022, 42(4): 1186.
[12] Li X R, Lin Y T, Qiu K B. Research in Astronomy and Astrophysics, 2019, 19(8): 111.
[13] He K, Zhang X, Ren S, et al. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016: 770.
[14] Dosovitskiy A, Beyer L, Kolesnikov A, et al. arXiv preprint arXiv: 2010. 11929, 2020.
[15] Woo S, Park J, Lee J Y, et al. Proceedings of the European Conference on Computer Vision (ECCV). 2018: 3.
[16] Selvaraju R R, Cogswell M, Das A, et al. Proceedings of the IEEE International Conference on Computer Vision,2017: 618.