Spectral Binary Star Analysis Based on Rough Set and Cluster Voting Mechanism
WANG Qi1, YANG Hai-feng2*, CAI Jiang-hui3*
1. School of Mathematics and Information Technology, Yuncheng University, Yuncheng 044000, China
2. School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030024, China
3. School of Computer Science and Technology, North University of China, Taiyuan 030051, China
Abstract: A spectral binary star usually refers to a spectrum that shows the characteristics of two dominant components. Because the two components are complex and diverse, the formation of such spectra is complicated, and their signal-to-noise ratio (SNR) is relatively low. Many existing analytical methods separate a two-component spectrum into two spectra, but the separation cannot guarantee the accuracy of the resulting spectra, and the reliability of a single clustering run with existing clustering methods is relatively low. This paper proposes a binary star spectrum analysis and evaluation method based on a rough set and cluster voting mechanism. Using the idea of multiple clustering and voting, it assigns each spectrum a graded reliability of belonging to its category. The method consists of two parts. First, the spectral binary star data set is reconstructed: clustering algorithms based on different principles are applied, their labels are aligned with the Hungarian algorithm, and each algorithm's aligned label is used as an attribute of the spectrum. Second, the voting mechanism is used to reflect the consistency of the clustering results and to give the category of each spectrum. At the same time, rough sets are defined to trace the characteristics of each spectrum, and the reliability of the classification of each spectrum is given by the upper and lower approximation sets. The binary star spectral set published in LAMOST DR10 was selected as the analysis object. Four clustering algorithms, partition-based K-means, the model-based Gaussian mixture model (GMM), Spectral clustering, and Agglomerative clustering, were used to reconstruct the spectral data set. With the lower bound of votes set to 2, voting yields clustering results with reliability levels of 1, 0.75, and 0.5. About 1/3 of the samples have a reliability of 1, indicating that the four clustering results for these samples are completely consistent.
The SNR of each spectrum and the number of votes are statistically analyzed. Samples with a low number of votes have relatively low SNR, which is one of the reasons why different clustering algorithms assign them to different categories. We analyzed the physical origin of 6 spectral samples with a reliability of 1, among which binary stars, Galactic nebulae, and target stars were the main types. The difference in clustering labels may be caused by the difference in the flux of the two components or by data processing steps such as splicing and calibration. In addition, low spectral quality may lead to pipeline misjudgment, and the sky distribution of these samples is consistent with research on the distribution characteristics of low-quality data.
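The align, vote, and approximate steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The label lists are made-up toy data, not LAMOST results, and the function names `align_labels`, `vote`, and `approximations` are hypothetical. Label alignment is done here by exhaustive search over permutations, which finds the same optimal matching as the Hungarian algorithm when the number of clusters is small. The lower and upper approximations use one assumed reading of the rough set idea: a sample is in the lower approximation of a category if every run assigns it there, and in the upper approximation if at least one run does.

```python
from collections import Counter
from itertools import permutations


def align_labels(reference, labels, n_clusters):
    """Relabel `labels` to maximise agreement with `reference`.

    Exhaustive permutation search; a stand-in for the Hungarian
    algorithm, which solves the same matching problem at scale.
    """
    best_perm, best_agree = None, -1
    for perm in permutations(range(n_clusters)):
        agree = sum(perm[l] == r for l, r in zip(labels, reference))
        if agree > best_agree:
            best_perm, best_agree = perm, agree
    return [best_perm[l] for l in labels]


def vote(aligned_runs, min_votes=2):
    """Return (category, reliability) for each sample.

    Reliability is the share of runs voting for the winning label.
    The category is None when the winner has fewer than `min_votes`.
    Ties are not treated specially in this sketch (an assumption).
    """
    n_runs = len(aligned_runs)
    results = []
    for sample_labels in zip(*aligned_runs):
        label, votes = Counter(sample_labels).most_common(1)[0]
        category = label if votes >= min_votes else None
        results.append((category, votes / n_runs))
    return results


def approximations(aligned_runs, category):
    """Assumed definition: lower = all runs agree, upper = any run agrees."""
    lower, upper = set(), set()
    for i, sample_labels in enumerate(zip(*aligned_runs)):
        if all(l == category for l in sample_labels):
            lower.add(i)
        if any(l == category for l in sample_labels):
            upper.add(i)
    return lower, upper


# Toy labels for 8 spectra from four clustering runs (illustrative only).
runs = {
    "kmeans":        [0, 0, 0, 1, 1, 1, 2, 2],
    "gmm":           [1, 1, 1, 2, 2, 2, 0, 0],
    "spectral":      [2, 2, 2, 0, 0, 1, 1, 1],
    "agglomerative": [1, 1, 0, 0, 0, 1, 2, 2],
}
reference = runs["kmeans"]
aligned = [reference] + [
    align_labels(reference, runs[name], n_clusters=3)
    for name in ("gmm", "spectral", "agglomerative")
]
results = vote(aligned, min_votes=2)
lower, upper = approximations(aligned, category=1)

for i, (category, reliability) in enumerate(results):
    print(f"spectrum {i}: category={category}, reliability={reliability}")
print("category 1 lower approximation:", sorted(lower))
print("category 1 upper approximation:", sorted(upper))
```

With four runs, the reliability can only take the values 1, 0.75, 0.5, and 0.25, and a lower bound of 2 votes excludes the last, which matches the three reliability levels reported in the abstract. For many clusters, `scipy.optimize.linear_sum_assignment` applied to the negated contingency matrix would be the usual replacement for the permutation search.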
WANG Qi, YANG Hai-feng, CAI Jiang-hui. Spectral Binary Star Analysis Based on Rough Set and Cluster Voting Mechanism[J]. Spectroscopy and Spectral Analysis, 2025, 45(02): 463-468.