Distinguishing the Rare Spectra with the Unbalanced Classification Method Based on Mutual Information
LIU Zhong-bao1*, REN Juan-juan2, KONG Xiao3
1. School of Computer and Control Engineering, North University of China, Taiyuan 030051, China 2. Key Laboratory of Optical Astronomy, NAOC, Chinese Academy of Sciences, Beijing 100012, China 3. National Astronomical Observatories, Chinese Academy of Sciences, Beijing 100012, China
Abstract:Distinguishing the rare spectra from the majority of stellar spectra is one of quite important issues in astronomy. As the size of the rare spectra is much smaller than the majority of the spectra, many traditional classifiers can’t work effectively because they only focus on the classification accuracy and have not paid enough attentions on the rare spectra. In view of this, the relationship between the decision tree and mutual information is discussed on the basis of summarizing the traditional classifiers, and the cost-free decision tree based on mutual information is proposed in this paper to improve the performance of distinguishing the rare spectra. In the experiment, we investigate the performance of the proposed method on the K-type, F-type, G-type, M-type datasets from Sloan Digital Sky Survey (SDSS), Data Release 8. It can be concluded that the proposed method can complete the rare spectra distinguishing task compared with several traditional classifiers.
Key words:Unbalanced classification;Mutual information;Rare spectra;Decision tree
刘忠宝1*,任娟娟2,孔 啸3 . 利用基于互信息的不平衡分类方法识别稀有光谱 [J]. 光谱学与光谱分析, 2016, 36(11): 3746-3751.
LIU Zhong-bao1*, REN Juan-juan2, KONG Xiao3 . Distinguishing the Rare Spectra with the Unbalanced Classification Method Based on Mutual Information. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2016, 36(11): 3746-3751.
[1] Stark P B, Herron M M, Matteson A. Applied Spectroscopy, 1993, 47(11):1820. [2] Gulati R K, Gupta R, Gothoskar P. AJ, 1994, 426(1):340. [3] Bailer-Jones C A L, Irwin M, Hippel T. MNRAS, 1998, 298:361. [4] Hernandez R D, Barreto H P, Robles L A, et al. Experimental Astronomy, 2014, 38:193. [5] Wei P, Luo A L, Li Y B, et al. MNRAS, 2013, 431:1800. [6] Cai J H, Zhao X J, Sun S W, et al. RAA, 2013, 13(3):334. [7] Gray R O, Corbally C J. AJ, 2014, 147(80):1. [8] Bu Y D, Pan J C, Jiang B, et al. PASJ, 2013, 65:81. [9] Bu Y D, Chen F Q, Pan J C. New Astronomy, 2014, 28:35. [10] Bazarghan M. Astrophysics and Space Science, 2012, 337:93. [11] Dan D M, Hu Z Y, Zhao Y H. Spectroscopy and Specctral Analysis, 2003, 23(1):182. [12] Yang J F, Wu C F, Luo A L, et al. Pattern Recognition and Artificial Intelligence, 2006, 19(3):368. [13] Sun S W, Luo A L, Zhang J F. Astronomical Research and Technology, 2007, 4(3):276. [14] Liu R, Jin H M, Duan F Q. Spectroscopy and Specctral Analysis, 2010, 30(3):838. [15] Liu Z B, Pan G Z, Zhao W J. Journal of Electronics & Information Technology, 2013, 35(9):2047. [16] Liu Z B, Wang Z B, Zhao W J. Spectroscopy and Spectral Analysis, 2014, 34(1):263. [17] Liu Z B, Gao Y Y, Wang J Z. Spectroscopy and Spectral Analysis, 2015, 35(1):263. [18] Almeida J S, Prieto C A. AJ, 2013, 763: 1.