Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm
BAO Hao1, 2,ZHANG Yan1, 2*
1. College of Big Data and Information Engineering, Guizhou University, Guiyang 550025, China
2. Engineering Research Centre for Non-Destructive Testing of Agricultural Products, Guiyang University, Guiyang 550005, China
Abstract:As one of the primary steps in NIR spectral analysis, effective feature band selection can improve modelling efficiency and model performance. Traditional Characteristic band selection algorithms suffer from long run times and redundant feature selection, making achieving the desired results in practical engineering applications difficult. The Harris Hawk Optimisation (HHO) algorithm has the advantages of simple principles and few parameters, but it also has the shortcomings of low convergence accuracy and easy to fall into local optimum. In this paper, we propose an NIR spectral feature band selection model based on the Improved Harris Hawk Optimisation (IHHO) algorithm based on the HHO algorithm. For the HHO algorithm can only be used to solve optimization problems in continuous space, a discretization strategy is used to modify the HHO algorithm so that it can solve the discrete form of the characteristic waveform selection problem. Considering the poor quality of the initial population of the HHO algorithm, the quality of the initial population is improved using chaotic mapping and opposition-based learning to enhance the global exploration capability of the algorithm; Due to the low convergence accuracy of the HHO algorithm in local search, a new prey energy decay model and jumping strategy are proposed further to enhance the algorithm's search capability in local search. The HHO algorithm is perturbed by borrowing the variational approach of genetic algorithm. Support vector machine (SVM) identification models and partial least squares regression (PLSR) models were developed using competitive adaptive reweighted sampling (CARS), successive projections algorithm (SPA), particle swarm optimization (PSO) algorithms, genetic algorithms (GA), HHO algorithms compared to IHHO algorithms, and four qualitative analysis NIR spectral datasets and two quantitative analysis NIR spectral datasets, respectively. In the qualitative analysis experiments, the average accuracy obtained by the IHHO algorithm improved by 0.83%, 9.55%, 17.65%, and 0%, respectively, concerning the full band, and the average number of characteristic bands was only 9.97%, 2.59%, 1.36%, and 0.59% of the full band. In the quantitative analysis experiments, the average coefficient of determination obtained by the IHHO algorithm was 10.57%, 1.47%, 4.41%, 3.66% and 3.06% higher than the full band, and the average root mean square error was 0.162, 1.266 3, 1.868, 1.869 4 and 0.408 4 lower than the full band, and the average number of characteristic bands was only 9.24%, 10.53% and 0% of the full band. The average number of characteristic bands was only 9.24%, 10.53%, 6.54%, 6.91% and 7.14% of the full band. The experimental results show that the IHHO algorithm can remove redundancy in the selection of feature bands and target the most important ones, and its performance is better than several other selection algorithms. Therefore, the IHHO algorithm has good application prospects.
Key words:Near infrared spectroscopy; Feature band selection; Harris hawk optimization algorithm; Support vector machine; Partial least squares regression
鲍 浩,张 艳. 基于改进哈里斯鹰优化算法的光谱特征波段选择模型研究[J]. 光谱学与光谱分析, 2024, 44(01): 148-157.
BAO Hao,ZHANG Yan. Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 148-157.
[1] CHU Xiao-li, SHI Yun-ying, CHEN Pu, et al(褚小立,史云颖,陈 瀑,等). Journal of Analytical Testing(分析测试学报), 2019, 38(5): 603.
[2] LI Ying, MA Yu-chen, LIU Meng, et al(李 颖,马雨辰,刘 萌,等). Journal of Food Safety & Quality(食品安全质量检测学报), 2022, 13(12): 3923.
[3] Peng Guo, Ting Li, Han Gao, et al. Remote Sensing, 2021, 13(19): 4000.
[4] YU Hui-chun, FU Xiao-ya, YIN Yong, et al(于慧春,付晓雅,殷 勇,等). Journal of Nuclear Agricultural Sciences(核农学报), 2020, 34(3): 582.
[5] Yu X, Ye X, Zhang S. Digital Signal Processing, 2022, 123: 103442.
[6] Yu X, Tian X. International Journal of Pressure Vessels and Piping, 2022, 196: 104611.
[7] Ferahtia S, Rezk H, Abdelkareem M A, et al. Applied Energy, 2022, 306: 118069.
[8] WANG Wen-xia, MA Ben-xue, LUO Xiu-zhi, et al(王文霞,马本学,罗秀芝,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2020, 40(2): 543.
[9] TAO Huan-ming, GAO Mei-feng(陶焕明,高美凤). Journal of Instrumental Analysis(分析测试学报), 2021, 40(10): 1482.
[10] WU Xin-yan, BIAN Xi-hui, YANG Sheng, et al(武新燕,卞希慧,杨 盛,等). Journal of Instrumental Analysis(分析测试学报), 2020, 39(10): 1288.
[11] Ali Asghar Heidari, Seyedali Mirjalili, Hossam Faris, et al. Future Generation Computer Systems, 2019, 97: 849.
[12] XIE Yuan, GAO Wei, WANG Yi-wei, et al(谢 渊,高 玮,汪义伟,等). China Civil Engineering Journal(土木工程学报), 2022, 55(4): 33.
[13] Zenab Mohamed Elgamal, Norizan Binti Mohd Yasin, Mohammad Tubishat, et al. IEEE Access, 2020, 8: 186638.
[14] Bao Xiaoli, Jia Heming, Lang Chunbo. IEEE Access, 2019, 7: 76529.
[15] TENG Zhi-jun, LÜ Jin-ling, GUO Li-wen, et al(滕志军,吕金玲,郭力文,等). Journal of Harbin Institute of Technology(哈尔滨工业大学学报), 2018, 50(11): 40.