|
|
|
|
|
|
Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm |
BAO Hao1, 2,ZHANG Yan1, 2* |
1. College of Big Data and Information Engineering, Guizhou University, Guiyang 550025, China
2. Engineering Research Centre for Non-Destructive Testing of Agricultural Products, Guiyang University, Guiyang 550005, China
|
|
|
Abstract As one of the primary steps in NIR spectral analysis, effective feature band selection can improve modelling efficiency and model performance. Traditional Characteristic band selection algorithms suffer from long run times and redundant feature selection, making achieving the desired results in practical engineering applications difficult. The Harris Hawk Optimisation (HHO) algorithm has the advantages of simple principles and few parameters, but it also has the shortcomings of low convergence accuracy and easy to fall into local optimum. In this paper, we propose an NIR spectral feature band selection model based on the Improved Harris Hawk Optimisation (IHHO) algorithm based on the HHO algorithm. For the HHO algorithm can only be used to solve optimization problems in continuous space, a discretization strategy is used to modify the HHO algorithm so that it can solve the discrete form of the characteristic waveform selection problem. Considering the poor quality of the initial population of the HHO algorithm, the quality of the initial population is improved using chaotic mapping and opposition-based learning to enhance the global exploration capability of the algorithm; Due to the low convergence accuracy of the HHO algorithm in local search, a new prey energy decay model and jumping strategy are proposed further to enhance the algorithm's search capability in local search. The HHO algorithm is perturbed by borrowing the variational approach of genetic algorithm. Support vector machine (SVM) identification models and partial least squares regression (PLSR) models were developed using competitive adaptive reweighted sampling (CARS), successive projections algorithm (SPA), particle swarm optimization (PSO) algorithms, genetic algorithms (GA), HHO algorithms compared to IHHO algorithms, and four qualitative analysis NIR spectral datasets and two quantitative analysis NIR spectral datasets, respectively. In the qualitative analysis experiments, the average accuracy obtained by the IHHO algorithm improved by 0.83%, 9.55%, 17.65%, and 0%, respectively, concerning the full band, and the average number of characteristic bands was only 9.97%, 2.59%, 1.36%, and 0.59% of the full band. In the quantitative analysis experiments, the average coefficient of determination obtained by the IHHO algorithm was 10.57%, 1.47%, 4.41%, 3.66% and 3.06% higher than the full band, and the average root mean square error was 0.162, 1.266 3, 1.868, 1.869 4 and 0.408 4 lower than the full band, and the average number of characteristic bands was only 9.24%, 10.53% and 0% of the full band. The average number of characteristic bands was only 9.24%, 10.53%, 6.54%, 6.91% and 7.14% of the full band. The experimental results show that the IHHO algorithm can remove redundancy in the selection of feature bands and target the most important ones, and its performance is better than several other selection algorithms. Therefore, the IHHO algorithm has good application prospects.
|
Received: 2022-06-02
Accepted: 2022-11-21
|
|
Corresponding Authors:
ZHANG Yan
E-mail: Eileen_zy001@sohu.com
|
|
[1] CHU Xiao-li, SHI Yun-ying, CHEN Pu, et al(褚小立,史云颖,陈 瀑,等). Journal of Analytical Testing(分析测试学报), 2019, 38(5): 603.
[2] LI Ying, MA Yu-chen, LIU Meng, et al(李 颖,马雨辰,刘 萌,等). Journal of Food Safety & Quality(食品安全质量检测学报), 2022, 13(12): 3923.
[3] Peng Guo, Ting Li, Han Gao, et al. Remote Sensing, 2021, 13(19): 4000.
[4] YU Hui-chun, FU Xiao-ya, YIN Yong, et al(于慧春,付晓雅,殷 勇,等). Journal of Nuclear Agricultural Sciences(核农学报), 2020, 34(3): 582.
[5] Yu X, Ye X, Zhang S. Digital Signal Processing, 2022, 123: 103442.
[6] Yu X, Tian X. International Journal of Pressure Vessels and Piping, 2022, 196: 104611.
[7] Ferahtia S, Rezk H, Abdelkareem M A, et al. Applied Energy, 2022, 306: 118069.
[8] WANG Wen-xia, MA Ben-xue, LUO Xiu-zhi, et al(王文霞,马本学,罗秀芝,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2020, 40(2): 543.
[9] TAO Huan-ming, GAO Mei-feng(陶焕明,高美凤). Journal of Instrumental Analysis(分析测试学报), 2021, 40(10): 1482.
[10] WU Xin-yan, BIAN Xi-hui, YANG Sheng, et al(武新燕,卞希慧,杨 盛,等). Journal of Instrumental Analysis(分析测试学报), 2020, 39(10): 1288.
[11] Ali Asghar Heidari, Seyedali Mirjalili, Hossam Faris, et al. Future Generation Computer Systems, 2019, 97: 849.
[12] XIE Yuan, GAO Wei, WANG Yi-wei, et al(谢 渊,高 玮,汪义伟,等). China Civil Engineering Journal(土木工程学报), 2022, 55(4): 33.
[13] Zenab Mohamed Elgamal, Norizan Binti Mohd Yasin, Mohammad Tubishat, et al. IEEE Access, 2020, 8: 186638.
[14] Bao Xiaoli, Jia Heming, Lang Chunbo. IEEE Access, 2019, 7: 76529.
[15] TENG Zhi-jun, LÜ Jin-ling, GUO Li-wen, et al(滕志军,吕金玲,郭力文,等). Journal of Harbin Institute of Technology(哈尔滨工业大学学报), 2018, 50(11): 40. |
[1] |
CHENG Hui-zhu1, 2, YANG Wan-qi1, 2, LI Fu-sheng1, 2*, MA Qian1, 2, ZHAO Yan-chun1, 2. Genetic Algorithm Optimized BP Neural Network for Quantitative
Analysis of Soil Heavy Metals in XRF[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3742-3746. |
[2] |
SHEN Si-cong, ZHANG Jing-xue, CHEN Ming-hui, LI Zhi-wei, SUN Sheng-nan, YAN Xue-bing*. Estimation of Above-Ground Biomass and Chlorophyll Content of
Different Alfalfa Varieties Based on UAV Multi-Spectrum[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3847-3852. |
[3] |
BAI Xue-bing1, 2, SONG Chang-ze1, ZHANG Qian-wei1, DAI Bin-xiu1, JIN Guo-jie1, 2, LIU Wen-zheng1, TAO Yong-sheng1, 2*. Rapid and Nndestructive Dagnosis Mthod for Posphate Dficiency in “Cabernet Sauvignon” Gape Laves by Vis/NIR Sectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3719-3725. |
[4] |
HUANG Zhao-di1, CHEN Zai-liang2, WANG Chen3, TIAN Peng2, ZHANG Hai-liang2, XIE Chao-yong2*, LIU Xue-mei4*. Comparing Different Multivariate Calibration Methods Analyses for Measurement of Soil Properties Using Visible and Short Wave-Near
Infrared Spectroscopy Combined With Machine Learning Algorithms[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3535-3540. |
[5] |
LI Wen-wen1, 2, LONG Chang-jiang1, 2, 4*, LI Shan-jun1, 2, 3, 4, CHEN Hong1, 2, 4. Detection of Mixed Pesticide Residues of Prochloraz and Imazalil in
Citrus Epidermis by Surface Enhanced Raman Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3052-3058. |
[6] |
CHEN Jia-wei1, 2, ZHOU De-qiang1, 2*, CUI Chen-hao3, REN Zhi-jun1, ZUO Wen-juan1. Prediction Model of Farinograph Characteristics of Wheat Flour Based on Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3089-3097. |
[7] |
LIU Fei1, TAN Jia-jin1*, XIE Gu-ai2, SU Jun3, YE Jian-ren1. Early Diagnosis of Pine Wilt Disease Based on Hyperspectral Data and Needle Resistivity[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3280-3285. |
[8] |
MA Qian1, 2, YANG Wan-qi1, 2, LI Fu-sheng1, 2*, CHENG Hui-zhu1, 2, ZHAO Yan-chun1, 2. Research on Classification of Heavy Metal Pb in Honeysuckle Based on XRF and Transfer Learning[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2729-2733. |
[9] |
WU Yong-qing1, 2, TANG Na1, HUANG Lu-yao1, CUI Yu-tong1, ZHANG Bo1, GUO Bo-li1, ZHANG Ying-quan1*. Model Construction for Detecting Water Absorption in Wheat Flour Using Vis-NIR Spectroscopy and Combined With Multivariate Statistical #br#
Analyses[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2825-2831. |
[10] |
LÜ Shi-lei1, 2, 3, WANG Hong-wei1, LI Zhen1, 2, 3*, ZHOU Xu1, ZHAO Jing1. Hyperspectral Identification Model of Cantonese Tangerine Peel Based on BWO-SVM Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2894-2901. |
[11] |
WANG Jun-jie1, YUAN Xi-ping2, 3, GAN Shu1, 2*, HU Lin1, ZHAO Hai-long1. Hyperspectral Identification Method of Typical Sedimentary Rocks in Lufeng Dinosaur Valley[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2855-2861. |
[12] |
LIU Rui-min, YIN Yong*, YU Hui-chun, YUAN Yun-xia. Extraction of 3D Fluorescence Feature Information Based on Multivariate Statistical Analysis Coupled With Wavelet Packet Energy for Monitoring Quality Change of Cucumber During Storage[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2967-2973. |
[13] |
ZHANG Hai-liang1, XIE Chao-yong1, TIAN Peng1, ZHAN Bai-shao1, CHEN Zai-liang1, LUO Wei1*, LIU Xue-mei2*. Measurement of Soil Organic Matter and Total Nitrogen Based on Visible/Near Infrared Spectroscopy and Data-Driven Machine Learning Method[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(07): 2226-2231. |
[14] |
LI Hao-dong1, 2, LI Ju-zi1*, CHEN Yan-lin1, HUANG Yu-jing1, Andy Hsitien Shen1*. Establishing Support Vector Machine SVM Recognition Model to Identify Jadeite Origin[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(07): 2252-2257. |
[15] |
LI Bin, HAN Zhao-yang, WANG Qiu, SUN Zhao-xiang, LIU Yan-de*. Research on Bruise Level Detection of Loquat Based on Hyperspectral
Imaging Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(06): 1792-1799. |
|
|
|
|