|
|
|
|
|
|
Support Vector Machine Optimized by Near-Infrared Spectroscopic
Technique Combined With Grey Wolf Optimizer Algorithm to
Realize Rapid Identification of Tobacco Origin |
GENG Ying-rui1, SHEN Huan-chao1, NI Hong-fei2, CHEN Yong1, LIU Xue-song1* |
1. College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310030, China
2. Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Hangzhou 310018, China
|
|
|
Abstract Tobacco is a natural plant with complex compositions, the quality of tobacco leaves is directly affected by several external factors such as geographic location and growth conditions. Tobacco leaves are widely planted in China, and they cultivated in different areas, they have different styles. Different blended ratios play a decisive role in the quality of cigarettes. Thus, there is an emerging need for accurate and rapid identification of the origin of tobacco leaves. Near-infrared spectroscopy technology provides a new rapid, and convenient method to automatically evaluate tobacco areas. On this basis, we proposed the grey wolf optimizer (GWO) algorithm to optimize the performance of the support vector machine model (SVM) for the first time to identify and classify tobacco leaves from different origins. This study was conducted with 824 tobacco leaf samples from eight different origins, and 617 training set samples and 207 test set samples were obtained using Set partitioning based on joint x-y distance (SPXY). The wavelength selection methods such as Competitive adaptive reweighted sampling (CARS) and Random frog (RF) algorithms were applied to reduce spectral redundant information and screen the characteristic wavelengths in the -full spectrum of the samples, and 141 and 534 were selected from all 1 609 variables, respectively. Then they were used as the input parameters of the SVM classifier. The optimization effect of GWO on the SVM model was contrasted to the Particle swarm optimization (PSO) and Genetic algorithm (GA) optimization in the same search range. The analysis showed that the spectral variables screened by RF had a better modeling performance than CARS. Among them, the RF-GWO-SVM model achieved the best predictive performance with an accuracy of 96.62% in identifying tobacco leaves from 8 producing areas. More than that, the running time of RF-GWO-SVM was 156 and 131 min shorter than RF-PSO-SVM and RF-GA-SVM, respectively. To sum up, RF-GWO-SVM has the advantages of higher accuracy and faster convergence speed. It can be seen that GWO has a more efficient optimization capability for model parameters, and the support vector machine model optimized by GWO can be used for rapid identification of tobacco origin.
|
Received: 2021-08-17
Accepted: 2021-11-16
|
|
Corresponding Authors:
LIU Xue-song
E-mail: liuxuesong@zju.edu.cn
|
|
[1] Zimmer G F, Santos R O, Teixeira I D, et al. Journal of Chemometrics, 2020, 34(12): e3303.
[2] Wu Lijun, Wang Baoxing, Zhang Lei, et al. Journal of Near Infrared Spectroscopy, 2020, 28(3): 153.
[3] Xiang Boka, Cheng Changhe, Xia Jun, et al. Vibrational Spectroscopy, 2020, 111: 103182.
[4] Li Ruidong, Zhang Xiaobing, Li Keqiang, et al. Spectroscopy Letters, 2020, 53(9): 685.
[5] Gu Li, Xue Lichun, Song Qi, et al. Journal of Bioinformatics and Computational Biology, 2016, 14(6): 1650033.
[6] Deng Jun, Chen Weile, Liang Ce, et al. Journal of Loss Prevention in the Process Industries, 2021, 71: 104439.
[7] Subudhi U, Dash S. Journal of Industrial Information Integration, 2021, 22: 100204.
[8] LI Qing-bo, BI Zhi-qi, SHI Dong-dong(李庆波, 毕智棋, 石冬冬). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2020, 40(9): 2804.
[9] Li Tao, Su Chen. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2018, 204: 131.
[10] Ren Guangxin, Liu Ying, Ning Jingming, et al. Journal of Food Composition and Analysis, 2021, 98: 103810.
[11] Tharwat A, Schenck W. Expert Systems with Applications, 2020, 167(5): 114430.
[12] Zhang Jiawei, Tittel F K, Gong Longwen, et al. Environmental Modeling & Assessment, 2016. 21(4): 531.
[13] Mirjalili S, Mirjalili S M, Lewis A. Advances in Engineering Software, 2014, 69: 46.
[14] Zhang Lin, Sun Jun, Zhou Xin, et al. Journal of Food Processing and Preservation, 2020, 44(8): e14591.
[15] Zhang Dongyan, Yang Yi, Chen Gao, et al. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2020, 248: 119139.
|
[1] |
BAO Hao1, 2,ZHANG Yan1, 2*. Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 148-157. |
[2] |
CHENG Hui-zhu1, 2, YANG Wan-qi1, 2, LI Fu-sheng1, 2*, MA Qian1, 2, ZHAO Yan-chun1, 2. Genetic Algorithm Optimized BP Neural Network for Quantitative
Analysis of Soil Heavy Metals in XRF[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3742-3746. |
[3] |
SHEN Si-cong, ZHANG Jing-xue, CHEN Ming-hui, LI Zhi-wei, SUN Sheng-nan, YAN Xue-bing*. Estimation of Above-Ground Biomass and Chlorophyll Content of
Different Alfalfa Varieties Based on UAV Multi-Spectrum[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3847-3852. |
[4] |
BAI Xue-bing1, 2, SONG Chang-ze1, ZHANG Qian-wei1, DAI Bin-xiu1, JIN Guo-jie1, 2, LIU Wen-zheng1, TAO Yong-sheng1, 2*. Rapid and Nndestructive Dagnosis Mthod for Posphate Dficiency in “Cabernet Sauvignon” Gape Laves by Vis/NIR Sectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3719-3725. |
[5] |
HUANG Zhao-di1, CHEN Zai-liang2, WANG Chen3, TIAN Peng2, ZHANG Hai-liang2, XIE Chao-yong2*, LIU Xue-mei4*. Comparing Different Multivariate Calibration Methods Analyses for Measurement of Soil Properties Using Visible and Short Wave-Near
Infrared Spectroscopy Combined With Machine Learning Algorithms[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3535-3540. |
[6] |
KANG Ming-yue1, 3, WANG Cheng1, SUN Hong-yan3, LI Zuo-lin2, LUO Bin1*. Research on Internal Quality Detection Method of Cherry Tomatoes Based on Improved WOA-LSSVM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3541-3550. |
[7] |
LI Wen-wen1, 2, LONG Chang-jiang1, 2, 4*, LI Shan-jun1, 2, 3, 4, CHEN Hong1, 2, 4. Detection of Mixed Pesticide Residues of Prochloraz and Imazalil in
Citrus Epidermis by Surface Enhanced Raman Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3052-3058. |
[8] |
GUO Ge1, 3, 4, ZHANG Meng-ling3, 4, GONG Zhi-jie3, 4, ZHANG Shi-zhuang3, 4, WANG Xiao-yu2, 5, 6*, ZHOU Zhong-hua1*, YANG Yu2, 5, 6, XIE Guang-hui3, 4. Construction of Biomass Ash Content Model Based on Near-Infrared
Spectroscopy and Complex Sample Set Partitioning[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3143-3149. |
[9] |
LIU Fei1, TAN Jia-jin1*, XIE Gu-ai2, SU Jun3, YE Jian-ren1. Early Diagnosis of Pine Wilt Disease Based on Hyperspectral Data and Needle Resistivity[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3280-3285. |
[10] |
MA Qian1, 2, YANG Wan-qi1, 2, LI Fu-sheng1, 2*, CHENG Hui-zhu1, 2, ZHAO Yan-chun1, 2. Research on Classification of Heavy Metal Pb in Honeysuckle Based on XRF and Transfer Learning[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2729-2733. |
[11] |
LÜ Shi-lei1, 2, 3, WANG Hong-wei1, LI Zhen1, 2, 3*, ZHOU Xu1, ZHAO Jing1. Hyperspectral Identification Model of Cantonese Tangerine Peel Based on BWO-SVM Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2894-2901. |
[12] |
WANG Jun-jie1, YUAN Xi-ping2, 3, GAN Shu1, 2*, HU Lin1, ZHAO Hai-long1. Hyperspectral Identification Method of Typical Sedimentary Rocks in Lufeng Dinosaur Valley[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2855-2861. |
[13] |
JIN Cheng-liang1, WANG Yong-jun2*, HUANG He2, LIU Jun-min3. Application of High-Dimensional Infrared Spectral Data Preprocessing in the Origin Identification of Traditional Chinese Medicinal Materials[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(07): 2238-2245. |
[14] |
ZHANG Hai-liang1, XIE Chao-yong1, TIAN Peng1, ZHAN Bai-shao1, CHEN Zai-liang1, LUO Wei1*, LIU Xue-mei2*. Measurement of Soil Organic Matter and Total Nitrogen Based on Visible/Near Infrared Spectroscopy and Data-Driven Machine Learning Method[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(07): 2226-2231. |
[15] |
LI Hao-dong1, 2, LI Ju-zi1*, CHEN Yan-lin1, HUANG Yu-jing1, Andy Hsitien Shen1*. Establishing Support Vector Machine SVM Recognition Model to Identify Jadeite Origin[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(07): 2252-2257. |
|
|
|
|