|
|
|
|
|
|
Support Vector Machine Optimized by Near-Infrared Spectroscopic
Technique Combined With Grey Wolf Optimizer Algorithm to
Realize Rapid Identification of Tobacco Origin |
GENG Ying-rui1, SHEN Huan-chao1, NI Hong-fei2, CHEN Yong1, LIU Xue-song1* |
1. College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310030, China
2. Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Hangzhou 310018, China
|
|
|
Abstract Tobacco is a natural plant with complex compositions, the quality of tobacco leaves is directly affected by several external factors such as geographic location and growth conditions. Tobacco leaves are widely planted in China, and they cultivated in different areas, they have different styles. Different blended ratios play a decisive role in the quality of cigarettes. Thus, there is an emerging need for accurate and rapid identification of the origin of tobacco leaves. Near-infrared spectroscopy technology provides a new rapid, and convenient method to automatically evaluate tobacco areas. On this basis, we proposed the grey wolf optimizer (GWO) algorithm to optimize the performance of the support vector machine model (SVM) for the first time to identify and classify tobacco leaves from different origins. This study was conducted with 824 tobacco leaf samples from eight different origins, and 617 training set samples and 207 test set samples were obtained using Set partitioning based on joint x-y distance (SPXY). The wavelength selection methods such as Competitive adaptive reweighted sampling (CARS) and Random frog (RF) algorithms were applied to reduce spectral redundant information and screen the characteristic wavelengths in the -full spectrum of the samples, and 141 and 534 were selected from all 1 609 variables, respectively. Then they were used as the input parameters of the SVM classifier. The optimization effect of GWO on the SVM model was contrasted to the Particle swarm optimization (PSO) and Genetic algorithm (GA) optimization in the same search range. The analysis showed that the spectral variables screened by RF had a better modeling performance than CARS. Among them, the RF-GWO-SVM model achieved the best predictive performance with an accuracy of 96.62% in identifying tobacco leaves from 8 producing areas. More than that, the running time of RF-GWO-SVM was 156 and 131 min shorter than RF-PSO-SVM and RF-GA-SVM, respectively. To sum up, RF-GWO-SVM has the advantages of higher accuracy and faster convergence speed. It can be seen that GWO has a more efficient optimization capability for model parameters, and the support vector machine model optimized by GWO can be used for rapid identification of tobacco origin.
|
Received: 2021-08-17
Accepted: 2021-11-16
|
|
Corresponding Authors:
LIU Xue-song
E-mail: liuxuesong@zju.edu.cn
|
|
[1] Zimmer G F, Santos R O, Teixeira I D, et al. Journal of Chemometrics, 2020, 34(12): e3303.
[2] Wu Lijun, Wang Baoxing, Zhang Lei, et al. Journal of Near Infrared Spectroscopy, 2020, 28(3): 153.
[3] Xiang Boka, Cheng Changhe, Xia Jun, et al. Vibrational Spectroscopy, 2020, 111: 103182.
[4] Li Ruidong, Zhang Xiaobing, Li Keqiang, et al. Spectroscopy Letters, 2020, 53(9): 685.
[5] Gu Li, Xue Lichun, Song Qi, et al. Journal of Bioinformatics and Computational Biology, 2016, 14(6): 1650033.
[6] Deng Jun, Chen Weile, Liang Ce, et al. Journal of Loss Prevention in the Process Industries, 2021, 71: 104439.
[7] Subudhi U, Dash S. Journal of Industrial Information Integration, 2021, 22: 100204.
[8] LI Qing-bo, BI Zhi-qi, SHI Dong-dong(李庆波, 毕智棋, 石冬冬). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2020, 40(9): 2804.
[9] Li Tao, Su Chen. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2018, 204: 131.
[10] Ren Guangxin, Liu Ying, Ning Jingming, et al. Journal of Food Composition and Analysis, 2021, 98: 103810.
[11] Tharwat A, Schenck W. Expert Systems with Applications, 2020, 167(5): 114430.
[12] Zhang Jiawei, Tittel F K, Gong Longwen, et al. Environmental Modeling & Assessment, 2016. 21(4): 531.
[13] Mirjalili S, Mirjalili S M, Lewis A. Advances in Engineering Software, 2014, 69: 46.
[14] Zhang Lin, Sun Jun, Zhou Xin, et al. Journal of Food Processing and Preservation, 2020, 44(8): e14591.
[15] Zhang Dongyan, Yang Yi, Chen Gao, et al. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2020, 248: 119139.
|
[1] |
YANG Cheng-en1, WU Hai-wei1*, YANG Yu2, SU Ling2, YUAN Yue-ming1, LIU Hao1, ZHANG Ai-wu3, SONG Zi-yang3. A Model for the Identification of Counterfeited and Adulterated Sika Deer Antler Cap Powder Based on Mid-Infrared Spectroscopy and Support
Vector Machines[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2359-2365. |
[2] |
WU Ye-lan1, GUAN Hui-ning1, LIAN Xiao-qin1, YU Chong-chong1, LIAO Yu2, GAO Chao1. Study on Detection Method of Leaves With Various Citrus Pests and
Diseases by Hyperspectral Imaging[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2397-2402. |
[3] |
XU Liang-ji1, 2, MENG Xue-ying2, WEI Ren2, ZHANG Kun2. Experimental Research on Coal-Rock Identification Method Based on
Visible-Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2135-2142. |
[4] |
ZHANG Fu-jie, SHI Lei, LI Li-xia*, ZHAO Hao-ran, ZHU Yin-long. Study on Nondestructive Identification of Panax Notoginseng Powder Quality Grade Based on Hyperspectral Imaging Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2255-2261. |
[5] |
WANG Yue1, 3, 4, CHEN Nan1, 2, 3, 4, WANG Bo-yu1, 5, LIU Tao1, 3, 4*, XIA Yang1, 2, 3, 4*. Fourier Transform Near-Infrared Spectral System Based on Laser-Driven Plasma Light Source[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(06): 1666-1673. |
[6] |
FENG Rui-jie1, CHEN Zheng-guang1, 2*, YI Shu-juan3. Identification of Corn Varieties Based on Bayesian Optimization SVM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(06): 1698-1703. |
[7] |
LI Quan-lun1, CHEN Zheng-guang1*, SUN Xian-da2. Rapid Detection of Total Organic Carbon in Oil Shale Based on Near
Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(06): 1691-1697. |
[8] |
LI Qing1, 2, XU Li1, 2, PENG Shan-gui1, 2, LUO Xiao1, 2, ZHANG Rong-qin1, 2, YAN Zhu-yun3, WEN Yong-sheng1, 2*. Research on Identification of Danshen Origin Based on Micro-Focused
Raman Spectroscopy Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(06): 1774-1780. |
[9] |
DAI Lu-lu1, YANG Ming-xing1, 2*, WEN Hui-lin1. Study on Chemical Compositions and Origin Discriminations of Hetian Yu From Maxianshan, Gansu Province[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(05): 1451-1458. |
[10] |
LIU Mei-chen, XUE He-ru*, LIU Jiang-ping, DAI Rong-rong, HU Peng-wei, HUANG Qing, JIANG Xin-hua. Hyperspectral Analysis of Milk Protein Content Using SVM Optimized by Sparrow Search Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(05): 1601-1606. |
[11] |
ZHANG Tian-liang, ZHANG Dong-xing, CUI Tao, YANG Li*, XIE Chun-ji, DU Zhao-hui, ZHONG Xiang-jun. Identification of Early Lodging Resistance of Maize by Hyperspectral Imaging Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(04): 1229-1234. |
[12] |
LI Yan-yan1, 2, LUO Hai-jun1, 2*, LUO Xia1, 2, FAN Xin-yan1, 2, QIN Rui1, 2. Detection of Craniocerebral Hematoma by Array Scanning Sensitivity Based on Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(02): 392-398. |
[13] |
HUI Yun-ting1, WANG De-cheng1, TANG Xin2, PENG Yao-qi1, WANG Hong-da1, ZHANG Hai-feng1, YOU Yong1*. Detection of Sorghum-Sudan Grass Seed Germination Rate Based on Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(02): 423-427. |
[14] |
JIANG Jie1, YU Quan-zhou1, 2, 3*, LIANG Tian-quan1, 2, TANG Qing-xin1, 2, 3, ZHANG Ying-hao1, 3, ZHANG Huai-zhen1, 2, 3. Analysis of Spectral Characteristics of Different Wetland Landscapes Based on EO-1 Hyperion[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(02): 517-523. |
[15] |
LI Ming-liang1, DAI Yu-jia1, QIN Shuang1, SONG Chao2*, GAO Xun1*, LIN Jing-quan1. Influence of LIBS Analysis Model on Quantitative Analysis Precision of Aluminum Alloy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(02): 587-591. |
|
|
|
|