1. College of Electronic Information Science, Fujian Jiangxia University, Fuzhou 350108, China
2. Smart Home Information Collection and Processing on Internet of Things Laboratory of Digital Fujian, Fuzhou 350108, China
3. Institute of Agricultural Quality Standards and Testing Technology, Shandong Academy of Agricultural Sciences, Jinan 250100, China
Abstract Honeysuckle is an essential medicine for clearing away heat and detoxifying. However, the sources of honeysuckle on the market are complicated, and the most famous honeysuckle produced in Pingyi, Shandong, is often counterfeited. Most existing identification methods are time-consuming, costly, and complex to operate. Therefore, a fast and efficient way to trace the honeysuckle’s origin is urgently needed. The current one-dimensional convolutional neural network (1D-CNN) identification model based on honeysuckle near-infrared spectroscopy (NIRS) data has the problems of too many parameters and too low model efficiency, high computational complexity and is prone to overfitting. This paper improves the traditional 1D-CNN structure. We use the more efficient VD (Very Deep) structure to replace the hidden layer structure in the conventional 1D-CNN and make adaptive improvements for NIRS data so that the model can be directly applied to one-dimensional NIRS data. The improvement method is divided into three steps: firstly, the design of the feature layer is converted into two constraints optimization design: the first constraint is to set the C value of each convolution layer (the ratio of the size of the convolution kernel and the receptive field) to 1/6, which can improve the efficiency of the network model; the second constraint is to take the size of the top-level sensory domain as the size of the data vector, which can achieve feature extraction of deeper data and reduce overfitting. Secondly, this design minimizes the output feature vector of the feature layer to a smaller size through the downsampling operation. Finally, use two convolutional layers of size 1×5 and a pooling layer with dropout to downsample the data size to a vector of only one vector instead of a fully connected layer for classification, thereby reducing the number of parameters. In the experiment, 500 honeysuckles samples were collected from the main producing areas of Henan, Shandong, Hebei, and Chongqing. The spectral range used in the test is 908~1 676 nm. The sample set was preprocessed by the KS algorithm, and the training set, validation set and test set were divided by the shuffle algorithm. At last, a honeysuckle origin identification model based on improved 1D-VD-CNN and near-infrared spectroscopy was constructed. The results show that the 1D-VD-CNN training set and test set’s accuracy reach 100%, and the loss value converges around 0.001. Compared with the traditional 1D-CNN model, the training set and test set accuracy of the 1D-VD-CNN model are improved by about 0.5% and 1.4%, respectively, and the number of parameters and FLOPs are reduced by nearly 1 M and 20 M, respectively. At the same time, compared with the original spectral data analysis method and the PLS-DA method, it shows that the 1D-VD-CNN model has higher efficiency and better recognition performance for honeysuckle near-infrared spectral classification.
CHEN Dong-ying,ZHANG Hao,ZHANG Zi-long, et al. Research on the Origin Traceability of Honeysuckle Based on Improved 1D-VD-CNN and Near-Infrared Spectral Data[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(05): 1471-1477.
[1] Chinese Pharmacopoeia Commission(国家药典委员会). Pharmacopoeia of the People’s Republic of China(中华人民共和国药典), 2015. 342.
[2] LI Hong-xia, WANG Xue-qin, LI Zhen-guo, et al(李红霞,王雪芹,李振国,等). China Pharmacy(中国药房),2011,22(31): 2935.
[3] QI Da-ming, DONG Cheng-ming, LIU Tian-liang, et al(齐大明,董诚明,刘天亮, 等). Chinese Traditional and Herbal Drugs(中草药),2021, 52(22):1.
[4] CHEN-Lu, ZHANG Hui, ZHANG Bing-chun, et al(陈 璐, 张 惠, 张丙春, 等). Quality and Safety of Agro-Products(农产品质量与安全), 2019, (4): 41.
[5] WANG Hai-ge, XU Wen, ZHANG Xun, et al(王海阁, 许 文, 张 勋, 等). Journal of Chinese Medicinal Materials(中药材), 2020, 43(2): 303.
[6] Zhang L, Ding X, Hou R. Journal of Analytical Methods in Chemistry, 2020, (22): 1.
[7] Chai Qinqin, Zeng Jian, Lin Donghong, et al. Journal of Pharmaceutical and Biomedical Analysis, 2021,(19): 1.
[8] LI Heng, ZHANG Qing, QIN Xian-rong, et al(李 恒, 张 氢, 秦仙蓉, 等). Vibration and Shock(振动与冲击), 2018,(19): 124.
[9] CHEN Qi, PAN Tian-hong, LI Yu-qiang, et al(陈 琦, 潘天红,李鱼强,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2021, 41(9):2776.
[10] CHAI Qin-qin, ZENG Jian, ZHANG Xun(柴琴琴,曾 建,张 勋). Zhejiang Agricultural Journal(浙江农业学报), 2022, 34(2): 391.
[11] CUI Jia-xu, YANG Bo(崔佳旭, 杨 博). Journal of Software(软件学报), 2018, 29(10): 3068.
[12] Alex K, Ilya S, Geoffrey E H. Communications of the ACM, 2017, 60(6): 84.
[13] Chai Qinqin, Zeng Jian, et al. Journal of Pharmaceutical and Biomedical Analysis, 2021, 199: 114035.
[14] Chen X, Chai Q, Lin N, et al. Analytical Methods, 2019, 11(40): 5118.
[15] Dmytro Mishkin, Sergievskiy N, Matas J. Computer Vision and Image Understanding, 2017, 42(12): 11.