|
|
|
|
|
|
Identification of New and Old Pinus Koraiensis Seeds by Near-Infrared Spectroscopy (NIRs) With t-SNE Dimensionality Reduction |
LI Hong-bo, CAO Jun, JIANG Da-peng, ZHANG Dong-yan*, ZHANG Yi-zhuo* |
College of Mechanical and Electrical Engineering, Northeast Forestry University, Harbin 150040, China |
|
|
Abstract The new and old characteristics of pinus koraiensis seeds is an important property reflecting the edible value and breeding value. The pinus koraiensis seeds with a short storage period also have high deep processing value. However, it is difficult to distinguish by appearance, weight and texture. At present, traditional biochemical methods are used to detect the chemical properties and germination percentage of pinus koraiensis seeds to judge their new and old quality. It takes a long time to meet the needs of online detection, and improper treatment of chemical reagents can cause environmental pollution. Near-infrared spectroscopy (NIRS) is widely used in the field of food detection and forestry. Therefore, it has practical significance and guiding significance for qualitative analysis of nuts with shells. In this study, near infrared spectroscopy was used to conduct nondestructive testing of pinus koraiensis seeds matured in the current year and in previous years. Firstly, the 120 pinus koraiensis seeds were randomly selected and labeled according to new and old classifications. In order to reduce the leakage of light during the measurement process and make the experimental data more generally, the near-infrared diffuse reflectance spectra of pinus koraiensis seeds samples on the same side were collected uniformly. Then, the original spectrum was pretreated by using a standard normalized variable (SNV), first derivative and Savitzky-Golay (SG) algorithm, so as to reduce the influence caused by human factors and pretreatment in the experiment process, and highlight the characteristic information of the near-infrared spectrum. After that, principal component analysis (PCA) and t-distributed Stochastic Neighbor Embedding (t-SNE) were used to reduce the dimension of the pretreated data and compare the effect of dimension reduction. Through the visualization of the data and the output of the clustering parameters, a better dimension reduction scheme was obtained by comparison. The non-linear dimensionality reduction method has a good effect in the near-infrared spectral data processing of pinus koraiensis seeds. Therefore, the t-SNE method was used to reduce the dimensionality of the data, and the optimal characteristic variables were obtained. Finally, taking the reduced dimension data as input. Using two-thirds of the sample data as a correction set to establish a support vector machine (SVM) correction model for classification of new and old seeds, and a third of the sample data were used as a validation set to validate the model performance. The results indicate that. The superposition of SNV, first derivative and SG to pretreat the spectrum can effectively eliminate the noise, it makes the absorption peak more obvious. Meanwhile, it also makes the spectral profile clearer and smoother, which is more conducive to the establishment of the later model. The method of t-SNE is used to reduce the data to two-dimension as the input of the classification model, and when the kernel function selects the RBF, the value of K is 5, γ is 82.54 and the penalty coefficient C is 383.12, the SVM classification model has the best classification effect, the accuracy can reach 97.5%, and the average time consumption is 0.02 s. Near-infrared spectroscopy can be used to achieve non-destructive testing of the new and old characteristics of pinus koraiensis seeds.
|
Received: 2019-07-25
Accepted: 2019-12-29
|
|
Corresponding Authors:
ZHANG Dong-yan, ZHANG Yi-zhuo
E-mail: zhangdy76@126.com; nefuzyz@163.com
|
|
[1] Jan U Porep,Dietmar R Kammerer,Reinhold Carle. Trends in Food Science & Technology,2015, 46: 211.
[2] Guo Zhiming,Huang Wenqian,Peng Yankun,et al. Postharvest Biology and Technology,2016, 115: 81.
[3] Veronica Loewe,Rafael Maria Navarro-Cerrillo,Juan García-Olmo,et al. Food Control,2017,73: 634.
[4] Cortes V, Rodriguez A, Blasco J, et al. Journal of Food Engineering, 2017, 204: 27.
[5] Toktam Mohammadi-Moghaddam,Seyed M A Razavi·Ameneh Sazgarnia·Masoud Taghizadeh. Food Measure,2018,12: 346.
[6] YU Hui-ling,MEN Hong-sheng,LIANG Hao,et al(于慧伶,门洪生,梁 浩,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析),2018,38(6): 1724.
[7] Patrizia Firmani,Silvia De Luca,Remo Bucci,et al. Food Control,2019, 100: 292.
[8] Arslan M, Zou X B, Tahir H E, et al. International Journal of Food Engineering, 2019, 15(3-4): 20180148. doi: https://doi.org/10.1515/ijfe-2018-0148.
[9] Tsuchikawa S,Kobori H. Journal of Wood Science,2015,61(3): 213.
[10] Shimazui T,Yoshikawa K,Kojima T,et al. Holzforschung,2015,69(3): 329.
[11] ZHANG Yi-zhuo,SU Yao-wen,LI Chao,et al(张怡卓,苏耀文,李 超,等). Journal of Forestry Engineering(林业工程学报),2016,1(6): 17.
[12] Todorovic N,Popovic Z,Milic G. Wood Science & Technology,2015,49(3): 527.
[13] WANG Zhen-hao,DU Hong-jin,LI Guo-qing,et al(王振浩, 杜虹锦, 李国庆, 等). Power System Protection and Control(电力系统保护与控制), 2018, 22: 64. |
[1] |
GAO Feng1, 2, XING Ya-ge3, 4, LUO Hua-ping1, 2, ZHANG Yuan-hua3, 4, GUO Ling3, 4*. Nondestructive Identification of Apricot Varieties Based on Visible/Near Infrared Spectroscopy and Chemometrics Methods[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 44-51. |
[2] |
BAO Hao1, 2,ZHANG Yan1, 2*. Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 148-157. |
[3] |
BAI Xue-bing1, 2, SONG Chang-ze1, ZHANG Qian-wei1, DAI Bin-xiu1, JIN Guo-jie1, 2, LIU Wen-zheng1, TAO Yong-sheng1, 2*. Rapid and Nndestructive Dagnosis Mthod for Posphate Dficiency in “Cabernet Sauvignon” Gape Laves by Vis/NIR Sectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3719-3725. |
[4] |
WANG Qi-biao1, HE Yu-kai1, LUO Yu-shi1, WANG Shu-jun1, XIE Bo2, DENG Chao2*, LIU Yong3, TUO Xian-guo3. Study on Analysis Method of Distiller's Grains Acidity Based on
Convolutional Neural Network and Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3726-3731. |
[5] |
HU Cai-ping1, HE Cheng-yu2, KONG Li-wei3, ZHU You-you3*, WU Bin4, ZHOU Hao-xiang3, SUN Jun2. Identification of Tea Based on Near-Infrared Spectra and Fuzzy Linear Discriminant QR Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3802-3805. |
[6] |
LIU Xin-peng1, SUN Xiang-hong2, QIN Yu-hua1*, ZHANG Min1, GONG Hui-li3. Research on t-SNE Similarity Measurement Method Based on Wasserstein Divergence[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3806-3812. |
[7] |
LUO Li, WANG Jing-yi, XU Zhao-jun, NA Bin*. Geographic Origin Discrimination of Wood Using NIR Spectroscopy
Combined With Machine Learning Techniques[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3372-3379. |
[8] |
ZHANG Shu-fang1, LEI Lei2, LEI Shun-xin2, TAN Xue-cai1, LIU Shao-gang1, YAN Jun1*. Traceability of Geographical Origin of Jasmine Based on Near
Infrared Diffuse Reflectance Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3389-3395. |
[9] |
YANG Qun1, 2, LING Qi-han1, WEI Yong1, NING Qiang1, 2, KONG Fa-ming1, ZHOU Yi-fan1, 2, ZHANG Hai-lin1, WANG Jie1, 2*. Non-Destructive Monitoring Model of Functional Nitrogen Content in
Citrus Leaves Based on Visible-Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3396-3403. |
[10] |
HUANG Meng-qiang1, KUANG Wen-jian2, 3*, LIU Xiang1, HE Liang4. Quantitative Analysis of Cotton/Polyester/Wool Blended Fiber Content by Near-Infrared Spectroscopy Based on 1D-CNN[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3565-3570. |
[11] |
XIE Peng, WANG Zheng-hai*, XIAO Bei, CAO Hai-ling, HUANG Yi, SU Wen-lin. Hyperspectral Quantitative Inversion of Soil Selenium Content Based on sCARS-PSO-SVM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3599-3606. |
[12] |
HUANG Zhao-di1, CHEN Zai-liang2, WANG Chen3, TIAN Peng2, ZHANG Hai-liang2, XIE Chao-yong2*, LIU Xue-mei4*. Comparing Different Multivariate Calibration Methods Analyses for Measurement of Soil Properties Using Visible and Short Wave-Near
Infrared Spectroscopy Combined With Machine Learning Algorithms[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3535-3540. |
[13] |
KANG Ming-yue1, 3, WANG Cheng1, SUN Hong-yan3, LI Zuo-lin2, LUO Bin1*. Research on Internal Quality Detection Method of Cherry Tomatoes Based on Improved WOA-LSSVM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3541-3550. |
[14] |
HUANG Hua1, LIU Ya2, KUERBANGULI·Dulikun1, ZENG Fan-lin1, MAYIRAN·Maimaiti1, AWAGULI·Maimaiti1, MAIDINUERHAN·Aizezi1, GUO Jun-xian3*. Ensemble Learning Model Incorporating Fractional Differential and
PIMP-RF Algorithm to Predict Soluble Solids Content of Apples
During Maturing Period[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3059-3066. |
[15] |
CHEN Jia-wei1, 2, ZHOU De-qiang1, 2*, CUI Chen-hao3, REN Zhi-jun1, ZUO Wen-juan1. Prediction Model of Farinograph Characteristics of Wheat Flour Based on Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3089-3097. |
|
|
|
|