|
|
|
|
|
|
Information Extraction of Near Infrared Spectra for Complex Samples Based on Wavelet Packet Transform and Entropy Theory |
PENG Dan, YUE Jin-xia, BI Yan-lan |
School of Food Science and Technology, Henan University of Technology, Zhengzhou 450001, China |
|
|
Abstract The method of useful information extraction is one of the most important one in the detection application for complex samples using near infrared (NIR) spectroscopy. Due to various noise, baseline drift, peak overlapping and complex background, informative spectra were hidden and cannot be extracted by conventional methods. Thus, a new hybrid algorithm (EWPIE) was proposed for informative spectra extraction based on wavelet packet transform (WPT) and entropy theory in this study. In EWPIE algorithm, WPT algorithm and its reconstruction algorithm were adopted to split the raw spectra into serval subspectra in different frequency bands. To take advantage of the multiscale property of NIR spectroscopy, each subspectra was further processed through entropy-based phase and orthogonal signal correction (OSC)-based phase. In entropy-based phase, the information entropy theory was used as a filter to remove the interference, which are uncorrelated to analyte contents and would increase the uncertainty of whole spectra. According to the variation of entropy value, some subspectra representing basedline in low frequency band and some subspectra representing noise in high frequency band were filtered out. In OSC-based phase, the OSC algorithm was applied to each of the remaining subspectra and the useful spectra were obtained with accumulation of all the OSC-filtered subspectra. To validate this algorithm, a real NIR spectral dataset of milk was prepared to extract the correlated spectra about the content of fat and protein. Using the EWPIE-filtered spectra, a series of PLS prediction model were constructed. Experimental results show that the prediction ability and robustness of PLS_based prediction models developed with EWPIE algorithm are superior to those developed by conventional algorithms. The root mean square errors of the prediction models for fat and protein can reach up to 0.132% and 0.121%, which indicates that the EWPIE algorithm is a promise tool to extract the useful information from NIR spectra and has certain theoretical significance and practical application value for detection in complex systems.
|
Received: 2016-11-03
Accepted: 2017-04-12
|
|
|
[1] Maruo K, Tsurugi M, Tamura M, et al. Appl. Spectrosc., 2003, 57(10): 1236.
[2] Fernández-Cuesta álvaro, Fernández-Martínez J M, Velasco L. J. Am. Oil Chem. Soc., 2012, 89(3): 431.
[3] GUO Zhi-ming, HUANG Wen-qian, PENG Yan-kun, et al(郭志明, 黄文倩, 彭彦昆, 等). Chinese Journal of Analytical Chemistry(分析化学),2014, 42(4): 513.
[4] XIA Hui, HUANG Sheng-xiang, LI Hong-jian(夏 辉,黄生祥,李宏建). Chinese Journal of Lasers(中国激光),2008,35(1):102.
[5] Ben X, Yi H L, Tan H P. Appl. Opt., 2014, 53(7): 1427.
[6] Duarte M S, Pontes M J C, Ramos C S. J. Appl. Spectrosc., 2016, 82(6): 1042.
[7] LI Hua, WANG Ju-xiang, XING Zhi-na, et al(李 华,王菊香,邢志娜,等). Computers and Applied Chemistry(计算机与应用化学),2011,28(1):99.
[8] Cai C B, Xu L, Han Q J, et al. Talanta, 2010, 81(3): 799.
[9] Shannon C E. Bell. Syst. Tech., 1948, 27: 379.
[10] Eckschlager K, Stepanek V. Anal. Chem., 1982, 54: 1115.
[11] Wold S, Antti H, Lindgren F, et al. Chemom. Intell. Lab. Syst., 1998, 44: 175.
[12] PENG Dan, XU Ke-xin(彭 丹, 徐可欣). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2010, 30(2): 318. |
[1] |
GAO Feng1, 2, XING Ya-ge3, 4, LUO Hua-ping1, 2, ZHANG Yuan-hua3, 4, GUO Ling3, 4*. Nondestructive Identification of Apricot Varieties Based on Visible/Near Infrared Spectroscopy and Chemometrics Methods[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 44-51. |
[2] |
BAO Hao1, 2,ZHANG Yan1, 2*. Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 148-157. |
[3] |
HU Cai-ping1, HE Cheng-yu2, KONG Li-wei3, ZHU You-you3*, WU Bin4, ZHOU Hao-xiang3, SUN Jun2. Identification of Tea Based on Near-Infrared Spectra and Fuzzy Linear Discriminant QR Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3802-3805. |
[4] |
LIU Xin-peng1, SUN Xiang-hong2, QIN Yu-hua1*, ZHANG Min1, GONG Hui-li3. Research on t-SNE Similarity Measurement Method Based on Wasserstein Divergence[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3806-3812. |
[5] |
BAI Xue-bing1, 2, SONG Chang-ze1, ZHANG Qian-wei1, DAI Bin-xiu1, JIN Guo-jie1, 2, LIU Wen-zheng1, TAO Yong-sheng1, 2*. Rapid and Nndestructive Dagnosis Mthod for Posphate Dficiency in “Cabernet Sauvignon” Gape Laves by Vis/NIR Sectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3719-3725. |
[6] |
WANG Qi-biao1, HE Yu-kai1, LUO Yu-shi1, WANG Shu-jun1, XIE Bo2, DENG Chao2*, LIU Yong3, TUO Xian-guo3. Study on Analysis Method of Distiller's Grains Acidity Based on
Convolutional Neural Network and Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3726-3731. |
[7] |
LUO Li, WANG Jing-yi, XU Zhao-jun, NA Bin*. Geographic Origin Discrimination of Wood Using NIR Spectroscopy
Combined With Machine Learning Techniques[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3372-3379. |
[8] |
ZHANG Shu-fang1, LEI Lei2, LEI Shun-xin2, TAN Xue-cai1, LIU Shao-gang1, YAN Jun1*. Traceability of Geographical Origin of Jasmine Based on Near
Infrared Diffuse Reflectance Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3389-3395. |
[9] |
YANG Qun1, 2, LING Qi-han1, WEI Yong1, NING Qiang1, 2, KONG Fa-ming1, ZHOU Yi-fan1, 2, ZHANG Hai-lin1, WANG Jie1, 2*. Non-Destructive Monitoring Model of Functional Nitrogen Content in
Citrus Leaves Based on Visible-Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3396-3403. |
[10] |
HUANG Meng-qiang1, KUANG Wen-jian2, 3*, LIU Xiang1, HE Liang4. Quantitative Analysis of Cotton/Polyester/Wool Blended Fiber Content by Near-Infrared Spectroscopy Based on 1D-CNN[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3565-3570. |
[11] |
HUANG Zhao-di1, CHEN Zai-liang2, WANG Chen3, TIAN Peng2, ZHANG Hai-liang2, XIE Chao-yong2*, LIU Xue-mei4*. Comparing Different Multivariate Calibration Methods Analyses for Measurement of Soil Properties Using Visible and Short Wave-Near
Infrared Spectroscopy Combined With Machine Learning Algorithms[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3535-3540. |
[12] |
KANG Ming-yue1, 3, WANG Cheng1, SUN Hong-yan3, LI Zuo-lin2, LUO Bin1*. Research on Internal Quality Detection Method of Cherry Tomatoes Based on Improved WOA-LSSVM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3541-3550. |
[13] |
HUANG Hua1, LIU Ya2, KUERBANGULI·Dulikun1, ZENG Fan-lin1, MAYIRAN·Maimaiti1, AWAGULI·Maimaiti1, MAIDINUERHAN·Aizezi1, GUO Jun-xian3*. Ensemble Learning Model Incorporating Fractional Differential and
PIMP-RF Algorithm to Predict Soluble Solids Content of Apples
During Maturing Period[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3059-3066. |
[14] |
CHEN Jia-wei1, 2, ZHOU De-qiang1, 2*, CUI Chen-hao3, REN Zhi-jun1, ZUO Wen-juan1. Prediction Model of Farinograph Characteristics of Wheat Flour Based on Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3089-3097. |
[15] |
GUO Ge1, 3, 4, ZHANG Meng-ling3, 4, GONG Zhi-jie3, 4, ZHANG Shi-zhuang3, 4, WANG Xiao-yu2, 5, 6*, ZHOU Zhong-hua1*, YANG Yu2, 5, 6, XIE Guang-hui3, 4. Construction of Biomass Ash Content Model Based on Near-Infrared
Spectroscopy and Complex Sample Set Partitioning[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3143-3149. |
|
|
|
|