Feature Wavelength Selection and Efficiency Analysis for Paddy Moisture Content Prediction by Near Infrared Spectroscopy
HUANG Hua1, WU Xi-yu2, ZHU Shi-ping1*
1. School of Engineering and Technology, Southwest University, Chongqing 400716, China
2. College of Food Science and Technology, Southwest University, Chongqing 400716, China
Abstract:In order to obtain the best feature wavelength region for predicting paddy moisture content (PMC) by near infrared spectroscopy (NIR), this research selected 364 paddy samples of “Gangyou 916” which moisture content varied from 32.66% to 2.24%, the pretreatments methods such as mean centering (Mean), standard normalized variate (SNV), Savitzky-Golay derivative (SG1) and multiplicative scatter correction (MSC) were performed, and adopted the feature wavelengths selection methods include interval method (IM), synergy interval method (SIM), moving window method (MWM) and backward interval method (BIM), then used partial least squares (PLS) and principal component regression (PCR) quantitative analysis algorithms for the PMC NIR modeling. The calculation formulas of complexity for wavelengths selection methods with IM, SIM, MWM and BIM are firstly provided in this paper. And this paper also compares and analyzes the program operating efficiency of these methods. The results show that the prediction ability of PLS is better than PCR, but the modeling efficiency of PLS is lower than PCR. The BIM is the optimum prediction model for PMC among the four wavelengths selection methods, which root mean square error of prediction (RMSEP) and correlation coefficient (Rp) in prediction set are 0.995 6 and 0.78%, respectively. The second is MWM, which RMSEP and RP are 0.994 3 and 0.89%, respectively. However, the program running efficiency of these two methods is relatively low. The average running time of BIM is 4.87 h, and average running time of MWM is 29.82 h. This work provided a reference comparison for a fast algorithm of near infrared spectroscopy prediction model in parallel computing and distributed computing.
Key words:Near infrared spectroscopy; Wavelength selection; The program running efficiency; Paddy moisture content
[1] The National Standards of the People’s Republic of China GB 1350—2009: 2009, Paddy(中华人民共和国国家标准:稻谷),2009.
[2] Horigane A K, Suzuki K, Yoshida M. Journal of Cereal Science, 2014, 60(1): 193.
[3] WANG Jiang-rong,ZHOU Jing, SHEN Na, et al(王江蓉, 周 京, 沈 娜, 等). Cereal & Feed Industry(粮食与饲料工业), 2015,(3): 5.
[4] LIU Jie, LI Xiao-yu, LI Pei-wu, et al(刘 洁, 李小昱, 李培武, 等). Transactions of the Chinese Society of Agricultural Engineering(农业工程学报), 2010, 26(2): 338.
[5] Sundaram J, Kandala C V K, Butts C L, et al. Transactions of the ASABE,2010, 53(1): 183.
[6] Deng B, Yun Y, Liang Y, et al. Analyst, 2014, 139(19): 4836.
[7] Liu D, Sun D, Zeng X. Food and Bioprocess Technology, 2014, 7(2): 307.
[8] YUAN Ying,WANG Wei,CHU Xuan, et al(袁 莹, 王 伟, 褚 璇, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2016, 36(1): 226.
[9] YANG Ai-xia, DING Jian-li, LI Yan-hong, et al(杨爱霞, 丁建丽, 李艳红, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2016, 36(3): 691.
[10] The National Standards of the People’s Republic of China GB 5009.3—2016: 2016, Methods for Determination of Moisture Content in Food(中华人民共和国国家标准:食品中水分的测定),2016.
[11] Mehmood T, Liland K H, Snipen L, et al. Chemometrics and Intelligent Laboratory Systems, 2012, 118(16): 62.
[12] Chen Q, Zhao J, Liu M, et al. Journal of Pharmaceutical and Biomedical Analysis, 2008, 46(3): 568.
[13] TANG Xiu-ying, NIU Li-zhao, XU Yang, et al(汤修映, 牛力钊, 徐 杨, 等). Transactions of the Chinese Society of Agricultural Engineering(农业工程学报), 2013, 29(11): 248.
[14] Jie D, Xie L, Fu X, et al. Journal of Food Engineering, 2013, 118(4): 387.