光谱学与光谱分析 |
|
|
|
|
|
Selection of Variables for MLR in Vis/NIR Spectroscopy Based on BiPLS Combined with GA |
LI Peng-fei, WANG Jia-hua, CAO Nan-ning, HAN Dong-hai* |
College of Food Science and Nutritional Engineering, China Agricultural University, Beijing 100083, China |
|
|
Abstract The feasibility of using efficient selection of variables in Vis/NIR for a rapid and conclusive determination of fruit inner qualities such as soluble solids content (SSC) of plums was investigated. A new strategy was proposed in the present paper, i.e. two-stage variable selection using the backward interval partial least squares (BiPLS) combined with genetic algorithm (GA). Firstly, it splits the whole spectral region into equidistant sub-regions and then develops all BiPLS regression models, and the informative regions which are used to constructed PLS models with the lowest error can be located. Secondly, GA method is used to select variable in these informative regions, which are used for regression variables of MLR model. The Vis/NIR spectra containing 225 individual data points were processed by Savizky-Golay filter smoothing and second-order derivative, and 9 sub-regions were selected by BiPLS procedure when the spectra were divided into 25 sub-regions. The optimal 12 variables, which were the output of the GA procedure, were selected by the higher occurrence frequency while the GA procedure ran 100 times. In order to simplify the multiple linear regression (MLR) modeling, the wavelength variables with the maximum occurrence frequency were chosen when the adjacent wavelengths were selected by GA. Finally, 638, 734, 752, 868, 910, 916 and 938 nm were used to build a MLR model. The results show that MLR model produced by BiPLS-GA performs well with correlation coefficients (R) of 0.984, root mean standard error of calibration (RMSEC) of 0.364 and root mean standard error of prediction (RMSEP) of 0.471 for SSC, which outperforms models using stepwise regression analysis (SRA). This work proved that the BiPLS-GA could determine optimal variables in Vis/NIR spectra and improve the accuracy of model.
|
Received: 2008-08-10
Accepted: 2008-12-20
|
|
Corresponding Authors:
HAN Dong-hai
E-mail: caundt@cau.edu.cn
|
|
[1] Nicolai B M, Beullens K, Bobelyn E, et al. Postharvest Biology and Technology, 2007, 46: 99. [2] Ventura M, Jager A D, Putter H D, et al. Postharvest Biology and Technology, 1998, 14: 21. [3] HAN Dong-hai, WANG Jia-hua(韩东海, 王加华). Chinese Journal of Lasers(中国激光), 2008, 35(8): 1123. [4] LIU Yan-de, YING Yi-bin, FU Xia-ping, et al. Journal of Food Engineering, 2007, 80: 986. [5] Cayuela J A. Postharvest Biology and Technology, 2008, 47: 75. [6] Kim J, Mowat A, Poole P, et al. Chemometrics and Intelligent Laboratory Systems, 2000, 51: 201. [7] CHU Xiao-li, YUAN Hong-fu, LU Wan-zhen(褚小立, 袁洪福, 陆婉珍). Progress in Chemistry(化学进展), 2007, 16(4): 528. [8] LI Gui-feng, ZHAO Guo-jian, WANG Xiang-dong, et al(李桂峰,赵国建,王向东,等). Transactions of the Chinese Society of Agricultural Engineering(农业工程学报), 2008, 24(6): 169. [9] Galvao R K H, Araújo M C U, Fragoso W D, et al. Chemometrics and Intelligent Laboratory Systems, 2008, 92: 83. [10] Leardi R. Data Handling in Science and Technology, 2003, 23: 169. [11] CHENG Biao, CHEN De-zhao, WU Xiao-hua(成 飙, 陈德钊, 吴晓华). Chinese Journal of Analytical Chemistry(分析化学), 2006, 34: 123. [12] Durand A, Devos O, Ruckebusch C, et al. Analytica Chimica Acta, 2007, 595: 72. [13] YING Yi-bin, LIU Yan-de. Journal of Food Engineering, 2008, 84: 206. [14] ZOU Xiao-bo, ZHAO Jie-wen, LI Yan-xiao. Vibrational Spectroscopy, 2007, 44: 220. |
[1] |
GAO Wei-ling, ZHANG Kai-hua*, XU Yan-fen, LIU Yu-fang*. Data Processing Method for Multi-Spectral Radiometric Thermometry Based on the Improved HPSOGA[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3659-3665. |
[2] |
HUANG Hua1, LIU Ya2, KUERBANGULI·Dulikun1, ZENG Fan-lin1, MAYIRAN·Maimaiti1, AWAGULI·Maimaiti1, MAIDINUERHAN·Aizezi1, GUO Jun-xian3*. Ensemble Learning Model Incorporating Fractional Differential and
PIMP-RF Algorithm to Predict Soluble Solids Content of Apples
During Maturing Period[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3059-3066. |
[3] |
CAI Jian-rong1, 2, HUANG Chu-jun1, MA Li-xin1, ZHAI Li-xiang1, GUO Zhi-ming1, 3*. Hand-Held Visible/Near Infrared Nondestructive Detection System for Soluble Solid Content in Mandarin by 1D-CNN Model[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2792-2798. |
[4] |
NIU Fang-peng1, 2, LI Xin-guo1, 3*, BAI Yun-gang2, ZHAO Hui4. Hyperspectral Estimation Model of Soil Organic Carbon Content Based on Genetic Algorithm Fused With Continuous Projection Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(07): 2232-2237. |
[5] |
ZHANG Mei-zhi1, ZHANG Ning1, 2, QIAO Cong1, XU Huang-rong2, GAO Bo2, MENG Qing-yang2, YU Wei-xing2*. High-Efficient and Accurate Testing of Egg Freshness Based on
IPLS-XGBoost Algorithm and VIS-NIR Spectrum[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(06): 1711-1718. |
[6] |
ZHANG Fu1, 2, 3, CAO Wei-hua1, CUI Xia-hua1, WANG Xin-yue1, FU San-ling4*, ZHANG Ya-kun1. Non-Destructive Detection of Soluble Solids in Cherry Tomatoes by
Visible/Near Infrared Spectroscopy Based on SG-CARS-IBP[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(03): 737-743. |
[7] |
ZHENG Kai-yi, SHEN Ye, ZHANG Wen, ZHOU Chen-guang, DING Fu-yuan, ZHANG Yang, ZHANG Rou-jia, SHI Ji-yong, ZOU Xiao-bo*. Interval Genetic Algorithm for Double Spectra and Its Applications in Calibration Transfer[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(12): 3783-3788. |
[8] |
LIU Meng-xuan1, 2, 3, 4, WU Qiong5, WANG Xu-quan1, 2, 4, CHEN Qi5, ZHANG Yong-gang1, 2, HUANG Song-lei1, 2*, FANG Jia-xiong1, 2*. Validity and Redundancy of Spectral Data in the Detection Algorithm of Sucrose-Doped Content in Tea[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(11): 3647-3652. |
[9] |
PENG Jiao-yu1, 2*, YANG Ke-li1, 2, BIAN Shao-ju1, 3, 4, CUI Rui-zhi1, 3, DONG Ya-ping1, 2, LI Wu1, 3. Quantitative Analysis of Monoborates (H3BO3 and B(OH)-4) in Aqueous Solution by Raman Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2456-2462. |
[10] |
GUO Yang1, GUO Jun-xian1*, SHI Yong1, LI Xue-lian1, HUANG Hua2, LIU Yan-cen1. Estimation of Leaf Moisture Content in Cantaloupe Canopy Based on
SiPLS-CARS and GA-ELM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2565-2571. |
[11] |
HUANG Qing1, XUE He-ru1*, LIU Jiang-ping1*, LIU Mei-chen1, HU Peng-wei1, SUN De-gang2. Spectral Selection Method Based on Ant Colony-Genetic Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(07): 2262-2268. |
[12] |
JIANG Qing-hu1, LIU Feng1, YU Dong-yue2, 3, LUO Hui2, 3, LIANG Qiong3*, ZHANG Yan-jun3*. Rapid Measurement of the Pharmacological Active Constituents in Herba Epimedii Using Hyperspectral Analysis Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(05): 1445-1450. |
[13] |
WU Jie1, LI Chuang-kai1, CHEN Wen-jun1, HUANG Yan-xin1, ZHAO Nan1, LI Jia-ming1, 2*, YANG Huan3, LI Xiang-you4, LÜ Qi-tao3,5, ZHANG Qing-mao1,2,5. Multiple Liner Regression for Improving the Accuracy of Laser-Induced Breakdown Spectroscopy Assisted With Laser-Induced Fluorescence (LIBS-LIF)[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(03): 795-801. |
[14] |
ZHANG Fu1, 2, 3, CUI Xia-hua1, DING Ke4*, ZHANG Ya-kun1, WANG Yong-xian1, PAN Xiao-qing5. Study on the Influence of Different Pretreatment Methods on Gender Determination of Multiposition[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(02): 434-439. |
[15] |
WANG Xue-yuan1, 2, 3, HE Jian-feng1, 2, 3*, NIE Feng-jun2, YUAN Zhao-lin1, 2, 3, LIU Lin1, 2, 3. Decomposition of X-Ray Fluorescence Overlapping Peaks Based on Quantum Genetic Algorithm With Multi-Fitness Function[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(01): 152-157. |
|
|
|
|