1. 江苏大学食品与生物工程学院,江苏 镇江 212013
2. Department of Molecular Cell Physiology, Kyoto Prefectural University of Medicine, Kyoto 602-8566, Japan
Using Ensemble Refinement (ER) Method to OptimizeTransfer Set of Near-Infrared Spectra
ZHENG Kai-yi1, ZHANG Wen1, DING Fu-yuan1, ZHOU Chen-guang1, SHI Ji-yong1, Yoshinori Marunaka2, ZOU Xiao-bo1*
1. School of Food and Biological Engineering, Jiangsu University, Zhenjiang 212013, China
2. Department of Molecular Cell Physiology, Kyoto Prefectural University of Medicine, Kyoto 602-8566, Japan
Abstract:The near-infrared spectra has been widely used in the food region with advantages of low measurement cost, easy operation, and fast analysis rate. An indirect analytical method should calibrate a feasible model between spectra and concentrations. However, the model calibrated under a specific condition may be invalid for the spectra measured under another condition. Recalibration is a solution to this problem. However, recalibrating the model between spectra and concentration cost much time and workforce. Thus, calibration transfer can correct the spectral deviation to keep the precision of prediction and avoid the expense of recalibration. In calibration transfer, the spectra used for calibrating model are called primary spectra (A), while those not calibrate model but only use the model of primary spectra are called secondary spectra (B). The procedure of calibration transfer is selecting samples as transfer set of primary spectra (At) from the calibration set, while choosing the samples of secondary spectra as transfer-set of secondary spectra (Bt) who share the same concentrations of At. Then the transfer matrix can be constructed through At and Bt. After that, the corrected secondary spectra (Bnew) can be obtained by validating a set of secondary spectra (Bv) multiplying the transfer matrix. Finally, the Bnew can be substituted for the primary spectra model for prediction. In calibration transfer, generating a transfer set is an important procedure. Selecting samples of transfer set is commonly based on the distances of spectra rather than validation errors. However, the transfer errors are important to estimate the power of calibration transfer. Hence, in this paper, ensemble refinement (ER) based on model population analysis has been proposed to refine further the transfer set generated by the KS method. Initially, the ER generates several subsets of a transfer set and then computes the validation errors of each subset. Subsequently the average error of subsets that includes the sample can be obtained for each sample. Finally, the samples with low average errors can be selected as a transfer set for calibration transfer. The corn dataset is used to examine this method. The results exhibited that in calibration transfer methods such as canonical correlation analysis combined with informative components extraction (CCA-ICE), direct standardization (DS), piecewise direct standardization (PDS) and spectral space transformation (SST), ER can select key samples for calibration transfer to reduce the errors, compared with KS method significantly.
Key words:Calibration transfer; Model population analysis; Sample selection; Partial least squares; Near-infrared spectrum
[1] Fitamo T, Triolo J M, Boldrin A, et al. Water Research, 2017, 119: 242.
[2] Wang S, Liu S, Yuan Y, et al. Infrared Physics and Technology, 2020, 106: 103.
[3] Shi J, Zhang F, Wu S, et al. Food Chemistry, 2019, 274: 925.
[4] Sun X D, Wu H L, Chen Y, et al. Chemometrics and Intelligent Laboratory Systems, 2019, 194: 103.
[5] Rodrigues R R T, Rocha J T C, Oliveira L M S L, et al. Chemometrics and Intelligent Laboratory Systems, 2017, 166: 7.
[6] Marchesini G, Serva L, Garbin E, et al. Italian Journal of Animal Science, 2017, 17(1): 66.
[7] LIANG Chen, ZHAO Zhong, CAO Yu-ting,et al. Spectroscopy and Spectral Analysis, 2017, 37(5): 1587.
[8] Milanez K D T M, Nobrega T C A, Nascimento D S, et al. Microchemical Journal, 2017, 133: 669.
[9] Yang J, Lou X, Yang H, et al. Analytical Letters, 2019, 52(14): 2188.
[10] Bin J, Li X, Fan W, et al. Analyst, 2017, 142(12): 2229.
[11] Du W, Chen Z, Zhong L, et al. Analytica Chimica Acta, 2011, 690(1): 64.
[12] CHEN Yi-yun, ZHAO Rui-Ying, QI Tian-ci, et al. Spectroscopy and Spectral Analysis, 2017, 37(7): 2133.
[13] Deng B, Long H, Tang T, et al. International Journal of Molecular Sciences, 2019, 20(4): 955.
[14] Deng B, Lu H, Tan C, et al. Chemometrics and Intelligent Laboratory Systems, 2018, 172: 223.
[15] Zhang F, Chen W, Zhang R, et al. Chemometrics and Intelligent Laboratory Systems, 2017, 171: 234.