Abstract:Fast detecting and eliminating the outliers is of great significance to improve the reliability of the near-infrared(NIR) spectroscopy analysis. In this paper, the principle of outlier determination method based on orthogonal distance and robust principal component analysis was introduced firstly with the analysis of its limitations. Then an outlier determination method based on the simplified orthogonal distance was proposed, where the spectra of the samples with high concentration were employed to estimate the first robust principal component directly and the statistical parameters of the orthogonal distance were obtained with repeated measurements to detect outliers. Finally, the outliers caused by the temperature fluctuations in the NIR transmission spectra of glucose aqueous solutions and 2% Intralipid solutions, were determined by these two methods. Results showed that, for the orthogonal distance combined with robust principal component analysis method, all the outliers induced by temperature variations could be correctly determined under the collapse value of 40%, while the false negative rates for the glucose aqueous solutions and Intralipid solutions under the collapse value of 25% were 54.5% and 72.7%, respectively. Besides, all the outliers induced by temperature variations also could be recognized with the method based on the simplified orthogonal distance, which saves the need for collapse value and shortens the tine for measurement. Therefore, the outlier determination method based on the simplified orthogonal distance is more practical than the robust principal component analysis.
Key words:Near infrared; Outlier; Orthogonal distance; Robust principal component; Collapse value
孟丹蕊,傅 博,徐可欣,刘 蓉. 一种基于简化正交距离的近红外异常光谱判断方法[J]. 光谱学与光谱分析, 2018, 38(04): 1053-1058.
MENG Dan-rui, FU Bo, XU Ke-xin, LIU Rong. An Outlier Determination Method for Near-Infrared Spectroscopy Based on the Simplified Orthogonal Distance. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2018, 38(04): 1053-1058.
[1] TIAN Xiang, LIU Si-chen, WANG Hai-gang, et al(田 翔,刘思辰,王海岗,等). Food Science(食品科学), 2017,38(16): 1.
[2] Mclauchlin A R, Ghita O, Gahkani A. Polymer Testing, 2014, 38(18): 46.
[3] Yadav J, Rani A, Singh V, et al. Biomedical Signal Processing & Control, 2015, 18: 214.
[4] CHU Xiao-li(褚小立). Molecular Spectroscopy Analytical Technology Combined with Chemometrics and Its Applications(化学计量学方法与分子光谱分析技术). Beijing: Chemical Industry Press(北京: 化学工业出版社), 2011. 89.
[5] Li W, Qu H. Chemometrics & Intelligent Laboratory Systems, 2016, 152: 140.
[6] Cárdenas V, Cordobés M, Blanco M, et al. Journal of Pharmaceutical & Biomedical Analysis, 2015, 114: 28.
[7] Shen W, Kong Q, Wang J, et al. Mathematical Problems in Engineering, 2015, 2015(5): 1.
[8] HAO Jian-ming, LI Zong-nan, XIE Jing(郝建明, 李宗南, 谢 静). Journal of Huazhong Agricultural University(华中农业大学学报), 2014, 33(5): 135.
[9] YU Fan, LI Ji-xin(于 帆,李纪鑫). Journal of Xi’an Technological University(西安工业大学学报), 2014, 34(1): 38.
[10] Li Z, Xu G, Wang J, et al. Chinese Journal of Analytical Chemistry, 2016, 44(2): 305.
[11] Engel J, Blanchet L, Buydens L M C, et al. Talanta, 2012, 99: 426.
[12] ZHANG Li-zhuo(张立卓). College Mathematics(大学数学), 2014,30(2): 94.
[13] Hubert M, Rousseeuw P J, Branden K V. Technometrics, 2010, 47(1): 64.