Transfer Learning Modeling of 2,6-Dimethylphenol Purity Based on PLS Subspace Alignment
WU Yun-fei, LUAN Xiao-li*, LIU Fei
Key Laboratory of Advanced Process Control for Light Industry of the Ministry of Education, Institute of Automation, Jiangnan University, Wuxi 214122, China
Abstract:The highly accurate on-line detection of solute concentration by using near-infrared spectroscopy analysis technology is of great significance for production optimization. Establishing a detection model needs to extract relevant information from the near-infrared spectrum, more representative samples will extract more effective information, and the model will also be more accurate. However, the purity of products has been continuously improved, and sample discrimination is reduced. The coefficient of variation of the sample is small, which leads to the diversity of samples being insufficient. Moreover,there are measurement errors when measuring the sample concentration in the laboratory, which will lead to the lack of correlation between the solute concentration and the spectrum. It is hard to establish a reliable and highly accurate near-infrared detection model. A new transfer learning modeling method based on PLS subspace alignment is proposed and applied in the near-infrared on-line detection of 2,6-dimethylphenol high purity in the product tower of the distillation purification process. In preparing the chemical monomer 2,6-dimethylphenol, there are side reactions and unreacted impurities, the materials after reaction must flow through different rectifying towers in sequence, and finally, the product with purity higher than 99% is obtained in the product tower. The product quality inspection of the product tower is particularly important. Due to the lack of diversity of the detection data of product tower detection point, the generalization ability of the detection model is weak. We create subspaces for the data sets of different detection points in the separation and purification of 2,6-dimethylphenol using the partial least squares method. Then, a mapping that aligns the other tower subspace into the product tower is learned by minimizing a Bregman matrix divergence function, reducing the feature distribution discrepancy between other towers and product towers. It avoids information loss in product tower data when projecting to a common subspace and makes full use of the feature information in other towers. The partial least squares regression modeling is completed on the transferred subspace and the final model coefficients are determined by the winner-takes-all-based weighting method. After the above method, the product tower detection model’s performance has improved. Finally, the effectiveness of this method is verified with the near-infrared detection data set of 2,6-dimethylphenol. We discuss the influence of transferring different amounts of data from other detection points on the performance of the product tower detection model. The performance improvement of the product tower detection model is improved by 52.19% in the best case, and the root means square error of prediction (RMSEP) dipped from 0.059 4 to 0.028 4. Compared with the traditional modeling methods such as support vector machine regression (SVR) and Back-Propagation (BP) neural network, it has better performance.
[1] WANG Jin-jin, LI Xiu-xi(王津津, 李秀喜). Chemical Industry and Engineering Progress(化工进展), 2011, 30(10): 2163.
[2] Pasquini C. Analytica Chimica Acta, 2018, 1026: 8.
[3] Wang K, He K, Du W, et al. Chemical Engineering Science, 2021, 242: 116672.
[4] Lian X, Zhang M, Sun X, et al. Polymer Testing, 2021, 93: 106584.
[5] Cogoni G, Liu Y A, Husain A, et al. International Journal of Pharmaceutics, 2021, 602: 120620.
[6] Hebbi V, Thakur G, Rathore A S. Journal of Biotechnology, 2021, 325: 303.
[7] Loudiyi M, Temiz H T, Sahar A, et al. Critical Reviews in Food Science and Nutrition, 2022,62(11): 3063.
[8] Yu Y, Huang J, Zhu J, et al. IEEE Sensors Journal, 2020, 21(3): 3506.
[9] Chen T, Martin E. Analytica Chimica Acta, 2009, 631(1): 13.
[10] Bian X H, Li S J, Fan M R, et al. Analytical Methods, 2016, 8(23): 4674.
[11] Yang J, Xu J, Zhang X, et al. Analytica Chimica Acta, 2019, 1081: 6.
[12] Pan S J, Yang Q. IEEE Transactions on Knowledge and Data Engineering, 2009, 22(10): 1345.
[13] Liu J, Ren Y. IEEE Transactions on Industrial Informatics, 2021, 17(9): 6073.
[14] Wang Y, Wang K, Zhou Z, et al. Frontiers of Chemical Science and Engineering, 2019, 13(3): 599.
[15] CHU Fei, PENG Chuang, JIA Run-da, et al(褚 菲, 彭 闯, 贾润达, 等). CIESC Journal(化工学报), 2021, 72(4): 2178.
[16] Shan P, Zhao Y, Wang Q, et al. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2019, 215: 97.