Qualitative Analysis of Pesticide Residues on Chinese Cabbage Based on GK Improved Possibilistic C-Means Clustering
TAN Yang1, WU Xiao-hong2, 3*, WU Bin4, SHEN Yan-jun1, LIU Jin-mao1
1. Institute of Talented Engineering Students, Jiangsu University, Zhenjiang 212013, China
2. School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, China
3. High-tech Key Laboratory of Agricultural Equipment and Intelligence of Jiangsu Province, Jiangsu University, Zhenjiang 212013, China
4. Department of Information Engineering, Chuzhou Polytechnic, Chuzhou 239000, China
Abstract:Infrared spectroscopy is a technology used to identify the chemical composition of substances based on molecular vibration and quantum-jump theory. Due to the unique absorbance of different functional groups, the spectral data related to the absorbance and the wavelength (or wavenumber) can be obtained when the infrared beam irradiates the molecular. However, the spectral data from experiments always have high dimensions and overlap, making it difficult to process the data. Thus, this paper proposed an improved Gustafson-Kessel possibilistic c-means clustering (GKIPCM), introducing the Mahalanobis distance from GK clustering and the iterative equations of fuzzy membership values and cluster centers from improved possibilistic c-means clustering (IPCM). GKIPCM makes the data adapt to different mathematical distance measures and avoids identical cluster centers. Furthermore, GKIPCM has higher classification accuracy, which is less sensitive to parameters. In the experiments, four groups of washed Chinese cabbage were the objects of spectral analysis and different concentrations of lambda-cyhalothrin pesticide were sprayed on the Chinese cabbages. Spectral data of Chinese cabbages were collected with Agilent Cary 630 FTIR spectrometer. Firstly, multiplicative scatter correction (MSC) was applied to reduce the noise and eliminate data offset when pre-processing the data. Secondly, principal component analysis (PCA) was utilized to reduce dimensions due to the wide wavenumber range (4 300~590 cm-1) and the high data dimensions (971). After conducting PAC, the dimensionality of data was reduced to 23, and the total contribution of 23 principal components reached 99.60%. Nonetheless, the feature information was still mixed. So the linear discriminant analysis (LDA) was used to extract features of the spectral data, and the LDA algorithm reduced the dimensionality of the spectral data to 3. Finally, the fuzzy c-means clustering (FCM) was employed to obtain the optimal initial cluster centers. Then, the GKIPCM algorithm was applied to cluster four different groups of spectral data. Comparisons were made among the clustering results of GKIPCM, GK and IPCM. The running time and accuracy of GKIPCM were 0.218 8 seconds and 97.22%, and those of GK and IPCM were 0.093 8 seconds and 63.89%, 0.062 5 seconds and 91.67%, respectively. According to the results of the experiments, the GKIPCM algorithm finished the qualitative analysis of different concentrations of pesticide residues by analyzing the spectral data.
Key words:Chinese cabbage; Pesticide residues; Infrared spectroscopy; Principal component analysis; Linear discriminant analysis; Fuzzy clustering
谭 阳,武小红,武 斌,沈砚君,刘锦茂. GK可能C均值模糊聚类的白菜红外光谱分析[J]. 光谱学与光谱分析, 2022, 42(05): 1465-1470.
TAN Yang, WU Xiao-hong, WU Bin, SHEN Yan-jun, LIU Jin-mao. Qualitative Analysis of Pesticide Residues on Chinese Cabbage Based on GK Improved Possibilistic C-Means Clustering. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(05): 1465-1470.
[1] Azam S M R, Ma H l, Xu B G, et al. Trends in Food Science and Technology, 2020, 97: 417.
[2] Choudhary S, Raheja N, Kumar S, et al. Journal of Entomology and Zoology Studies, 2018, 6(3): 330.
[3] Alenyorege E A, Ma H l, Ayim I, et al. Journal of Food Safety and Food Quality, 2018, 69(3): 80.
[4] Jiang S Y, Sun J, Zhou X, et al. Journal of Food Process Engineering, 2017, 40(4): e12510.
[5] Zhou X, Sun J, Lu B, et al. Journal of Food Process Engineering, 2018, 41(8): e12903.
[6] Sun J, Zhou X, Mao H P, et al. Journal of Food Process Engineering, 2016, 40(4): e12509.
[7] Wu M M, Sun J, Lu B, et al. Journal of Food Process Engineering, 2019, 42(3): e13005.
[8] Sun J, Cong S, Mao H P, et al. Journal of Food Process Engineering, 2018, 41(2): e12654.
[9] Sinaga K P, Yang M S. IEEE Access, 2020, 8(1): 80716.
[10] Moradi Fard M, Thonet T, Gaussier E. Pattern Recognition Letters, 2020, 138: 185.
[11] Yuan Z, Chen H M, Li T R, et al. Information Sciences, 2021, 572: 67.
[12] Zhang L L, Luo M N, Liu J, et al. Pattern Recognition Letters, 2020, 130: 275.
[13] Yu H Y, Fan J L, Lan R. Applied Soft Computing Journal, 2019, 80: 845.
[14] Zhou H X, Fu H J, Wu X H, et al. Journal of Food Processing and Preservation, 2020, 44(10): e14795.
[15] Chaomurilige, Yu J, Yang M S. Information Sciences, 2017, 417: 435.
[16] Wu X H, Zhou J, Wu B, et al. Journal of Food Process Engineering, 2019, 42(8): e13298.