| 
					
						| 
								
									|  |  |  |   |  |  
    					|  |  
    					| Discrimination of Chuzhou Chrysanthemum Tea Grades Using Noise 
Discriminant C-Means Clustering |  
						| WU Bin1, XIE Chen-ao2, CHEN Yong2, WU Xiao-hong2, JIA Hong-wen1 |  
						| 1. School of Information Engineering, Chuzhou Polytechnic, Chuzhou 239000, China 2. School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, China
 
 |  
						|  |  
					
						| 
								
									| 
											
                        					 
												
													
													    |  |  
														| 
													
													    | Abstract  Near-infrared (NIR) spectroscopy detection technology can reflect the measured sample's organic chemical composition and structural information by detecting the spectral features in the NIR region. During the material composition analysis, NIR spectroscopy often involves a significant amount of wavelength data, resulting in relatively high data dimensions. Furthermore, spectra are susceptible to phenomena such as overlap and redundancy, which impact the model's performance. Therefore, we proposed a noise discriminant C-means clustering (NDCM) algorithm that combined fast generalized noise clustering (FGNC) and fuzzy linear discrimination analysis (FLDA). NDCM can realize the extraction of data identification information and data space compression in the fuzzy clustering process, which can achieve higher clustering accuracy. The fuzzy membership degree and the cluster centers obtained by fuzzy C-means clustering (FCM) on the near-infrared spectral data of Chuzhou chrysanthemum tea are used as the initial fuzzy membership degree and initial clustering centers of NDCM, respectively, so that NDCM has the advantages of fast clustering speed and high accuracy. The FCM algorithm is sensitive to noisy data, while the NDCM algorithm can perform better when dealing with noisy data in spectra. In this study, 240 samples of Chuzhou chrysanthemum tea with three quality grades, namely special grade, first grade and second grade, were selected as experimental samples. A portable NIR spectrometer (NIR-M-F1-C) was used to collect the NIR spectra of Chuzhou chrysanthemum tea, and they are the 400-dimensional data. At first, the NIR spectra were pretreated with Savitzky-Golay filtering and multivariate scattering correction (MSC) to reduce spectral scattering and noise. Secondly, the dimensionality of the spectral data was reduced by principal component analysis (PCA), and the dimensionality of the data after PCA reduction was 6. Next, linear discriminant analysis (LDA) was applied to extract the discriminant information in the spectral data of Chuzhou Chrysanthemum tea and further transform the data space dimension into 2 dimensions. Finally, three algorithms, i.e. FCM, FGNC and NDCM, were utilized to perform cluster analysis on the processed data to accurately classify chrysanthemum tea. The experimental results exhibited that when the weight index m=2.5, the clustering accuracy rates of FCM, FGNC and NDCM were 92.42%, 98.48%, and 100%, respectively. The clustering time of NDCM was slightly longer compared to FGNC. FCM had 27 iterations to reach convergence, while FGNC and NDCM took 13 and 10 times, respectively. NIR spectroscopy combined with MSC, Savitzky-Golay filtering, PCA, LDA and NDCM can provide a clustering model to accurately identify Chuzhou chrysanthemum tea quality. |  
															| Received: 2023-04-02    
						    						    							Accepted: 2023-11-06 |  
															|  |  |  
													
														  
															| [1] Sharma N, Kumar M, Kumari N, et al. Heliyon, 2023: e20232. [2] Lopez-Hortas L, Rodriguez P, Diaz-Reinoso B, et al. The Journal of Supercritical Fluids, 2022, 188: 105652.
 [3] Wu N, Zhang C, Bai X, et al. Molecules, 2018, 23(11): 2831.
 [4] Liu C, Lu W, Gao B, et al. Food Science & Nutrition, 2020, 8(4): 1968.
 [5] Zhang T, Wu X, Wu B, et al. Journal of Food Process Engineering, 2022, 45(8): e14040.
 [6] Luo W, Tian P, Fan G, et al. Infrared Physics & Technology, 2022, 123: 104037.
 [7] Aouadi B, Zaukuu J L Z, Vitális F, et al. Sensors, 2020, 20(19): 5479.
 [8] Jia J, Zhou X, Li Y, et al. LWT, 2022, 164: 113625.
 [9] WU Bin,FU Hai-jun,WU Xiao-hong,et al(武 斌, 傅海军, 武小红, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2020, 40(2): 512.
 [10] Jia J, Zhou X, Li Y, et al. LWT, 2022, 164: 113625.
 [11] Wu X, Zhu J, Wu B, et al. Computers and Electronics in Agriculture, 2018, 147: 64.
 [12] He J, Chen L, Chu B, et al. Molecules, 2018, 23(9): 2395.
 [13] Wu N, Zhang C, Bai X, et al. Molecules, 2018, 23(11): 2831.
 [14] Davé R N. Pattern Recognition Letters, 1991, 12(11): 657.
 [15] Davé R N, Sen S. Generalized Noise Clustering as a Robust Fuzzy c-M-Estimators Model. in: Proceedings Conference of the North American Fuzzy Information Proceesing Society, Pensacola Beach, F L, 1998: 256.
 [16] WU Bin,WU Xiao-hong,JIA Hong-wen,et al(武 斌, 武小红, 贾红雯,等). Computer Engineering and Applications(计算机工程与应用), 2013, 49(13): 145.
 [17] Barton S, Alakkari S, O'Dwyer K, et al. Sensors, 2021, 21(14): 4623.
 [18] Pokhrel D R, Sirisomboon P, Khurnpoon L, et al. Sensors, 2023, 23(11): 5327.
 
 |  
													
														
															| 
																																																																																						
																				
																					| [1] | WANG Shu-tao1, WAN Jin-cong1*, LIU Shi-yu2, ZHANG Jin-qing1, WANG Yu-tian1. Qualitative Modeling Method of Mango Species in Near Infrared Based on Attention Mechanism Residual Neural Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(08): 2262-2267. |  
																					| [2] | HU Cai-ping1*, FU Zhao-min2*, XU Hong-jia2, WU Bin3, SUN Jun4. Discrimination of Lettuce Storage Time Based on Near-Infrared Spectroscopy Combined With Fuzzy Uncorrelated QR Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(08): 2268-2272. |  
																					| [3] | XIAO Nan1, LI Han-lin1, WENG Ding-kang1, HU Dong1, SUN Tong1*, XIONG Yong-sen2. Rapid Identification of Apple Moldy Core Disease by Near Infrared 
Spectroscopy With Information Fusion of Different Illumination 
Patterns[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(08): 2388-2394. |  
																					| [4] | LI Zhen-yu1, ZHAO Peng1, 2*, WANG Cheng-kun3. Tree Class Recognition in Open Set Based on an Improved Fuzzy 
Reasoning Classifier[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 1868-1876. |  
																					| [5] | XIAO Huai-chun1, LIU Yang1, WEI Bing-xue1, GAO Jia-rong1, LIU Yan-de2, XIAO Hui1. Identification of Visible and Short Wave Near Infrared Spectra of 
Super-Enriched Plants in Uranium Ore Area[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 1813-1819. |  
																					| [6] | HUANG Hua1, LIU Ya2, MA Yi-hang1, XIANG Si-han1, HE Jia-ning1, WANG Shi-ting1, GUO Jun-xian3*. Prediction of Soluble Solid Contents in Apples Using Vis-NIRS and 
Functional Linear Regression Model[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 1905-1912. |  
																					| [7] | CUI Hao-fan1, LIU Hong-zhi1, GUO Qin1*, GU Feng-ying1, ZHANG Yu2, WANG Qiang1*. Establishment of High-Throughput Model of Peanut Protein Components and Subunits by Near-Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 1982-1987. |  
																					| [8] | YANG Sen1, WANG Zhen-min1*, SONG Wen-long1, XING Jian1, DAI Jing-min2. Optimization of Polished Rice Varieties Discrimination Based on 
Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 1988-1992. |  
																					| [9] | NIU Xiao-ying1, 2, 3, MU Xiao-qing1, 2, 3, SUN Jie1, 2, 3, ZHAO Zhi-lei1, 2, 3*, ZHANG Chun-jiang4. Qualitative and Quantitative Analyses of Cooked Donkey Meat 
Adulteration Based on NIR Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 1993-2001. |  
																					| [10] | NI Jin1, SUO Li-min1*, LIU Hai-long1, ZHAO Rui2. Identification of Corn Varieties Based on Northern Goshawk Optimization Kernel Based Extreme Learning Machine[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(06): 1584-1590. |  
																					| [11] | YU Shui1, HUAN Ke-wei1*, LIU Xiao-xi2, WANG Lei1. Quantitative Analysis Modeling of Near Infrared Spectroscopy With 
Parallel Convolution Neural Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(06): 1627-1635. |  
																					| [12] | WEI Zi-chao1, 2, LU Miao1, 2, LEI Wen-ye1, 2, WANG Hao-yu1, 2, WEI Zi-yuan1, 2, GAO Pan1, 2, WANG Dong1, 2, CHEN Xu1, 2*, HU Jin1, 2*. A Nondestructive Method Combined Chlorophyll Fluorescence With Visible-NIR Spectroscopy for Detecting the Severity of Heat Stress on Tomato Seedlings[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(06): 1613-1619. |  
																					| [13] | LIU Zhen-fang, HUANG Min*, ZHU Qi-bing, ZHAO Xin, YAN Sheng-qi. Spatially Offset Raman Spectroscopy Analysis Technology and Application in Food Subsurface Detection[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(05): 1201-1208. |  
																					| [14] | GE Qing, LIU Jin*, HAN Tong-shuai, LIU Wen-bo, LIU Rong, XU Ke-xin. Influence of Medium's Optical Properties on Glucose Detection 
Sensitivity in Tissue Phantoms[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(05): 1262-1268. |  
																					| [15] | LIU Yu-ming1, 2, 3, WANG Qiao-hua1, 2, 3*, CHEN Yuan-zhe1, LIU Cheng-kang1, FAN Wei1, ZHU Zhi-hui1, LIU Shi-wei1. Non-Destructive Near-Infrared Spectroscopy of Physical and Chemical 
Indicator of Pork Meat[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(05): 1346-1353. |  |  
											 
											 |  |  |