Study on the Origin Identification and Saponins Content Prediction of Panax notoginseng by FTIR Combined with Chemometrics
LI Yun1, 2, 3, XU Fu-rong1, ZHANG Jin-yu1, 2, 3, WANG Yuan-zhong2, 3*
1. College of Traditional Chinese Medicine, Yunnan University of Traditional Chinese Medicine, Kunming 650500, China
2. Institute of Medicinal Plants, Yunnan Academy of Agricultural Sciences, Kunming 650200, China
3. Yunnan Technical Center for Quality of Chinese Materia Medica, Kunming 650200, China
Abstract:Different origins have significant impact on the secondary metabolites of traditional Chinese medicine (TCM), so identification of origins is helpful for scientific and rational utilization of TCM. Additionally, the detection of active ingredient content is the main way to evaluate the quality of TCM. In this study, we established a rapidly method to identify the origins and detect active ingredient content of Panax notoginseng in order to provide some research bases for scientific, rational and specific utilization and rapid quality assessment of P. notoginseng. A total of 117 Fourier transform infrared (FTIR) spectra of P. notoginseng originated from five regions were collected. The discrete wavelet transform was used to process the original spectra in order to remove part of the high-frequency signal caused by noise while partial least squares discriminant analysis (PLS-DA) was used to screen the data with the contribution rate greater than one. Moreover, 70% of the 117 individuals were selected to form the training set by using Kennard-stone algorithm as well as the other 30% were used as prediction set. Training set data were used to establish the discriminant model of support vector machine and the cross-validation method was used for screening optimal parameters as well as the prediction data were utilized to verify the results of discriminant model. The pre-processing data to predict saponins content were processed by standard normal variable transform and discrete wavelet transform. Processed date of FTIR spectra were set as variable X and the total contents of four kinds of saponins in P. notoginseng samples measured by high performance liquid chromatography (HPLC) were set as variable Y. The orthogonal signal correction was used to remove the unrelated data of FTIR to saponins content of P. notoginseng samples. 80% of the individual data were selected as training set and the other 20% were utilized to form the prediction set. The partial least squares regression model was established by training set and the prediction set was utilized to verify the results of the model. The results showed that, (1) The optimal parameters c and g of support vector machine calculated by cross-validation was 2.828 43 and 0.062 5 respectively and the optimal accuracy of training set was 91.463 4%. (2) The support vector machine model was set as the optimal parameter and the accuracy of prediction set was 94.285 7% which showed a high accuracy. (3) The correlation coefficient (R2) and the root mean square error of estimation (RMSEE) of partial least squares regression model established by training set was 0.941 8 and 4.530 7, respectively. (4) The R2 and the root mean square error of prediction (RMSEP) of partial least squares regression model verified by prediction set was 0.962 3 and 3.855 9 respectively showing predictive value of saponins content was close to the value detected by HPLC. FTIR combined with support vector machine could effectively identify different origins of P. notoginseng. Orthogonal single collection and partial least squares regression could accurately predict the value of total four saponins content of P. notoginseng. It could provide a simple, rapid, non-destructive, high sensitive detection method for the quality control of P. notoginseng.
李 运,徐福荣,张金渝,王元忠. FTIR结合化学计量学对三七产地鉴别及皂苷含量预测研究[J]. 光谱学与光谱分析, 2017, 37(08): 2418-2423.
LI Yun, XU Fu-rong, ZHANG Jin-yu, WANG Yuan-zhong. Study on the Origin Identification and Saponins Content Prediction of Panax notoginseng by FTIR Combined with Chemometrics. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2017, 37(08): 2418-2423.
[1] Zhao Z, Guo P, Brand E. Journal of Ethnopharmacology, 2012, 140(3): 476.
[2] Li W F, Jiang J G, Chen J. Archives of Medical Research, 2008, 39(2): 246.
[3] Chinese Pharmacopoeia Commission(中华人民共和国药典委员会). Pharmacopoeia of the People’s Republic of China Part One (中华人民共和国药典一部). Beijing: China Medical Science Press(北京: 中国医药科技出版社), 2010.
[4] Ng T B. Journal of Pharmacy and Pharmacology, 2006, 58(8): 1007.
[5] Zhao G R, Xiang Z J, Ye T X, et al. Food Chemistry, 2006, 99(4): 767.
[6] Fan J S, Liu D N, Huang G, et al. Journal of Ethnopharmacology, 2012, 142(3): 732.
[7] Dai S, Xu B, Luo G, et al. Open Chemical Engineering Journal, 2015, 9: 47.
[8] Uríková V, Sádecká J. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2015, 148: 131.
[9] Zhao Y L, Zhang J, Jin H, et al. Journal of AOAC International, 2015, 98(1): 22.
[10] Zhao Y L, Zhang J, Yuan T J, et al. PloS One, 2014, 9(2): e89100.
[11] Li Y M, Sun S Q, Zhou Q, et al. Vibrational Spectroscopy, 2004, 36(2): 227.
[12] Zhu X, Li G, Shan Y. International Journal of Food Science and Technology, 2015, 50(5): 1123.
[13] Li X L, He Y, Wu C Q, et al. Journal of Food Engineering, 2007, 82(3): 316.
[14] Uncu O, Ozen B. LWT-Food Science and Technology, 2015, 63(2): 978.
[15] Luo Q, Yun Y, Fan W, et al. RSC Advances, 2015, 5(7): 5046.
[16] Zhao N, Wu Z, Zhang Q, et al. Scientific Reports, 2015, 5: 11647.
[17] Lee J, Chang K, Jun C H, et al. Chemometrics and Intelligent Laboratory Systems, 2015, 147: 139.
[18] Suykens J A K, Vandewalle J. Neural Processing Letters, 1999, 9(3): 293.
[19] MATLAB Chinese BBS(MATLAB中文论坛). MATLAB 30 Case Analysis of Neural Network(MATLAB神经网络30个案例分析). Beijing: Beihang University Press(北京: 北京航空航天大学出版社), 2010.
[20] Chang C C, Lin C J. ACM Transactions on Intelligent Systems and Technology (TIST), 2011, 2(3): 27.
[21] Wold S, Antti H, Lindgren F, et al. Chemometrics and Intelligent Laboratory Systems, 1998, 44(1): 175.