Application of Terahertz Spectroscopy in the Detection of Bioactive Peptides
WANG Pu1, HE Ming-xia1*, LI Meng2, QU Qiu-hong2, LIU Rui3, CHEN Yong-de4
1. State Key Laboratory of Precision Measuring Technology and Instruments, Tianjin University, Tianjin 300072, China
2. LET Terahertz (Tianjin) Technology Co., Ltd., Tianjin 300019, China
3. School of Food Engineering and Biotechnology, Tianjin University of Science and Technology, Tianjin 300222, China
4. Bai Defu Biological Technology Co., Ltd., Tangshan 063000, China
Abstract:Bioactive peptides, as the new darling of human health in the 21st century, have been proved that they have a good effect on human life activities, and their detection methods are also of great concern. Terahertz time-domain spectroscopy technology has incomparable advantage in detecting bioactive peptides because of its unique properties. In this paper, three bioactive peptides, bovine bone peptide, sea cucumber peptide and fish peptide, were used to obtain the absorption coefficient curve of 0.5~2 THz by the transmission terahertz time domain spectroscopy system. From the terahertz absorption coefficient curve, the absorption coefficient of the fish peptide is higher than that of sea cucumber peptide and fish bone peptide. Because of the interaction between the amino acid species of bioactive peptides and peptide bonds, there is no obvious absorption peak in the terahertz frequency band. In order to better detect and distinguish them, a classification discriminant model is established to find the most suitable for such substances. After the S-G smoothing and normalization preprocess performed on the terahertz original absorption coefficient data, two-thirds of the pre-processed data are randomly selected into training sets, and the rest are prediction set. The classification discriminant model is introduced. The model includes two parts: the classifier and the optimal parameter selection. The classifier selects the supervised classification method such as support vector machine, random forest and extreme learning machine, and uses the intelligent optimization algorithm such as genetic algorithm, particle swarm optimization and grid search to select the support vector machine optimal parameters. In order to reduce the original spectral data dimension and improve the computational speed of the model, Principal Component Analysis is used for preprocessing, and the results after dimensionality reduction are imported into the classification model. Considering the factors such as accuracy and running time, although the support vector machine based on particle swarm optimization has the highest accuracy rate of 98.3%, the running time is longer than 180 seconds; the ultimate learning machine can have the shortest running time of 0.2 seconds. However, the accuracy rate is 73.3%. The support vector machine based on grid search has an accuracy rate of 96% and a running time of 11 seconds. It can use a shorter time in the case of higher accuracy, and proves that the support vector machine based on grid search is better for detecting bioactive peptide. The results show that the use of terahertz time-domain spectroscopy combined with machine learning algorithms can achieve rapid and non-destructive detection of bioactive peptides, providing a new idea for the detection of bioactive peptides. It also demonstrates that THz-TDS combined with machine learning is a way better way for the identification of inconspicuous peptides.
王 璞,何明霞,李 萌,曲秋红,刘 锐,陈永德. 太赫兹光谱技术在生物活性肽检测中应用研究[J]. 光谱学与光谱分析, 2020, 40(09): 2696-2701.
WANG Pu, HE Ming-xia, LI Meng, QU Qiu-hong, LIU Rui, CHEN Yong-de. Application of Terahertz Spectroscopy in the Detection of Bioactive Peptides. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(09): 2696-2701.
[1] ZHANG Zhi-hui, SU Xiu-lan(张志慧, 苏秀兰). China Medical Herald(中国医药导报), 2019, 16(10): 37.
[2] ZHOU Ting-yi, GAO Xin-chang, et al(周亭屹, 高新昌,等). Science and Technology of Food Industry(食品工业科技), 2019, (12): 335.
[3] HE Ming-xia,GUO Shuai(何明霞,郭 帅). Journal of Electronic Measurement and Instrument(电子测量与仪器学报),2012,26(8):663.
[4] Kutteruf M R, Brown C M, Iwaki L K, et al. Chemical Physics Letters, 2003, 375(3/4): 337.
[5] LI Li-long(李利龙) . Master Degree Dissertation(硕士学位论文). Changsha University of Science and Technology(长沙理工大学), 2014.
[6] HE Xiao-qun(何晓群). Multivariate Statistical Analysis(多元统计分析). Beijing: China Renmin University Press(北京: 中国人民大学出版社), 2008. 152.
[7] Vapnik V N. The Nature of Statistical Learning Theory. New York: Springer-Verlag, 1995.
[8] Olaru C,Wehenkel L. Fuzzy Sets and Systems, 2003, 138(2): 221.
[9] Breiman L. Machine Learning, 2001, 45(1): 5.
[10] Bendu H, Deepak B B V L, Murugan S. Applied Energy,2017, 187: 601.