Abstract:β-lactam antibiotics (BAs) are an important class of anti-infective drugs in clinical practice. Penicillin G acylase (PGA) is a key technology used in the new enzymatic process to produce of BAs. PGAs derived from different bacterial origins have different protein sequence structures, thermal stability, and stereo-selectivity, which cause different catalytic activity and are crucial for antibiotic synthesis and production. Infrared spectroscopy (IR) can be used to characterize the structure of high molecular weight proteins. Proteomics-based mass spectrometry can identify different protein substances at the peptide level, but its complexity makes it harder to operate. The simple IR method provides a powerful analytical tool for rapidly characterizing PGAs. This article explored the selection of ultrafiltration and drying membrane preparation methods for the pre-treatment of PGA samples. This way could purify PGA raw solutions and remove matrix interference, while it could also overcome the problem of low PGA solution concentration to enhance IR signal response. Besides, a high-through put IR method was optimized and established to analyze 11 batches of PGAs from different sources. All IR spectra of PGAs showed classical IR absorption peaks of amide groups at the characteristic region (1 700~1 500 cm-1). There were still differential IR absorption peaks within the 1 200~750 cm-1 fingerprint region. A traceability model was established by selecting differentiated absorption peak spectral bands at fingerprint regions (830~795,1 027~1 020,1 085~1 080 cm-1). Based on the analysis of proteomics mass spectrometry, a weighted k-nearest neighbor (KNN) algorithm was employed to analyze different PGAs. It showed that 11 batches of PGAs were divided into two classes: those including PGA 1, PGA 3, PGA 7, and PGA 8 belonged to class Ⅰ and were identified as the proteins fermented from E. coli, while the rest of those—PGA 2, PGA 4~6, and PGA 9~11 belonged to class Ⅱ and were produced from Achromobacter sp. CCM 4824. This result verified the applicability and feasibility of the established traceability model, which was consistent with the proteomics result. Then, three batches of unknown PGAs were collected for determination by the traceability model to externally validate the accuracy. Finally, the robustness of the established model was further validated by examining 11 batches of PGAs on different days. The results demonstrate that the high-throughput IR method based on the weighted KNN could rapidly trace PGAs from different bacterial origins. This method is simple, accurate, and durable. It provides a new detection tool for the structural characterization and protein classification of the catalytic enzymes used in producing BAs by enzymatic process.
Key words:Infrared spectra;Penicillin G acylase;Bacterial classification
王 琰,张培培,赵 瑜. 基于加权KNN的高通量红外光谱法溯源分析青霉素G酰基转移酶菌属分类[J]. 光谱学与光谱分析, 2025, 45(04): 1008-1014.
WANG Yan, ZHANG Pei-pei, ZHAO Yu. Traceability Analysis of Penicillin G Acylase Genus Classification Using High-Throughput Infrared Spectroscopy Based on the Weighted KNN. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(04): 1008-1014.
[1] Lima L M,Silva B N M D,Barbosa G,et al. European Journal of Medicinal Chemistry,2020,208:112829.
[2] Luis C P,Raúl R H,Juan C C,et al. Current Pharmaceutical Biotechnology,2020,21(4):287.
[3] Pan X,Li A,Peng Z Y,et al. Bioorganic Chemistry,2020,99:103765.
[4] Fan Y X,Li Y B,Liu Q F. Biotechnology and Applied Biochemistry,2021,68(1):136.
[5] Srirangan K,Orr V,Akawi L,et al. Biotechnology Advances,2013,31(8):1319.
[6] Marešová H,Placková M,Grulich M,et al. Applied Microbiology and Biotechnology,2014,98(7):2867.
[7] Marešová H,Palyzová A,Placková M,et al. Folia Microbiologica,2017,62(5):417.
[8] Bečk S,Štepánek V,Vyasarayani R W,et al. Applied Microbiology and Biotechnology,2014,98(3):1195.
[9] Mayer J,Pippel J,Günther G,et al. Applied Microbiology and Biotechnology,2019,103(18):7537.
[10] Mörtstedt H,Makower Å,Edlund P O,et al. Journal of Pharmaceutical and Biomedical Analysis,2020,185(18):113256.
[11] Wiöniewski J R,Wegler C,Artursson P. Journal of Proteome Research,2019,18(1):217.
[12] Long Z,Wei C,Dong X F,et al. Journal of Chromatography B,2021,1181:122884.
[13] XU Yang-biao,WANG Hai-shui(徐杨彪,王海水). Spectroscopy and Spectral Analysis(光谱学与光谱分析),2022,42(2):642.
[14] Wisniewski J R,Wegler C,Artursson P. Journal of Proteome Research,2016,36(10):81.
[15] Harshitha C G,Sharma N,Singh R,et al. Journal of Food Science and Technology-Mysore,2023,60(1):64.
[16] León E S,Coat R,Moutel B,et al. Bioprocess and Biosystems Engineering,2014,37(11):2371.
[17] Wang Y,Hu X Y,Long Z,et al. Journal of Chromatography A,2022,1678:463365.
[18] Ye S,Wang D,Min S. Chemometrics and Intelligent Laboratory Systems,2008,91(2):194.
[19] Shubham P,Vivek S,Garima A. International Journal of Engineering and Computer Science,2019,8(11):24869.
[20] Kutsanedzie F Y H,Chen Q S,Hassan M M,et al. Food Chemistry, 2018, 240: 231.