光谱学与光谱分析 |
|
|
|
|
|
A Novel Metabolomic Data Scaling Method Based on K-L Divergence |
DENG Ling-li1, 2, Cheng Kian-Kai3, SHEN Gui-ping1, ZHOU Ling1, LIU Xin-zhuo1, DONG Ji-yang1*, CHEN Zhong1 |
1. Department of Electronic Science, Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, Xiamen 361005, China 2. Department of Communication Engineering, Xiamen University, Xiamen 361005, China 3. Department of Bioprocess Engineering & Institute of Bioproduct Development, Universiti Teknologi Malaysia, Skudai 81310, Malaysia |
|
|
Abstract A new scaling method in the current study based on Kullback-Leibler (K-L) divergence is proposed for NMR metabolomic data. The proposed method (called K-L scaling) is a supervised scaling method as group information is incorporated in the scaling procedure. Notably, K-L divergence measures the difference between two different datasets by their probability distributions, it can be used for the analysis of data that either follows Gaussian or non-Gaussian distributions. In K-L scaling, all variables were first standardized to unit variance, then their variance was adjusted using Kullback-Leibler divergence to highlight the significant variables. K-L scaling can tell effectively the difference in spectral data points between two experimental groups, and then enhances the weights of biological-relevant variables, and at the same time reduces the weight of noise and uninformative variables. The developed method was applied to a 1H-NMR metabolomic dataset acquired from human urine. Analysis results of the dataset showed that this new scaling method is efficient in suppressing the contribution of noise in the resulting multivariate model. In addition, it can increase the weights of important variables, and improve the interpretability and predictability of subsequent principal component regression (PCR) and partial least squares discriminant analysis (PLS-DA). Furthermore, the scaling method facilitated the identification of metabolic signatures. The current result suggested that the developed K-L scaling method may become a useful alternative for the preprocessing of NMR-based metabolomic data.
|
Received: 2013-09-26
Accepted: 2014-02-15
|
|
Corresponding Authors:
DONG Ji-yang
E-mail: jydong@xmu.edu.cn
|
|
[1] Katajamaa M, Oreic M. Journal of Chromatography A, 2007, 1158(1-2): 318. [2] Van den Berg R A, Hoefsloot H C, Westerhuis J A, et al. BMC Genomics, 2006, 7(1): 142. [3] Jackson J E. A User’s Guide to Principal Components. Hoboken, NJ: Wiley-Interscience, 2003. [4] Eriksson L, Antti H, Gottfries J, et al. Analytical and Bioanalytical Chemistry, 2004, 380(3): 419. [5] Keun H C, Ebbels T M D, Antti H, et al. Analytica Chimica Acta, 2003, 490(1-2): 265. [6] Dong J Y, Li W, Deng L L, et al. Chemical Journal of Chinese Universities-Chinese, 2011, 32(2): 262. [7] Rached Z, Alajaji F, Campbell L L. IEEE Transactions on Information Theory, 2004, 50(5): 917. [8] Xu J J, Yang S Y, Cai S H, et al. Analytical and Bioanalytical Chemistry, 2010, 396(4): 1451. [9] DONG Ji-yang, ZHOU Ling, DENG Ling-li(董继扬, 周 玲, 邓伶莉). Chinese Patent (中国专利),2013SR060215, 2013. [10] Dong J Y, Cheng K K, Xu J J, et al. Chemometrics and Intelligent Laboratory Systems, 2011, 108(2): 123. [11] Heyman H M, Meyer J J M. South African Journal of Botany, 2012, 82(0): 21. [12] Craig A, Cloarec O, Holmes E, et al. Analytical Chemistry, 2006, 78(7): 2262. |
[1] |
BAI Feng-yuan1, 2, ZHAO Dong-mei1, 2, CAI Ren-jun1, 2, SONG Su-fei1, 2, LIU Tao1, 2*, XU Qiu-ling1, 2*. Serum Metabolomics Study of CCI Model Rats Based on UHPLC-QE-MS[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42(08): 2528-2531. |
[2] |
HU Yin, WANG Min-chang, PAN Qing, NING Yan-li, KANG Ying, WANG Ming, LUAN Jie-yu, CHEN Zhi-qun. Spectroscopic Analysis of Endo and Exo-Tetrahydrodicyclopentadiene[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40(10): 3161-3166. |
[3] |
ZHONG Jun, JIANG Xue-mei* . Recent Progress in Nuclear Magnetic Resonance Spectrum for Drug Research and Development[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2015, 35(01): 282-286. |
[4] |
GAO Xiu-xiang1,XU Yi-zhuang1*,ZHAO Mei-xian2,QI Jian1,LI Hui-zhen1,WU Jin-guang1. Progress in Nuclear Magnetic Resonance Spectroscopy for Early Cancer Diagnosis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2008, 28(08): 1942-1950. |
[5] |
YAN Jian-hui1,YAO Guo-wei1,YANG Li-na1,HU Gao-fei1,2,YANG Xin-lin1. Novel Synthesis and Spectral Characterization of Nelfinavir[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2008, 28(01): 98-102. |
[6] |
YANG Xue-mei1,FU En-qin2*. Synthesis and Spectral Characterization of Novel Chiral Macrocyclic Dioxopolyamines[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2007, 27(07): 1403-1407. |
[7] |
ZHOU Wei-qiang1,2,XU Jing-kun2,WANG Jing-wu1*,PU Shou-zhi2. Spectral Analysis of the Effect of Different Polymerization Potential on Electrosynthesized Poly (5-Nitroindole) Films [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2007, 27(06): 1106-1109. |
[8] |
GUO Xiao-he1,CHANG Jun-biao1,2,CHENG Sen-xiang3,WANG Qiang3 . Study on Spectroscopy of 3-Substituted Phenyl-5-(3′-Indolyl)-Isoxazoline Derivatives[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2007, 27(01): 70-73. |
[9] |
XU Chun-yan, HUANG Ming-zhi*, XUE Chuan-xin. Synthesis and Spectral Characterization of 10-Hydroxycamptothecin[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2005, 25(11): 1772-1774. |
[10] |
AN Yue1, ZHU Zai-ming1, HU Jie-han2, GE Jun-jie1 . Study on Spectroscopy of α, α’-Dioxoketene Cyclic s, s-Acetals [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2004, 24(09): 1069-1071. |
[11] |
WANG Xiao-yan, HUANG Xiang-hong . Synthesis and Spectral Characterization of 4-Amino-4’-Chlorobenzophenone [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2004, 24(05): 573-575. |
|
|
|
|