Jiang Jiayan, Li Zhipeng, Chen Cheng, Jiang Weili, Xu Biao, Zhao Qi
School of Public Health and Key Laboratory of Public Health Safety, Fudan University, Shanghai, People's Republic of China.
Department of Chronic Communicable Disease, Center for Disease Control and Prevention of Jiangsu Province, Nanjing, Jiangsu, People's Republic of China.
Infect Drug Resist. 2021 Nov 15;14:4795-4807. doi: 10.2147/IDR.S330493. eCollection 2021.
To investigate the dysregulated pathways and identify reliable diagnostic biomarkers for tuberculosis using integrated analysis of metabolomics and transcriptomics.
Three groups of samples, untargeted metabolomics analysis of healthy controls (HC), latent tuberculosis infection patients (LTBI), and active tuberculosis patients (TB), were analyzed using gas chromatography time-of-flight mass spectrometry (GC-TOF MS) and ultra-high performance liquid chromatography-quantitative mass spectrometry (UHPLC-QE-MS). Both univariate and multivariate and statistical analyses were used to select differential metabolites (DMs) among group comparison, and LASSO regression analysis was employed to discover potential diagnostic biomarkers. Metabolite set enrichment analysis was performed to identify the altered metabolic pathways specifically in patients with TB. Meanwhile, a transcriptomic dataset GSEG4992 was downloaded from the GEO database to explore the differentially expressed genes (DEGs) between TB and HC identified in significantly enriched pathways. Finally, an integrative analysis of DMs and DEGs was performed to investigate the possible molecular mechanisms of TB.
Thirty-three specific metabolites were significantly different between TB and HC, of which 7 (5-hydroxyindoleacetic acid, isoleucyl-isoleucine, heptadecanoic acid, indole acetaldehyde, 5-ethyl-2,4-dimethyloxazole, and 2-hydroxycaproic acid, unknown 71) were chosen as combinational potential biomarkers for TB. The area under the curve (AUC) value of these biomarkers was 0.97 (95% CI: 0.92-1.00). Metabolites set enrichment analysis (MSEA) displayed that there were 3 significantly enriched pathways among all. The genes in 3 significantly enriched pathways were further analyzed, of which 9(ALDH3B1, BCAT1, BCAT2, GLYAT, GOT1, IL4I1, MIF, SDS, SDSL) were expressed differentially. The area under the curve (AUC) values of these DEGs enriched in pathways mostly were greater than 0.8. As a result, a connected network of metabolites and genes in the pathways were established, which provides insights into the credibility of selected metabolites.
The newly identified metabolic biomarkers display a high potential to be developed into a promising tool for TB screening, diagnosis, and therapeutic effect monitoring.
通过代谢组学和转录组学的综合分析,研究结核病失调的通路并确定可靠的诊断生物标志物。
使用气相色谱-飞行时间质谱(GC-TOF MS)和超高效液相色谱-定量质谱(UHPLC-QE-MS)对三组样本进行分析,这三组样本分别为健康对照(HC)、潜伏性结核感染患者(LTBI)和活动性肺结核患者(TB)的非靶向代谢组学分析。在组间比较中,使用单变量和多变量统计分析来选择差异代谢物(DMs),并采用LASSO回归分析来发现潜在的诊断生物标志物。进行代谢物集富集分析以确定结核病患者中特异性改变的代谢途径。同时,从GEO数据库下载转录组数据集GSEG4992,以探索在显著富集途径中鉴定出的结核病与健康对照之间的差异表达基因(DEGs)。最后,对差异代谢物和差异表达基因进行综合分析,以研究结核病可能的分子机制。
结核病与健康对照之间有33种特异性代谢物存在显著差异,其中7种(5-羟基吲哚乙酸、异亮氨酰-异亮氨酸、十七烷酸、吲哚乙醛、5-乙基-2,4-二甲基恶唑、2-羟基己酸、未知物71)被选为结核病的联合潜在生物标志物。这些生物标志物的曲线下面积(AUC)值为0.97(95%CI:0.92-1.00)。代谢物集富集分析(MSEA)显示,所有代谢途径中有3条显著富集。对3条显著富集途径中的基因进行进一步分析,其中9种(醛脱氢酶3B1、支链氨基酸转氨酶1、支链氨基酸转氨酶2、甘氨酸-N-乙酰转移酶、谷草转氨酶1、白细胞介素4诱导蛋白1、巨噬细胞移动抑制因子、磺基转移酶、磺基转移酶样蛋白)存在差异表达。这些在途径中富集后的差异表达基因的曲线下面积(AUC)值大多大于0.8。结果,在这些途径中建立了代谢物和基因的连接网络,这为所选代谢物的可信度提供了见解。
新鉴定的代谢生物标志物具有很高的潜力,有望开发成为结核病筛查、诊断和治疗效果监测的有效工具。