Stanford Institute for Immunity, Transplantation and Infection, Stanford, CA, USA; Division of Biomedical Informatics Research, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA.
Stanford Institute for Immunity, Transplantation and Infection, Stanford, CA, USA.
Lancet Respir Med. 2016 Mar;4(3):213-24. doi: 10.1016/S2213-2600(16)00048-5. Epub 2016 Feb 20.
Active pulmonary tuberculosis is difficult to diagnose and treatment response is difficult to effectively monitor. A WHO consensus statement has called for new non-sputum diagnostics. The aim of this study was to use an integrated multicohort analysis of samples from publically available datasets to derive a diagnostic gene set in the peripheral blood of patients with active tuberculosis.
We searched two public gene expression microarray repositories and retained datasets that examined clinical cohorts of active pulmonary tuberculosis infection in whole blood. We compared gene expression in patients with either latent tuberculosis or other diseases versus patients with active tuberculosis using our validated multicohort analysis framework. Three datasets were used as discovery datasets and meta-analytical methods were used to assess gene effects in these cohorts. We then validated the diagnostic capacity of the three gene set in the remaining 11 datasets.
A total of 14 datasets containing 2572 samples from 10 countries from both adult and paediatric patients were included in the analysis. Of these, three datasets (N=1023) were used to discover a set of three genes (GBP5, DUSP3, and KLF2) that are highly diagnostic for active tuberculosis. We validated the diagnostic power of the three gene set to separate active tuberculosis from healthy controls (global area under the ROC curve (AUC) 0·90 [95% CI 0·85-0·95]), latent tuberculosis (0·88 [0·84-0·92]), and other diseases (0·84 [0·80-0·95]) in eight independent datasets composed of both children and adults from ten countries. Expression of the three-gene set was not confounded by HIV infection status, bacterial drug resistance, or BCG vaccination. Furthermore, in four additional cohorts, we showed that the tuberculosis score declined during treatment of patients with active tuberculosis.
Overall, our integrated multicohort analysis yielded a three-gene set in whole blood that is robustly diagnostic for active tuberculosis, that was validated in multiple independent cohorts, and that has potential clinical application for diagnosis and monitoring treatment response. Prospective laboratory validation will be required before it can be used in a clinical setting.
National Institute of Allergy and Infectious Diseases, National Library of Medicine, the Stanford Child Health Research Institute, the Society for University Surgeons, and the Bill and Melinda Gates Foundation.
活动性肺结核的诊断较为困难,且治疗效果的监测也较为困难。世界卫生组织的一份共识声明呼吁开发新的非痰检诊断方法。本研究旨在通过整合公共可用数据集的多队列分析,从活动性肺结核患者的外周血中提取诊断基因集。
我们检索了两个公共基因表达微阵列数据库,并保留了检测全血中活动性肺结核感染临床队列的数据集。我们使用已验证的多队列分析框架,比较了潜伏性结核或其他疾病患者与活动性肺结核患者的基因表达情况。三个数据集被用作发现数据集,并使用元分析方法评估这些队列中基因的效应。然后,我们在其余 11 个数据集中验证了三个基因集的诊断能力。
共纳入了来自 10 个国家的 14 个数据集,包含了 1023 名成人和儿童患者的 2572 个样本。其中,三个数据集(N=1023)用于发现一组三个基因(GBP5、DUSP3 和 KLF2),它们对活动性肺结核具有高度诊断价值。我们验证了该三个基因集在来自十个国家的 8 个独立数据集(包含儿童和成人)中区分活动性肺结核与健康对照者(全球 AUC 为 0.90[95%CI 0.85-0.95])、潜伏性结核(0.88[0.84-0.92])和其他疾病(0.84[0.80-0.95])的诊断能力。三个基因集的表达不受 HIV 感染状态、细菌药物耐药性或卡介苗接种的影响。此外,在另外四个队列中,我们发现肺结核评分在活动性肺结核患者的治疗过程中下降。
总的来说,我们的多队列分析生成了一个全血中的三个基因集,对活动性肺结核具有稳健的诊断价值,在多个独立队列中得到了验证,并具有用于诊断和监测治疗反应的潜在临床应用价值。在临床应用之前,还需要进行前瞻性实验室验证。
美国国立过敏和传染病研究所、美国国家医学图书馆、斯坦福儿童健康研究所、美国大学外科医生学会和比尔及梅琳达·盖茨基金会。