Department of Pediatrics and Child Health, Aga Khan University, Karachi, Pakistan.
Centre for Public Health Kinetics, New Delhi, Delhi, India.
JAMA Netw Open. 2020 Dec 1;3(12):e2029655. doi: 10.1001/jamanetworkopen.2020.29655.
Worldwide, preterm birth (PTB) is the single largest cause of deaths in the perinatal and neonatal period and is associated with increased morbidity in young children. The cause of PTB is multifactorial, and the development of generalizable biological models may enable early detection and guide therapeutic studies.
To investigate the ability of transcriptomics and proteomics profiling of plasma and metabolomics analysis of urine to identify early biological measurements associated with PTB.
DESIGN, SETTING, AND PARTICIPANTS: This diagnostic/prognostic study analyzed plasma and urine samples collected from May 2014 to June 2017 from pregnant women in 5 biorepository cohorts in low- and middle-income countries (LMICs; ie, Matlab, Bangladesh; Lusaka, Zambia; Sylhet, Bangladesh; Karachi, Pakistan; and Pemba, Tanzania). These cohorts were established to study maternal and fetal outcomes and were supported by the Alliance for Maternal and Newborn Health Improvement and the Global Alliance to Prevent Prematurity and Stillbirth biorepositories. Data were analyzed from December 2018 to July 2019.
Blood and urine specimens that were collected early during pregnancy (median sampling time of 13.6 weeks of gestation, according to ultrasonography) were processed, stored, and shipped to the laboratories under uniform protocols. Plasma samples were assayed for targeted measurement of proteins and untargeted cell-free ribonucleic acid profiling; urine samples were assayed for metabolites.
The PTB phenotype was defined as the delivery of a live infant before completing 37 weeks of gestation.
Of the 81 pregnant women included in this study, 39 had PTBs (48.1%) and 42 had term pregnancies (51.9%) (mean [SD] age of 24.8 [5.3] years). Univariate analysis demonstrated functional biological differences across the 5 cohorts. A cohort-adjusted machine learning algorithm was applied to each biological data set, and then a higher-level machine learning modeling combined the results into a final integrative model. The integrated model was more accurate, with an area under the receiver operating characteristic curve (AUROC) of 0.83 (95% CI, 0.72-0.91) compared with the models derived for each independent biological modality (transcriptomics AUROC, 0.73 [95% CI, 0.61-0.83]; metabolomics AUROC, 0.59 [95% CI, 0.47-0.72]; and proteomics AUROC, 0.75 [95% CI, 0.64-0.85]). Primary features associated with PTB included an inflammatory module as well as a metabolomic module measured in urine associated with the glutamine and glutamate metabolism and valine, leucine, and isoleucine biosynthesis pathways.
This study found that, in LMICs and high PTB settings, major biological adaptations during term pregnancy follow a generalizable model and the predictive accuracy for PTB was augmented by combining various omics data sets, suggesting that PTB is a condition that manifests within multiple biological systems. These data sets, with machine learning partnerships, may be a key step in developing valuable predictive tests and intervention candidates for preventing PTB.
在全球范围内,早产(PTB)是围产期和新生儿期死亡的最大单一原因,并与幼儿发病率增加有关。PTB 的病因是多因素的,开发可推广的生物学模型可能能够实现早期检测并指导治疗研究。
研究血浆转录组学和蛋白质组学分析以及尿液代谢组学分析鉴定与 PTB 相关的早期生物学测量的能力。
设计、设置和参与者:本诊断/预后研究分析了 2014 年 5 月至 2017 年 6 月期间来自低中等收入国家(LMICs;即孟加拉国的 Matlab;赞比亚的卢萨卡;孟加拉国的锡尔赫特;巴基斯坦的卡拉奇和坦桑尼亚的奔巴)5 个生物库队列中孕妇的血浆和尿液样本。这些队列是为研究母婴结局而建立的,并得到了改善孕产妇和新生儿健康联盟以及预防早产和死产全球联盟的支持。数据于 2018 年 12 月至 2019 年 7 月进行分析。
在妊娠早期(根据超声检查中位数为 13.6 周妊娠)采集并处理、储存和运输血液和尿液标本,并按照统一的方案进行处理。对血浆样本进行靶向测量蛋白质和非靶向游离核糖核酸分析;对尿液样本进行代谢物分析。
PTB 表型定义为在完成 37 周妊娠之前分娩的活产婴儿。
在这项研究中,包括 81 名孕妇,其中 39 名患有 PTB(48.1%),42 名足月妊娠(51.9%)(平均[标准差]年龄为 24.8[5.3]岁)。单变量分析显示,5 个队列之间存在功能生物学差异。将队列调整后的机器学习算法应用于每个生物学数据集,然后将更高层次的机器学习建模将结果组合到最终的综合模型中。与每个独立生物学模型(转录组学 AUC,0.73[95%CI,0.61-0.83];代谢组学 AUC,0.59[95%CI,0.47-0.72];和蛋白质组学 AUC,0.75[95%CI,0.64-0.85])相比,综合模型更准确,其接受者操作特征曲线(AUC)为 0.83(95%CI,0.72-0.91)。与 PTB 相关的主要特征包括与谷氨酰胺和谷氨酸代谢以及缬氨酸、亮氨酸和异亮氨酸生物合成途径相关的尿液中炎症模块和代谢模块。
本研究发现,在 LMICs 和高 PTB 环境中,足月妊娠期间的主要生物学适应遵循一种可推广的模式,通过结合各种组学数据集,PTB 的预测准确性得到提高,这表明 PTB 是一种在多个生物学系统中表现出来的疾病。这些数据集与机器学习合作伙伴关系可能是开发有价值的预测测试和预防 PTB 的干预候选物的关键步骤。