Sufriyana Herdiantri, Salim Hotimah Masdan, Muhammad Akbar Reza, Wu Yu-Wei, Su Emily Chia-Yu
Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, 250 Wu-Xing Street, Taipei 11031, Taiwan.
Department of Medical Physiology, Faculty of Medicine, Universitas Nahdlatul Ulama Surabaya, 57 Raya Jemursari Road, Surabaya 60237, Indonesia.
Comput Struct Biotechnol J. 2022;20:4206-4224. doi: 10.1016/j.csbj.2022.08.011. Epub 2022 Aug 8.
A well-known blood biomarker (soluble fms-like tyrosinase-1 [sFLT-1]) for preeclampsia, i.e., a pregnancy disorder, was found to predict severe COVID-19, including in males. True biomarker may be masked by more-abrupt changes related to endothelial instead of placental dysfunction. This study aimed to identify blood biomarkers that represent maternal-fetal interface tissues for predicting preeclampsia but not COVID-19 infection.
The surrogate transcriptome of tissues was determined by that in maternal blood, utilizing four datasets ( = 1354) which were collected before the COVID-19 pandemic. Applying machine learning, a preeclampsia prediction model was chosen between those using blood transcriptome (differentially expressed genes [DEGs]) and the blood-derived surrogate for tissues. We selected the best predictive model by the area under the receiver operating characteristic (AUROC) using a dataset for developing the model, and well-replicated in datasets both with and without an intervention. To identify eligible blood biomarkers that predicted any-onset preeclampsia from the datasets but that were not positive in the COVID-19 dataset ( = 47), we compared several methods of predictor discovery: (1) the best prediction model; (2) gene sets of standard pipelines; and (3) a validated gene set for predicting any-onset preeclampsia during the pandemic ( = 404). We chose the most predictive biomarkers from the best method with the significantly largest number of discoveries by a permutation test. The biological relevance was justified by exploring and reanalyzing low- and high-level, multiomics information.
A prediction model using the surrogates developed for predicting any-onset preeclampsia (AUROC of 0.85, 95 % confidence interval [CI] 0.77 to 0.93) was the only that was well-replicated in an independent dataset with no intervention. No model was well-replicated in datasets with a vitamin D intervention. None of the blood biomarkers with high weights in the best model overlapped with blood DEGs. Blood biomarkers were transcripts of integrin-α5 (ITGA5), interferon regulatory factor-6 (IRF6), and P2X purinoreceptor-7 (P2RX7) from the prediction model, which was the only method that significantly discovered eligible blood biomarkers ( = 3/100 combinations, 3.0 %; =.036). Most of the predicted events (73.70 %) among any-onset preeclampsia were cluster A as defined by ITGA5 (Z-score ≥ 1.1), but were only a minority (6.34 %) among positives in the COVID-19 dataset. The remaining were predicted events (26.30 %) among any-onset preeclampsia or those among COVID-19 infection (93.66 %) if IRF6 Z-score was ≥-0.73 (clusters B and C), in which none was the predicted events among either late-onset preeclampsia (LOPE) or COVID-19 infection if P2RX7 Z-score was <0.13 (cluster C). Greater proportions of predicted events among LOPE were cluster A (82.85 % vs 70.53 %) compared to early-onset preeclampsia (EOPE). The biological relevance by multiomics information explained the biomarker mechanism, polymicrobial infection in any-onset preeclampsia by ITGA5, viral co-infection in EOPE by ITGA5-IRF6, a shared prediction with COVID-19 infection by ITGA5-IRF6-P2RX7, and non-replicability in datasets with a vitamin D intervention by ITGA5.
In a model that predicts preeclampsia but not COVID-19 infection, the important predictors were genes in maternal blood that were not extremely expressed, including the proposed blood biomarkers. The predictive performance and biological relevance should be validated in future experiments.
一种用于预测先兆子痫(一种妊娠疾病)的著名血液生物标志物(可溶性fms样酪氨酸激酶-1 [sFLT-1])被发现可预测严重的2019冠状病毒病(COVID-19),包括男性患者。真正的生物标志物可能会被与内皮功能障碍而非胎盘功能障碍相关的更突然的变化所掩盖。本研究旨在识别代表母胎界面组织的血液生物标志物,以预测先兆子痫而非COVID-19感染。
利用在COVID-19大流行之前收集的四个数据集(n = 1354),通过母体血液中的组织替代转录组来确定组织的替代转录组。应用机器学习,在使用血液转录组(差异表达基因[DEGs])和组织的血液衍生替代物的模型中选择先兆子痫预测模型。我们使用一个用于开发模型的数据集,通过受试者操作特征曲线下面积(AUROC)选择最佳预测模型,并在有和没有干预的数据集中进行了充分验证。为了从数据集中识别可预测任何类型先兆子痫但在COVID-19数据集(n = 47)中为阴性的合格血液生物标志物,我们比较了几种预测指标发现方法:(1)最佳预测模型;(2)标准流程的基因集;(3)用于预测大流行期间任何类型先兆子痫的验证基因集(n = 404)。我们通过排列检验从发现数量显著最多的最佳方法中选择最具预测性的生物标志物。通过探索和重新分析低水平和高水平的多组学信息来证明生物学相关性。
使用为预测任何类型先兆子痫而开发的替代物的预测模型(AUROC为0.85,95%置信区间[CI]为0.77至0.93)是唯一在无干预的独立数据集中得到充分验证的模型。在有维生素D干预的数据集中,没有模型得到充分验证。最佳模型中权重较高的血液生物标志物均与血液DEGs不重叠。预测模型中的血液生物标志物是整合素-α5(ITGA5)、干扰素调节因子-6(IRF6)和P2X嘌呤受体-7(P2RX7)的转录本,这是唯一能显著发现合格血液生物标志物的方法(100种组合中有3种,3.0%;P = 0.036)。任何类型先兆子痫中大多数预测事件(73.70%)属于ITGA5定义的A组(Z评分≥1.1),但在COVID-19数据集中的阳性病例中仅占少数(6.