Dept. of Bioinformatics, Semmelweis University, Tuzolto U. 7-9, 1094, Budapest, Hungary.
Geroscience. 2023 Jun;45(3):1889-1898. doi: 10.1007/s11357-023-00742-4. Epub 2023 Mar 1.
Progress in ovarian cancer treatment lags behind other tumor types. With diagnosis usually at an advanced stage, there is a high demand for reliable prognostic biomarkers capable of the selection of effective chemo- and targeted therapies. Our goal was to establish a large-scale transcriptomic database and use it to uncover and rank survival-associated genes. Ovarian cancer cohorts with transcriptome-level gene expression data and clinical follow-up were identified from public repositories. All samples were normalized and entered into an integrated database. Cox univariate survival analysis was performed for all genes and was followed by multivariate analysis for selected genes involving clinical and pathological variables. False discovery rate was computed for multiple hypothesis testing and a 1% cutoff was used to determine statistical significance. The complete integrated database comprises 1816 samples from 17 datasets. Altogether, 2468 genes were correlated to progression-free survival (PFS), and 704 genes were correlated with overall survival (OS). The most significant genes were WBP1L, ASAP3, CNNM2, and NCAPH2 for progression-free survival and CSE1L, NUAK1, ALPK2, and SHKBP1 for overall survival. Genes significant for PFS were also preferentially significant for predicting OS as well. All data including HR and p values as well as the used cutoff values for all genes for both PFS and OS are provided to enable the ranking of future biomarker candidates across all genes. Our results help to prioritize genes and to neglect those which are most likely to fail in studies aiming to establish new clinically useful biomarkers and therapeutic targets in serous ovarian cancer.
卵巢癌治疗的进展落后于其他肿瘤类型。由于通常在晚期诊断,因此迫切需要可靠的预后生物标志物,以选择有效的化疗和靶向治疗方法。我们的目标是建立一个大规模的转录组数据库,并利用它来发现和排名与生存相关的基因。从公共存储库中确定了具有转录组水平基因表达数据和临床随访的卵巢癌队列。所有样本均经过归一化处理并输入到集成数据库中。对所有基因进行 Cox 单因素生存分析,然后对涉及临床和病理变量的选定基因进行多因素分析。对多重假设检验进行了错误发现率计算,并使用 1%的截止值来确定统计学意义。完整的集成数据库包含来自 17 个数据集的 1816 个样本。总共,有 2468 个基因与无进展生存期(PFS)相关,有 704 个基因与总生存期(OS)相关。无进展生存的最显著基因是 WBP1L、ASAP3、CNNM2 和 NCAPH2,总生存的最显著基因是 CSE1L、NUAK1、ALPK2 和 SHKBP1。与 PFS 相关的基因也优先用于预测 OS。提供了所有数据,包括 HR 和 p 值以及用于 PFS 和 OS 的所有基因的截止值,以能够对所有基因的未来生物标志物候选物进行排名。我们的结果有助于确定基因的优先级,并忽略那些在旨在建立新的、临床上有用的生物标志物和治疗靶点的研究中最有可能失败的基因。