Department of Medical and Molecular Genetics, Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.
Department of BioHealth Informatics, School of Informatics and Computing, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202, USA.
BMC Med Genomics. 2020 Apr 3;13(Suppl 5):49. doi: 10.1186/s12920-020-0688-z.
While several multigene signatures are available for predicting breast cancer prognosis, particularly in early stage disease, effective molecular indicators are needed, especially for triple-negative carcinomas, to improve treatments and predict diagnostic outcomes. The objective of this study was to identify transcriptional regulatory networks to better understand mechanisms giving rise to breast cancer development and to incorporate this information into a model for predicting clinical outcomes.
Gene expression profiles from 1097 breast cancer patients were retrieved from The Cancer Genome Atlas (TCGA). Breast cancer-specific transcription regulatory information was identified by considering the binding site information from ENCODE and the top co-expressed targets in TCGA using a nonlinear approach. We then used this information to predict breast cancer patient survival outcome.
We built a multiple regulator-based prediction model for breast cancer. This model was validated in more than 5000 breast cancer patients from the Gene Expression Omnibus (GEO) databases. We demonstrated our regulator model was significantly associated with clinical stage and that cell cycle and DNA replication related pathways were significantly enriched in high regulator risk patients.
Our findings demonstrate that transcriptional regulator activities can predict patient survival. This finding provides additional biological insights into the mechanisms of breast cancer progression.
虽然有几种多基因标记可用于预测乳腺癌的预后,特别是在早期疾病中,但需要有效的分子指标,特别是对于三阴性乳腺癌,以改善治疗方法并预测诊断结果。本研究的目的是确定转录调控网络,以更好地了解导致乳腺癌发生的机制,并将这些信息纳入预测临床结果的模型中。
从癌症基因组图谱(TCGA)中检索了 1097 名乳腺癌患者的基因表达谱。通过考虑 ENCODE 的结合位点信息和 TCGA 中最高共表达的靶标,使用非线性方法确定了乳腺癌特异性转录调控信息。然后,我们使用这些信息来预测乳腺癌患者的生存结果。
我们构建了一个基于多个调节剂的乳腺癌预测模型。该模型在来自基因表达综合数据库(GEO)的 5000 多名乳腺癌患者中进行了验证。我们证明我们的调节剂模型与临床分期显著相关,并且在高调节剂风险患者中细胞周期和 DNA 复制相关途径显著富集。
我们的研究结果表明,转录调节剂的活性可以预测患者的生存。这一发现为乳腺癌进展的机制提供了更多的生物学见解。