Cheng Chao, Li Lei M, Alves Pedro, Gerstein Mark
Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA.
BMC Genomics. 2009 May 15;10:225. doi: 10.1186/1471-2164-10-225.
Aberrant activation or expression of transcription factors has been implicated in the tumorigenesis of various types of cancer. In spite of the prevalent application of microarray experiments for profiling gene expression in cancer samples, they provide limited information regarding the activities of transcription factors. However, the association between transcription factors and cancers is largely dependent on the transcription regulatory activities rather than mRNA expression levels.
In this paper, we propose a computational approach that integrates microarray expression data with the transcription factor binding site information to systematically identify transcription factors associated with patient survival given a specific cancer type. This approach was applied to two gene expression data sets for breast cancer and acute myeloid leukemia. We found that two transcription factor families, the steroid nuclear receptor family and the ATF/CREB family, are significantly correlated with the survival of patients with breast cancer; and that a transcription factor named T-cell acute lymphocytic leukemia 1 is significantly correlated with acute myeloid leukemia patient survival.
Our analysis identifies transcription factors associating with patient survival and provides insight into the regulatory mechanism underlying the breast cancer and leukemia. The transcription factors identified by our method are biologically meaningful and consistent with prior knowledge. As an insightful tool, this approach can also be applied to other microarray cancer data sets to help researchers better understand the intricate relationship between transcription factors and diseases.
转录因子的异常激活或表达与多种癌症的肿瘤发生有关。尽管微阵列实验在分析癌症样本中的基因表达方面得到了广泛应用,但它们提供的关于转录因子活性的信息有限。然而,转录因子与癌症之间的关联很大程度上取决于转录调控活性而非mRNA表达水平。
在本文中,我们提出了一种计算方法,该方法将微阵列表达数据与转录因子结合位点信息相结合,以系统地识别给定特定癌症类型下与患者生存相关的转录因子。该方法应用于乳腺癌和急性髓系白血病的两个基因表达数据集。我们发现,类固醇核受体家族和ATF/CREB家族这两个转录因子家族与乳腺癌患者的生存显著相关;并且一个名为T细胞急性淋巴细胞白血病1的转录因子与急性髓系白血病患者的生存显著相关。
我们的分析确定了与患者生存相关的转录因子,并深入了解了乳腺癌和白血病的潜在调控机制。我们方法识别出的转录因子具有生物学意义且与先验知识一致。作为一种有洞察力的工具,该方法也可应用于其他微阵列癌症数据集,以帮助研究人员更好地理解转录因子与疾病之间的复杂关系。