Department of Genetics at Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil.
Center for Cell-Based Therapy (CEPID/FAPESP), National Institute of Science and Technology in Stem Cell and Cell Therapy (INCTC/CNPq), Regional Blood Center of Ribeirão Preto, Ribeirão Preto, Brazil.
BMC Genomics. 2021 Aug 20;22(1):624. doi: 10.1186/s12864-021-07918-2.
BACKGROUND: Finding meaningful gene-gene interaction and the main Transcription Factors (TFs) in co-expression networks is one of the most important challenges in gene expression data mining. RESULTS: Here, we developed the R package "CeTF" that integrates the Partial Correlation with Information Theory (PCIT) and Regulatory Impact Factors (RIF) algorithms applied to gene expression data from microarray, RNA-seq, or single-cell RNA-seq platforms. This approach allows identifying the transcription factors most likely to regulate a given network in different biological systems - for example, regulation of gene pathways in tumor stromal cells and tumor cells of the same tumor. This pipeline can be easily integrated into the high-throughput analysis. To demonstrate the CeTF package application, we analyzed gastric cancer RNA-seq data obtained from TCGA (The Cancer Genome Atlas) and found the HOXB3 gene as the second most relevant TFs with a high regulatory impact (TFs-HRi) regulating gene pathways in the cell cycle. CONCLUSION: This preliminary finding shows the potential of CeTF to list master regulators of gene networks. CeTF was designed as a user-friendly tool that provides many highly automated functions without requiring the user to perform many complicated processes. It is available on Bioconductor ( http://bioconductor.org/packages/CeTF ) and GitHub ( http://github.com/cbiagii/CeTF ).
背景:在共表达网络中发现有意义的基因-基因相互作用和主要转录因子(TFs)是基因表达数据挖掘中最重要的挑战之一。
结果:在这里,我们开发了 R 包“CeTF”,该包集成了偏相关与信息理论(PCIT)和调节影响因子(RIF)算法,应用于微阵列、RNA-seq 或单细胞 RNA-seq 平台的基因表达数据。该方法允许识别最有可能在不同生物系统中调节给定网络的转录因子 - 例如,肿瘤基质细胞和同一肿瘤的肿瘤细胞中基因途径的调节。该流水线可以轻松集成到高通量分析中。为了演示 CeTF 包的应用,我们分析了来自 TCGA(癌症基因组图谱)的胃癌 RNA-seq 数据,发现 HOXB3 基因是第二个具有高调节影响(TFs-HRi)的相关 TFs,可调节细胞周期中的基因途径。
结论:这一初步发现表明 CeTF 具有列出基因网络主控调节器的潜力。CeTF 被设计为用户友好的工具,提供了许多高度自动化的功能,而无需用户执行许多复杂的过程。它可在 Bioconductor(http://bioconductor.org/packages/CeTF)和 GitHub(http://github.com/cbiagii/CeTF)上使用。
Bioinformatics. 2019-12-15
BMC Bioinformatics. 2018-10-24
Bioinformatics. 2022-4-28
BMC Cancer. 2019-10-25
BMC Bioinformatics. 2021-5-25
Comput Struct Biotechnol J. 2020-6-29
F1000Res. 2019-10-18
Nucleic Acids Res. 2019-7-2
Eur Rev Med Pharmacol Sci. 2018-11
Nucleic Acids Res. 2019-1-8