Suppr超能文献

组织特异性预测直接调控基因。

Tissue-specific prediction of directly regulated genes.

机构信息

Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia.

出版信息

Bioinformatics. 2011 Sep 1;27(17):2354-60. doi: 10.1093/bioinformatics/btr399. Epub 2011 Jun 30.

Abstract

UNLABELLED

Direct binding by a transcription factor (TF) to the proximal promoter of a gene is a strong evidence that the TF regulates the gene. Assaying the genome-wide binding of every TF in every cell type and condition is currently impractical. Histone modifications correlate with tissue/cell/condition-specific ('tissue specific') TF binding, so histone ChIP-seq data can be combined with traditional position weight matrix (PWM) methods to make tissue-specific predictions of TF-promoter interactions.

RESULTS

We use supervised learning to train a naïve Bayes predictor of TF-promoter binding. The predictor's features are the histone modification levels and a PWM-based score for the promoter. Training and testing uses sets of promoters labeled using TF ChIP-seq data, and we use cross-validation on 23 such datasets to measure the accuracy. A PWM+histone naïve Bayes predictor using a single histone modification (H3K4me3) is substantially more accurate than a PWM score or a conservation-based score (phylogenetic motif model). The naïve Bayes predictor is more accurate (on average) at all sensitivity levels, and makes only half as many false positive predictions at sensitivity levels from 10% to 80%. On average, it correctly predicts 80% of bound promoters at a false positive rate of 20%. Accuracy does not diminish when we test the predictor in a different cell type (and species) from training. Accuracy is barely diminished even when we train the predictor without using TF ChIP-seq data.

AVAILABILITY

Our tissue-specific predictor of promoters bound by a TF is called Dr Gene and is available at http://bioinformatics.org.au/drgene.

CONTACT

t.bailey@imb.uq.edu.au

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

未加标签

转录因子(TF)与基因近端启动子的直接结合是 TF 调节基因的有力证据。目前,检测每个 TF 在每种细胞类型和条件下的全基因组结合是不切实际的。组蛋白修饰与组织/细胞/条件特异性(“组织特异性”)TF 结合相关,因此组蛋白 ChIP-seq 数据可以与传统的位置权重矩阵(PWM)方法结合使用,以对 TF-启动子相互作用进行组织特异性预测。

结果

我们使用有监督学习来训练 TF-启动子结合的朴素贝叶斯预测器。预测器的特征是组蛋白修饰水平和基于 PWM 的启动子评分。训练和测试使用使用 TF ChIP-seq 数据标记的启动子集,我们使用 23 个此类数据集的交叉验证来衡量准确性。使用单个组蛋白修饰(H3K4me3)的 PWM+组蛋白朴素贝叶斯预测器比 PWM 评分或基于保守性的评分(系统发育基序模型)更准确。在所有灵敏度水平上,朴素贝叶斯预测器都更准确(平均而言),并且在灵敏度水平为 10%至 80%时,假阳性预测的数量减少了一半。平均而言,在假阳性率为 20%的情况下,它可以正确预测 80%的结合启动子。当我们在与训练不同的细胞类型(和物种)中测试预测器时,准确性不会降低。即使我们在不使用 TF ChIP-seq 数据的情况下训练预测器,准确性也几乎没有降低。

可用性

我们的 TF 结合的启动子的组织特异性预测器称为 Dr Gene,可在 http://bioinformatics.org.au/drgene 上获得。

联系方式

t.bailey@imb.uq.edu.au

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
Tissue-specific prediction of directly regulated genes.组织特异性预测直接调控基因。
Bioinformatics. 2011 Sep 1;27(17):2354-60. doi: 10.1093/bioinformatics/btr399. Epub 2011 Jun 30.
2
Genome-wide in silico prediction of gene expression.全基因组计算机预测基因表达。
Bioinformatics. 2012 Nov 1;28(21):2789-96. doi: 10.1093/bioinformatics/bts529. Epub 2012 Sep 6.
4
TICA: Transcriptional Interaction and Coregulation Analyzer.TICA:转录相互作用和协同调控分析器。
Genomics Proteomics Bioinformatics. 2018 Oct;16(5):342-353. doi: 10.1016/j.gpb.2018.05.004. Epub 2018 Dec 19.
7
8
T-Gene: improved target gene prediction.T-Gene:改进的靶基因预测。
Bioinformatics. 2020 Jun 1;36(12):3902-3904. doi: 10.1093/bioinformatics/btaa227.

本文引用的文献

1
Histone modification levels are predictive for gene expression.组蛋白修饰水平可预测基因表达。
Proc Natl Acad Sci U S A. 2010 Feb 16;107(7):2926-31. doi: 10.1073/pnas.0909344107. Epub 2010 Feb 1.
5
MEME SUITE: tools for motif discovery and searching.MEME套件:用于基序发现和搜索的工具。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W202-8. doi: 10.1093/nar/gkp335. Epub 2009 May 20.
6
MotifMap: a human genome-wide map of candidate regulatory motif sites.MotifMap:人类全基因组候选调控基序位点图谱。
Bioinformatics. 2009 Jan 15;25(2):167-74. doi: 10.1093/bioinformatics/btn605. Epub 2008 Nov 18.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验