Suppr超能文献

利用新型计算分析蛋白质结合阵列数据鉴定 Nkx2.2 在胰腺中的直接靶标。

Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas.

机构信息

Department of Genetics and Development, Columbia University, New York, NY 10032, USA.

出版信息

BMC Bioinformatics. 2011 Feb 25;12:62. doi: 10.1186/1471-2105-12-62.

Abstract

BACKGROUND

The creation of a complete genome-wide map of transcription factor binding sites is essential for understanding gene regulatory networks in vivo. However, current prediction methods generally rely on statistical models that imperfectly model transcription factor binding. Generation of new prediction methods that are based on protein binding data, but do not rely on these models may improve prediction sensitivity and specificity.

RESULTS

We propose a method for predicting transcription factor binding sites in the genome by directly mapping data generated from protein binding microarrays (PBM) to the genome and calculating a moving average of several overlapping octamers. Using this unique algorithm, we predicted binding sites for the essential pancreatic islet transcription factor Nkx2.2 in the mouse genome and confirmed >90% of the tested sites by EMSA and ChIP. Scores generated from this method more accurately predicted relative binding affinity than PWM based methods. We have also identified an alternative core sequence recognized by the Nkx2.2 homeodomain. Furthermore, we have shown that this method correctly identified binding sites in the promoters of two critical pancreatic islet β-cell genes, NeuroD1 and insulin2, that were not predicted by traditional methods. Finally, we show evidence that the algorithm can also be applied to predict binding sites for the nuclear receptor Hnf4α.

CONCLUSIONS

PBM-mapping is an accurate method for predicting Nkx2.2 binding sites and may be widely applicable for the creation of genome-wide maps of transcription factor binding sites.

摘要

背景

创建完整的转录因子结合位点全基因组图谱对于理解体内基因调控网络至关重要。然而,目前的预测方法通常依赖于不能完美模拟转录因子结合的统计模型。生成新的预测方法,这些方法基于蛋白质结合数据,但不依赖于这些模型,可能会提高预测的灵敏度和特异性。

结果

我们提出了一种通过直接将蛋白质结合微阵列(PBM)产生的数据映射到基因组并计算几个重叠八聚体的移动平均值来预测基因组中转录因子结合位点的方法。使用这种独特的算法,我们预测了在小鼠基因组中胰岛转录因子 Nkx2.2 的结合位点,并通过 EMSA 和 ChIP 验证了 >90%的测试位点。与基于 PWM 的方法相比,该方法生成的分数更准确地预测了相对结合亲和力。我们还确定了由 Nkx2.2 同源域识别的替代核心序列。此外,我们还表明,该方法可以正确识别两个关键胰岛β细胞基因 NeuroD1 和 insulin2 的启动子中的结合位点,而这些结合位点不能被传统方法预测。最后,我们证明该算法也可应用于预测核受体 Hnf4α 的结合位点。

结论

PBM 映射是一种预测 Nkx2.2 结合位点的准确方法,可能广泛适用于创建转录因子结合位点的全基因组图谱。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验