Suppr超能文献

通过基于转换器的模型揭示组织特异性 TF-DNA 结合与染色质特征之间的关系。

Uncovering the Relationship between Tissue-Specific TF-DNA Binding and Chromatin Features through a Transformer-Based Model.

机构信息

School of Computer Science, Chengdu University of Information Technology, Chengdu 610225, China.

School of Electronic Information and Artificial Intelligence, Leshan Normal University, Leshan 614000, China.

出版信息

Genes (Basel). 2022 Oct 26;13(11):1952. doi: 10.3390/genes13111952.

Abstract

Chromatin features can reveal tissue-specific TF-DNA binding, which leads to a better understanding of many critical physiological processes. Accurately identifying TF-DNA bindings and constructing their relationships with chromatin features is a long-standing goal in the bioinformatic field. However, this has remained elusive due to the complex binding mechanisms and heterogeneity among inputs. Here, we have developed the GHTNet (General Hybrid Transformer Network), a transformer-based model to predict TF-DNA binding specificity. The GHTNet decodes the relationship between tissue-specific TF-DNA binding and chromatin features via a specific input scheme of alternative inputs and reveals important gene regions and tissue-specific motifs. Our experiments show that the GHTNet has excellent performance, achieving about a 5% absolute improvement over existing methods. The TF-DNA binding mechanism analysis shows that the importance of TF-DNA binding features varies across tissues. The best predictor is based on the DNA sequence, followed by epigenomics and shape. In addition, cross-species studies address the limited data, thus providing new ideas in this case. Moreover, the GHTNet is applied to interpret the relationship among TFs, chromatin features, and diseases associated with AD46 tissue. This paper demonstrates that the GHTNet is an accurate and robust framework for deciphering tissue-specific TF-DNA binding and interpreting non-coding regions.

摘要

染色质特征可以揭示组织特异性 TF-DNA 结合,这有助于更好地理解许多关键的生理过程。准确识别 TF-DNA 结合并构建它们与染色质特征的关系是生物信息学领域的一个长期目标。然而,由于复杂的结合机制和输入的异质性,这一目标仍然难以实现。在这里,我们开发了 GHTNet(通用混合 Transformer 网络),这是一种基于 Transformer 的模型,用于预测 TF-DNA 结合特异性。GHTNet 通过替代输入的特定输入方案来解码组织特异性 TF-DNA 结合与染色质特征之间的关系,并揭示重要的基因区域和组织特异性基序。我们的实验表明,GHTNet 具有出色的性能,比现有方法提高了约 5%的绝对精度。TF-DNA 结合机制分析表明,TF-DNA 结合特征的重要性因组织而异。最佳预测器基于 DNA 序列,其次是表观基因组学和形状。此外,跨物种研究解决了数据有限的问题,从而为这种情况提供了新的思路。此外,GHTNet 还应用于解释与 AD46 组织相关的 TF、染色质特征和疾病之间的关系。本文证明了 GHTNet 是一个准确而强大的框架,用于破译组织特异性 TF-DNA 结合并解释非编码区域。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da8f/9690320/90e3d90af224/genes-13-01952-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验