• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用深度学习准确预测果蝇高分辨率拓扑关联域 (TAD) 的边界。

Accurate prediction of boundaries of high resolution topologically associated domains (TADs) in fruit flies using deep learning.

机构信息

Computer Science and Engineering Technology, University of Houston-Downtown, Houston, TX 77002, USA.

Biology and Biochemistry, University of Houston, Houston, TX 77204, USA.

出版信息

Nucleic Acids Res. 2019 Jul 26;47(13):e78. doi: 10.1093/nar/gkz315.

DOI:10.1093/nar/gkz315
PMID:31049567
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6648328/
Abstract

Genomes are organized into self-interacting chromatin regions called topologically associated domains (TADs). A significant number of TAD boundaries are shared across multiple cell types and conserved across species. Disruption of TAD boundaries may affect the expression of nearby genes and could lead to several diseases. Even though detection of TAD boundaries is important and useful, there are experimental challenges in obtaining high resolution TAD locations. Here, we present computational prediction of TAD boundaries from high resolution Hi-C data in fruit flies. By extensive exploration and testing of several deep learning model architectures with hyperparameter optimization, we show that a unique deep learning model consisting of three convolution layers followed by a long short-term-memory layer achieves an accuracy of 96%. This outperforms feature-based models' accuracy of 91% and an existing method's accuracy of 73-78% based on motif TRAP scores. Our method also detects previously reported motifs such as Beaf-32 that are enriched in TAD boundaries in fruit flies and also several unreported motifs.

摘要

基因组组织成称为拓扑关联域(TAD)的自我相互作用染色质区域。大量的 TAD 边界在多种细胞类型中共享,并在物种间保守。TAD 边界的破坏可能会影响附近基因的表达,并可能导致多种疾病。尽管 TAD 边界的检测很重要且有用,但在获得高分辨率 TAD 位置方面存在实验挑战。在这里,我们提出了一种从果蝇高分辨率 Hi-C 数据中预测 TAD 边界的计算方法。通过对几种深度学习模型结构进行广泛的探索和测试,并进行超参数优化,我们表明,由三个卷积层和一个长短期记忆层组成的独特深度学习模型的准确率为 96%。这优于基于特征的模型的准确率 91%,以及基于 motif TRAP 分数的现有方法的准确率 73-78%。我们的方法还检测到了先前报道的果蝇 TAD 边界中富集的 Beaf-32 等motif,以及一些未报道的 motif。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46b1/6648328/ac441a824c48/gkz315fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46b1/6648328/5809a4020baf/gkz315fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46b1/6648328/cf7682e03eef/gkz315fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46b1/6648328/27c055c008f0/gkz315fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46b1/6648328/ac441a824c48/gkz315fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46b1/6648328/5809a4020baf/gkz315fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46b1/6648328/cf7682e03eef/gkz315fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46b1/6648328/27c055c008f0/gkz315fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46b1/6648328/ac441a824c48/gkz315fig4.jpg

相似文献

1
Accurate prediction of boundaries of high resolution topologically associated domains (TADs) in fruit flies using deep learning.使用深度学习准确预测果蝇高分辨率拓扑关联域 (TAD) 的边界。
Nucleic Acids Res. 2019 Jul 26;47(13):e78. doi: 10.1093/nar/gkz315.
2
StackTADB: a stacking-based ensemble learning model for predicting the boundaries of topologically associating domains (TADs) accurately in fruit flies.StackTADB:一种基于堆叠的集成学习模型,可准确预测果蝇中拓扑关联域(TAD)的边界。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac023.
3
A comparison of topologically associating domain callers over mammals at high resolution.在高分辨率下比较哺乳动物的拓扑关联结构域调用器。
BMC Bioinformatics. 2022 Apr 12;23(1):127. doi: 10.1186/s12859-022-04674-2.
4
SpectralTAD: an R package for defining a hierarchy of topologically associated domains using spectral clustering.SpectralTAD:一个使用谱聚类定义层次结构拓扑关联域的 R 包。
BMC Bioinformatics. 2020 Jul 20;21(1):319. doi: 10.1186/s12859-020-03652-w.
5
Topologically associating domain boundaries that are stable across diverse cell types are evolutionarily constrained and enriched for heritability.在不同细胞类型中稳定存在的拓扑关联域边界受到进化约束,并富集了遗传性。
Am J Hum Genet. 2021 Feb 4;108(2):269-283. doi: 10.1016/j.ajhg.2021.01.001.
6
A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information.一种结合组蛋白标记和序列信息预测拓扑关联结构域边界的计算方法。
BMC Genomics. 2019 Dec 27;20(Suppl 13):980. doi: 10.1186/s12864-019-6303-z.
7
Active chromatin and transcription play a key role in chromosome partitioning into topologically associating domains.活跃染色质和转录在染色体划分为拓扑相关结构域的过程中起关键作用。
Genome Res. 2016 Jan;26(1):70-84. doi: 10.1101/gr.196006.115. Epub 2015 Oct 30.
8
Topologically associating domains and their role in the evolution of genome structure and function in .拓扑关联域及其在 基因组结构和功能进化中的作用。
Genome Res. 2021 Mar;31(3):397-410. doi: 10.1101/gr.266130.120. Epub 2021 Feb 9.
9
Pattern recognition of topologically associating domains using deep learning.基于深度学习的拓扑关联域模式识别。
BMC Bioinformatics. 2022 Dec 8;22(Suppl 10):634. doi: 10.1186/s12859-022-05075-1.
10
Stratification of TAD boundaries reveals preferential insulation of super-enhancers by strong boundaries.拓扑关联结构域(TAD)边界的分层显示,强边界对超级增强子具有优先绝缘作用。
Nat Commun. 2018 Feb 7;9(1):542. doi: 10.1038/s41467-018-03017-1.

引用本文的文献

1
The emerging sequence grammar of 3D genome organisation.三维基因组组织中新兴的序列语法
Hum Genet. 2025 Aug 25. doi: 10.1007/s00439-025-02772-8.
2
deepTAD: an approach for identifying topologically associated domains based on convolutional neural network and transformer model.深度TAD:一种基于卷积神经网络和Transformer模型识别拓扑相关结构域的方法。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf127.
3
Machine and Deep Learning Methods for Predicting 3D Genome Organization.机器和深度学习方法预测三维基因组结构。

本文引用的文献

1
Predicting RNA-protein binding sites and motifs through combining local and global deep convolutional neural networks.通过结合局部和全局深度卷积神经网络预测 RNA 与蛋白质的结合位点和基序。
Bioinformatics. 2018 Oct 15;34(20):3427-3436. doi: 10.1093/bioinformatics/bty364.
2
Sequential regulatory activity prediction across chromosomes with convolutional neural networks.基于卷积神经网络的跨染色体顺序调控活性预测
Genome Res. 2018 May;28(5):739-750. doi: 10.1101/gr.227819.117. Epub 2018 Mar 27.
3
High-resolution TADs reveal DNA sequences underlying genome organization in flies.
Methods Mol Biol. 2025;2856:357-400. doi: 10.1007/978-1-0716-4136-1_22.
4
Machine and deep learning methods for predicting 3D genome organization.用于预测三维基因组组织的机器学习和深度学习方法。
ArXiv. 2024 Mar 4:arXiv:2403.03231v1.
5
Pattern recognition of topologically associating domains using deep learning.基于深度学习的拓扑关联域模式识别。
BMC Bioinformatics. 2022 Dec 8;22(Suppl 10):634. doi: 10.1186/s12859-022-05075-1.
6
Application of Hi-C and other omics data analysis in human cancer and cell differentiation research.Hi-C及其他组学数据分析在人类癌症与细胞分化研究中的应用。
Comput Struct Biotechnol J. 2021 Apr 8;19:2070-2083. doi: 10.1016/j.csbj.2021.04.016. eCollection 2021.
高分辨率拓扑关联结构域揭示果蝇基因组组织背后的DNA序列。
Nat Commun. 2018 Jan 15;9(1):189. doi: 10.1038/s41467-017-02525-w.
4
Chromatin accessibility prediction via a hybrid deep convolutional neural network.基于混合深度卷积神经网络的染色质可及性预测。
Bioinformatics. 2018 Mar 1;34(5):732-738. doi: 10.1093/bioinformatics/btx679.
5
Conserved role of Ovo in germline development in mouse and Drosophila.Ovo 在小鼠和果蝇生殖细胞发育中的保守作用。
Sci Rep. 2017 Jan 6;7:40056. doi: 10.1038/srep40056.
6
Different enhancer classes in Drosophila bind distinct architectural proteins and mediate unique chromatin interactions and 3D architecture.果蝇中不同的增强子类别结合不同的结构蛋白,并介导独特的染色质相互作用和三维结构。
Nucleic Acids Res. 2017 Feb 28;45(4):1714-1730. doi: 10.1093/nar/gkw1114.
7
Convolutional neural network architectures for predicting DNA-protein binding.用于预测DNA-蛋白质结合的卷积神经网络架构。
Bioinformatics. 2016 Jun 15;32(12):i121-i127. doi: 10.1093/bioinformatics/btw255.
8
Chromatin Domains: The Unit of Chromosome Organization.染色质结构域:染色体组织的单位
Mol Cell. 2016 Jun 2;62(5):668-80. doi: 10.1016/j.molcel.2016.05.018.
9
Activation of proto-oncogenes by disruption of chromosome neighborhoods.染色体邻域破坏导致原癌基因激活。
Science. 2016 Mar 25;351(6280):1454-1458. doi: 10.1126/science.aad9024. Epub 2016 Mar 3.
10
TERT rearrangements are frequent in neuroblastoma and identify aggressive tumors.TERT 重排在神经母细胞瘤中很常见,并可识别侵袭性肿瘤。
Nat Genet. 2015 Dec;47(12):1411-4. doi: 10.1038/ng.3438. Epub 2015 Nov 2.