Identifying topologically associating domains using differential kernels.

Affiliations

Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America.

Department of Cell Biology, Yale School of Medicine, New Haven, Connecticut, United States of America.

Publication information

PLoS Comput Biol. 2024 Jul 15;20(7):e1012221. doi: 10.1371/journal.pcbi.1012221. eCollection 2024 Jul.

DOI:10.1371/journal.pcbi.1012221
PMID:39008525
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11249266/
Abstract

Chromatin is a polymer complex of DNA and proteins that regulates gene expression. The three-dimensional (3D) structure and organization of chromatin controls DNA transcription and replication. High-throughput chromatin conformation capture techniques generate Hi-C maps that can provide insight into the 3D structure of chromatin. Hi-C maps can be represented as a symmetric matrix [Formula: see text], where each element represents the average contact probability or number of contacts between chromatin loci i and j. Previous studies have detected topologically associating domains (TADs), or self-interacting regions in [Formula: see text] within which the contact probability is greater than that outside the region. Many algorithms have been developed to identify TADs within Hi-C maps. However, most TAD identification algorithms are unable to identify nested or overlapping TADs and for a given Hi-C map there is significant variation in the location and number of TADs identified by different methods. We develop a novel method to identify TADs, KerTAD, using a kernel-based technique from computer vision and image processing that is able to accurately identify nested and overlapping TADs. We benchmark this method against state-of-the-art TAD identification methods on both synthetic and experimental data sets. We find that the new method consistently has higher true positive rates (TPR) and lower false discovery rates (FDR) than all tested methods for both synthetic and manually annotated experimental Hi-C maps. The TPR for KerTAD is also largely insensitive to increasing noise and sparsity, in contrast to the other methods. We also find that KerTAD is consistent in the number and size of TADs identified across replicate experimental Hi-C maps for several organisms. Thus, KerTAD will improve automated TAD identification and enable researchers to better correlate changes in TADs to biological phenomena, such as enhancer-promoter interactions and disease states.
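
The abstract's two central ideas, a Hi-C map as a symmetric contact matrix and benchmarking by TPR and FDR, can be illustrated with a small sketch. This is not KerTAD and not the authors' benchmarking code. The function names, the toy block model with uniform contact values, and the exact-match criterion for counting a predicted TAD as correct are all assumptions made for illustration only.

```python
import numpy as np


def synthetic_contact_map(n_loci, tads, inside=1.0, background=0.1):
    """Build a toy symmetric contact map (hypothetical model, not from the paper).

    `tads` is a list of half-open (start, end) locus ranges, which may be
    nested or overlapping. Each TAD adds `inside` to its diagonal block, so
    contacts within a TAD exceed the `background` level outside it.
    """
    cmap = np.full((n_loci, n_loci), background, dtype=float)
    for start, end in tads:
        cmap[start:end, start:end] += inside
    return cmap


def tpr_fdr(predicted, reference):
    """Return (TPR, FDR) for predicted TADs against reference TADs.

    Assumption: a prediction counts as a true positive only if its
    (start, end) pair matches a reference TAD exactly.
    TPR = TP / (TP + FN), FDR = FP / (TP + FP).
    """
    predicted, reference = set(predicted), set(reference)
    tp = len(predicted & reference)
    fn = len(reference - predicted)
    fp = len(predicted - reference)
    tpr = tp / (tp + fn) if (tp + fn) else 0.0
    fdr = fp / (tp + fp) if (tp + fp) else 0.0
    return tpr, fdr


# (2, 6) is nested inside (0, 10); (8, 16) overlaps (0, 10).
truth = [(0, 10), (2, 6), (8, 16)]
cmap = synthetic_contact_map(20, truth)

# A hypothetical caller output: two correct TADs and one spurious TAD.
predicted = [(0, 10), (2, 6), (12, 18)]
tpr, fdr = tpr_fdr(predicted, truth)
print(f"TPR = {tpr:.3f}, FDR = {fdr:.3f}")
```

In this toy case two of the three reference TADs are recovered and one of the three predictions is spurious. A real benchmark would usually allow some tolerance in boundary positions instead of requiring exact matches.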

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf04/11249266/d04ceeb8bef3/pcbi.1012221.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf04/11249266/bf81a02246d2/pcbi.1012221.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf04/11249266/98c0f995a381/pcbi.1012221.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf04/11249266/2c8e51a463cb/pcbi.1012221.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf04/11249266/e637fa891f86/pcbi.1012221.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf04/11249266/54d0f16b8cf1/pcbi.1012221.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf04/11249266/62697ec18ea5/pcbi.1012221.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf04/11249266/8de6ea0b3276/pcbi.1012221.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf04/11249266/d1c941a2c7b5/pcbi.1012221.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf04/11249266/17c2aad41c63/pcbi.1012221.g010.jpg

Similar articles

1
Identifying topologically associating domains using differential kernels.
PLoS Comput Biol. 2024 Jul 15;20(7):e1012221. doi: 10.1371/journal.pcbi.1012221. eCollection 2024 Jul.
2
MrTADFinder: A network modularity based approach to identify topologically associating domains in multiple resolutions.
PLoS Comput Biol. 2017 Jul 24;13(7):e1005647. doi: 10.1371/journal.pcbi.1005647. eCollection 2017 Jul.
3
Methods for the Analysis of Topologically Associating Domains (TADs).
Methods Mol Biol. 2022;2301:39-59. doi: 10.1007/978-1-0716-1390-0_3.
4
A comparison of topologically associating domain callers over mammals at high resolution.
BMC Bioinformatics. 2022 Apr 12;23(1):127. doi: 10.1186/s12859-022-04674-2.
5
SpectralTAD: an R package for defining a hierarchy of topologically associated domains using spectral clustering.
BMC Bioinformatics. 2020 Jul 20;21(1):319. doi: 10.1186/s12859-020-03652-w.
6
Identification of hierarchical chromatin domains.
Bioinformatics. 2016 Jun 1;32(11):1601-9. doi: 10.1093/bioinformatics/btv485. Epub 2015 Aug 26.
7
HiTAD: detecting the structural and functional hierarchies of topologically associating domains from chromatin interactions.
Nucleic Acids Res. 2017 Nov 2;45(19):e163. doi: 10.1093/nar/gkx735.
8
A Comparison of Topologically Associating Domain Callers Based on Hi-C Data.
IEEE/ACM Trans Comput Biol Bioinform. 2023 Jan-Feb;20(1):15-29. doi: 10.1109/TCBB.2022.3147805. Epub 2023 Feb 3.
9
DiffDomain enables identification of structurally reorganized topologically associating domains.
Nat Commun. 2024 Jan 13;15(1):502. doi: 10.1038/s41467-024-44782-6.
10
Computational Analysis of Hi-C Data.
Methods Mol Biol. 2021;2157:103-125. doi: 10.1007/978-1-0716-0664-3_7.
