• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种适用于跨物种染色质结构、单细胞和多组学分析的可推广的Hi-C基础模型。

A generalizable Hi-C foundation model for chromatin architecture, single-cell and multi-omics analysis across species.

作者信息

Wang Xiao, Zhang Yuanyuan, Ray Suhita, Jha Anupama, Fang Tangqi, Hang Shengqi, Doulatov Sergei, Noble William Stafford, Wang Sheng

机构信息

Department of Genome Sciences, University of Washington, Seattle, WA, USA.

Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, 98105, USA.

出版信息

bioRxiv. 2024 Dec 20:2024.12.16.628821. doi: 10.1101/2024.12.16.628821.

DOI:10.1101/2024.12.16.628821
PMID:39763871
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11702576/
Abstract

Nuclear DNA is organized into a compact three-dimensional (3D) structure that impacts critical cellular processes. High-throughput chromosome conformation capture (Hi-C) is the most widely used method for measuring 3D genome architecture, while linear epigenomic assays, such as ATAC-seq, DNase-seq, and ChIP-seq, are extensively employed to characterize epigenomic regulation. However, the integrative analysis of chromatin interactions and associated epigenomic regulation remains challenging due to the pairwise nature of Hi-C data, mismatched resolution between Hi-C and epigenomic assays, and inconsistencies among analysis tools. Here we propose HiCFoundation, a Hi-C-based foundation model for integrative analysis linking chromatin structure to downstream regulatory function. HiCFoundation is trained from hundreds of Hi-C assays encompassing 118 million contact matrix submatrices. The model achieves state-of-the-art performance on multiple types of 3D genome analysis, including reproducibility analysis, resolution enhancement, and loop detection. We further demonstrate the model's generalizability through genome architecture analysis of 316 species. Notably, by enhancing low-coverage experimental Hi-C data, HiCFoundation reveals genome-wide loop loss during differentiation of hematopoietic stem and progenitor cells (HSPCs) to neutrophils. Additionally, HiCFoundation is able to predict multiple types of epigenomic activity from Hi-C input and further interprets the link between Hi-C input and epigenomic output to reveal the relationship between chromatin conformation and genome function. Finally, HiCFoundation can analyze single-cell Hi-C data, shedding light on genome structure at single-cell resolution. HiCFoundation thus provides a unified, efficient, generalizable, and interpretable foundation for genome architecture, single-cell and multi-omics analysis across species, paving the path for systematically studying genome 3D architecture and its regulatory mechanisms.

摘要

核DNA被组织成一种紧凑的三维(3D)结构,这种结构会影响关键的细胞过程。高通量染色体构象捕获(Hi-C)是测量3D基因组结构最广泛使用的方法,而线性表观基因组分析,如ATAC-seq、DNase-seq和ChIP-seq,则被广泛用于表征表观基因组调控。然而,由于Hi-C数据的成对性质、Hi-C与表观基因组分析之间不匹配的分辨率以及分析工具之间的不一致性,染色质相互作用与相关表观基因组调控的综合分析仍然具有挑战性。在这里,我们提出了HiCFoundation,这是一种基于Hi-C的基础模型,用于将染色质结构与下游调控功能联系起来的综合分析。HiCFoundation是从数百个Hi-C实验中训练出来的,这些实验包含1.18亿个接触矩阵子矩阵。该模型在多种类型的3D基因组分析中取得了领先的性能,包括可重复性分析、分辨率增强和环检测。我们通过对316个物种的基因组结构分析进一步证明了该模型的通用性。值得注意的是,通过增强低覆盖度的实验性Hi-C数据,HiCFoundation揭示了造血干细胞和祖细胞(HSPCs)向中性粒细胞分化过程中全基因组范围的环丢失。此外,HiCFoundation能够从Hi-C输入预测多种类型的表观基因组活性,并进一步解释Hi-C输入与表观基因组输出之间的联系,以揭示染色质构象与基因组功能之间的关系。最后,HiCFoundation可以分析单细胞Hi-C数据,以单细胞分辨率揭示基因组结构。因此,HiCFoundation为跨物种的基因组结构、单细胞和多组学分析提供了一个统一、高效、通用且可解释的基础,为系统研究基因组3D结构及其调控机制铺平了道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/d9fc7fb1b94a/nihpp-2024.12.16.628821v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/63c7a47cb428/nihpp-2024.12.16.628821v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/f64d464aac23/nihpp-2024.12.16.628821v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/5679cd88d7d5/nihpp-2024.12.16.628821v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/fad2c4950e89/nihpp-2024.12.16.628821v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/fbe3a073424d/nihpp-2024.12.16.628821v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/d9fc7fb1b94a/nihpp-2024.12.16.628821v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/63c7a47cb428/nihpp-2024.12.16.628821v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/f64d464aac23/nihpp-2024.12.16.628821v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/5679cd88d7d5/nihpp-2024.12.16.628821v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/fad2c4950e89/nihpp-2024.12.16.628821v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/fbe3a073424d/nihpp-2024.12.16.628821v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01d0/11702576/d9fc7fb1b94a/nihpp-2024.12.16.628821v1-f0006.jpg

相似文献

1
A generalizable Hi-C foundation model for chromatin architecture, single-cell and multi-omics analysis across species.一种适用于跨物种染色质结构、单细胞和多组学分析的可推广的Hi-C基础模型。
bioRxiv. 2024 Dec 20:2024.12.16.628821. doi: 10.1101/2024.12.16.628821.
2
Quantifying conformational heterogeneity of 3D genome organization in fruit fly.量化果蝇三维基因组组织的构象异质性。
PLoS One. 2025 Jul 3;20(7):e0326927. doi: 10.1371/journal.pone.0326927. eCollection 2025.
3
Short-Term Memory Impairment短期记忆障碍
4
DeepExDC interprets genomic compartmentalization changes in single-cell Hi-C data.DeepExDC可解读单细胞Hi-C数据中的基因组区室化变化。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf301.
5
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
6
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
7
Normozoospermic infertile men possess subpopulations of sperm varying in DNA accessibility, relating to differing reproductive outcomes.正常精子的不育男性拥有DNA可及性不同的精子亚群,这与不同的生殖结果相关。
Hum Reprod. 2025 Jul 1;40(7):1266-1281. doi: 10.1093/humrep/deaf081.
8
Quantifying Conformational Heterogeneity of 3D Genome Organization in Fruit Fly.果蝇三维基因组组织构象异质性的量化
bioRxiv. 2025 May 27:2025.05.24.655945. doi: 10.1101/2025.05.24.655945.
9
pC-SAC: A method for high-resolution 3D genome reconstruction from low-resolution Hi-C data.pC-SAC:一种从低分辨率Hi-C数据进行高分辨率3D基因组重建的方法。
Nucleic Acids Res. 2025 Apr 10;53(7). doi: 10.1093/nar/gkaf289.
10
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

本文引用的文献

1
Comparing chromatin contact maps at scale: methods and insights.大规模比较染色质接触图谱:方法与见解。
Nat Methods. 2025 Apr;22(4):824-833. doi: 10.1038/s41592-025-02630-5. Epub 2025 Mar 19.
2
Enhancing Hi-C contact matrices for loop detection with Capricorn: a multiview diffusion model.利用 Capricorn 提高 Hi-C 接触矩阵进行环检测:一种多视图扩散模型。
Bioinformatics. 2024 Jun 28;40(Suppl 1):i471-i480. doi: 10.1093/bioinformatics/btae211.
3
GAGE-seq concurrently profiles multiscale 3D genome organization and gene expression in single cells.
GAGE-seq 可同时在单细胞中对多尺度 3D 基因组结构和基因表达进行分析。
Nat Genet. 2024 Aug;56(8):1701-1711. doi: 10.1038/s41588-024-01745-3. Epub 2024 May 14.
4
scGHOST: identifying single-cell 3D genome subcompartments.scGHOST:鉴定单细胞 3D 基因组亚区室。
Nat Methods. 2024 May;21(5):814-822. doi: 10.1038/s41592-024-02230-9. Epub 2024 Apr 8.
5
Nuclear morphology is shaped by loop-extrusion programs.核形态由环挤出程序塑造。
Nature. 2024 Mar;627(8002):196-203. doi: 10.1038/s41586-024-07086-9. Epub 2024 Feb 14.
6
A Comprehensive Evaluation of Generalizability of Deep Learning-Based Hi-C Resolution Improvement Methods.深度学习的 Hi-C 分辨率提升方法的泛化能力综合评估。
Genes (Basel). 2023 Dec 29;15(1):54. doi: 10.3390/genes15010054.
7
Linking genome structures to functions by simultaneous single-cell Hi-C and RNA-seq.通过单细胞 Hi-C 和 RNA-seq 同时将基因组结构与功能联系起来。
Science. 2023 Jun 9;380(6649):1070-1076. doi: 10.1126/science.adg3797. Epub 2023 Jun 8.
8
Epiphany: predicting Hi-C contact maps from 1D epigenomic signals.顿悟:从一维表观基因组信号预测 Hi-C 接触图谱。
Genome Biol. 2023 Jun 6;24(1):134. doi: 10.1186/s13059-023-02934-9.
9
New genetic and epigenetic insights into the chemokine system: the latest discoveries aiding progression toward precision medicine.新的遗传和表观遗传对趋化因子系统的深入了解:最新发现有助于朝着精准医学迈进。
Cell Mol Immunol. 2023 Jul;20(7):739-776. doi: 10.1038/s41423-023-01032-x. Epub 2023 May 17.
10
Normalization and de-noising of single-cell Hi-C data with BandNorm and scVI-3D.使用 BandNorm 和 scVI-3D 对单细胞 Hi-C 数据进行归一化和去噪。
Genome Biol. 2022 Oct 17;23(1):222. doi: 10.1186/s13059-022-02774-z.