• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测细胞类型特异性表观基因组图谱,同时考虑远端遗传效应。

Predicting cell type-specific epigenomic profiles accounting for distal genetic effects.

机构信息

UK Dementia Research Institute at Imperial College London, London, W12 0BZ, UK.

Department of Brain Sciences, Imperial College London, London, W12 0BZ, UK.

出版信息

Nat Commun. 2024 Nov 16;15(1):9951. doi: 10.1038/s41467-024-54441-5.

DOI:10.1038/s41467-024-54441-5
PMID:39550354
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11569248/
Abstract

Understanding how genetic variants affect the epigenome is key to interpreting GWAS, yet profiling these effects across the non-coding genome remains challenging due to experimental scalability. This necessitates accurate computational models. Existing machine learning approaches, while progressively improving, are confined to the cell types they were trained on, limiting their applicability. Here, we introduce Enformer Celltyping, a deep learning model which incorporates distal effects of DNA interactions, up to 100,000 base-pairs away, to predict epigenetic signals in previously unseen cell types. Using DNA and chromatin accessibility data for epigenetic imputation, Enformer Celltyping outperforms current best-in-class approaches and generalises across cell types and biological regions. Moreover, we propose a framework for evaluating models on genetic variant effect prediction using regulatory quantitative trait loci mapping studies, highlighting current limitations in genomic deep learning models. Despite this, Enformer Celltyping can also be used to study cell type-specific genetic enrichment of complex traits.

摘要

了解遗传变异如何影响表观基因组对于解释 GWAS 至关重要,但由于实验的可扩展性,对非编码基因组中的这些影响进行描绘仍然具有挑战性。这需要准确的计算模型。现有的机器学习方法虽然在不断改进,但仅限于它们所训练的细胞类型,限制了它们的适用性。在这里,我们介绍了 Enformer Celltyping,这是一种深度学习模型,它将 DNA 相互作用的远端效应纳入其中,距离可达 100,000 个碱基,以预测以前未见的细胞类型中的表观遗传信号。使用 DNA 和染色质可及性数据进行表观遗传推断,Enformer Celltyping 的表现优于当前的最佳方法,并在细胞类型和生物区域上具有通用性。此外,我们提出了一种使用调控数量性状基因座作图研究评估模型在遗传变异效应预测方面的框架,突出了基因组深度学习模型目前的局限性。尽管如此,Enformer Celltyping 也可用于研究复杂性状的特定于细胞类型的遗传富集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/726c03cb1b47/41467_2024_54441_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/c731353e2558/41467_2024_54441_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/ad0186568ad6/41467_2024_54441_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/eb3cb3764560/41467_2024_54441_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/9665bad10631/41467_2024_54441_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/ae6c40f649b2/41467_2024_54441_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/c783e67e338e/41467_2024_54441_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/1e1251d4186e/41467_2024_54441_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/726c03cb1b47/41467_2024_54441_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/c731353e2558/41467_2024_54441_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/ad0186568ad6/41467_2024_54441_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/eb3cb3764560/41467_2024_54441_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/9665bad10631/41467_2024_54441_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/ae6c40f649b2/41467_2024_54441_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/c783e67e338e/41467_2024_54441_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/1e1251d4186e/41467_2024_54441_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4228/11569248/726c03cb1b47/41467_2024_54441_Fig8_HTML.jpg

相似文献

1
Predicting cell type-specific epigenomic profiles accounting for distal genetic effects.预测细胞类型特异性表观基因组图谱,同时考虑远端遗传效应。
Nat Commun. 2024 Nov 16;15(1):9951. doi: 10.1038/s41467-024-54441-5.
2
Predicting regulatory variants using a dense epigenomic mapped CNN model elucidated the molecular basis of trait-tissue associations.使用密集表观基因组映射卷积神经网络模型预测调控变异,阐明了性状-组织关联的分子基础。
Nucleic Acids Res. 2021 Jan 11;49(1):53-66. doi: 10.1093/nar/gkaa1137.
3
Chromatin Profiling Techniques: Exploring the Chromatin Environment and Its Contributions to Complex Traits.染色质谱分析技术:探索染色质环境及其对复杂性状的贡献。
Int J Mol Sci. 2021 Jul 16;22(14):7612. doi: 10.3390/ijms22147612.
4
cepip: context-dependent epigenomic weighting for prioritization of regulatory variants and disease-associated genes.cepip:用于调控变异和疾病相关基因优先级排序的上下文依赖表观基因组加权法
Genome Biol. 2017 Mar 16;18(1):52. doi: 10.1186/s13059-017-1177-3.
5
Current genomic deep learning models display decreased performance in cell type-specific accessible regions.目前的基因组深度学习模型在细胞类型特异性可及区域的表现有所下降。
Genome Biol. 2024 Aug 1;25(1):202. doi: 10.1186/s13059-024-03335-2.
6
Deep learning models predict regulatory variants in pancreatic islets and refine type 2 diabetes association signals.深度学习模型预测胰腺胰岛中的调控变异,并细化 2 型糖尿病关联信号。
Elife. 2020 Jan 27;9:e51503. doi: 10.7554/eLife.51503.
7
DeepHistone: a deep learning approach to predicting histone modifications.DeepHistone:一种深度学习方法,用于预测组蛋白修饰。
BMC Genomics. 2019 Apr 4;20(Suppl 2):193. doi: 10.1186/s12864-019-5489-4.
8
Unveiling chromatin dynamics with virtual epigenome.利用虚拟表观基因组揭示染色质动力学。
Nat Commun. 2025 Apr 12;16(1):3491. doi: 10.1038/s41467-025-58481-3.
9
A review of deep learning models for the prediction of chromatin interactions with DNA and epigenomic profiles.用于预测染色质与DNA相互作用及表观基因组图谱的深度学习模型综述。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae651.
10
Chromatin accessibility and gene expression during adipocyte differentiation identify context-dependent effects at cardiometabolic GWAS loci.脂肪细胞分化过程中的染色质可及性和基因表达鉴定出心脏代谢 GWAS 位点的上下文相关效应。
PLoS Genet. 2021 Oct 26;17(10):e1009865. doi: 10.1371/journal.pgen.1009865. eCollection 2021 Oct.

引用本文的文献

1
Predicting gene expression from DNA sequence using deep learning models.使用深度学习模型从DNA序列预测基因表达。
Nat Rev Genet. 2025 May 13. doi: 10.1038/s41576-025-00841-2.
2
Pinpointing the integration of artificial intelligence in liver cancer immune microenvironment.精准定位人工智能在肝癌免疫微环境中的整合情况。
Front Immunol. 2024 Dec 20;15:1520398. doi: 10.3389/fimmu.2024.1520398. eCollection 2024.
3
Predicting gene expression from histone marks using chromatin deep learning models depends on histone mark function, regulatory distance and cellular states.

本文引用的文献

1
Benchmarking of deep neural networks for predicting personal gene expression from DNA sequence highlights shortcomings.用于从DNA序列预测个人基因表达的深度神经网络基准测试凸显了不足之处。
Nat Genet. 2023 Dec;55(12):2060-2064. doi: 10.1038/s41588-023-01524-6. Epub 2023 Nov 30.
2
Evaluating deep learning for predicting epigenomic profiles.评估用于预测表观基因组图谱的深度学习。
Nat Mach Intell. 2022 Dec;4(12):1088-1100. doi: 10.1038/s42256-022-00570-9. Epub 2022 Dec 5.
3
The ENCODE Imputation Challenge: a critical assessment of methods for cross-cell type imputation of epigenomic profiles.
使用染色质深度学习模型从组蛋白标记预测基因表达取决于组蛋白标记功能、调控距离和细胞状态。
Nucleic Acids Res. 2025 Feb 8;53(4). doi: 10.1093/nar/gkae1212.
ENCODE 插补挑战:对跨细胞类型表观基因组谱插补方法的批判性评估。
Genome Biol. 2023 Apr 18;24(1):79. doi: 10.1186/s13059-023-02915-y.
4
Current sequence-based models capture gene expression determinants in promoters but mostly ignore distal enhancers.目前基于序列的模型可以捕捉启动子中的基因表达决定因素,但大多忽略了远端增强子。
Genome Biol. 2023 Mar 27;24(1):56. doi: 10.1186/s13059-023-02899-9.
5
Mapping genomic loci implicates genes and synaptic biology in schizophrenia.基因组定位研究提示精神分裂症的发病与基因及突触生物学有关。
Nature. 2022 Apr;604(7906):502-508. doi: 10.1038/s41586-022-04434-5. Epub 2022 Apr 8.
6
New insights into the genetic etiology of Alzheimer's disease and related dementias.阿尔茨海默病及相关痴呆症的遗传学病因新见解。
Nat Genet. 2022 Apr;54(4):412-436. doi: 10.1038/s41588-022-01024-z. Epub 2022 Apr 4.
7
Fine-mapping causal variants - why finding 'the one' can be futile.精细定位因果变异——为何寻找“那个”变异可能徒劳无功。
Nat Rev Genet. 2022 May;23(5):261. doi: 10.1038/s41576-022-00484-7.
8
Genetic associations at regulatory phenotypes improve fine-mapping of causal variants for 12 immune-mediated diseases.调控表型的基因关联改善了12种免疫介导疾病因果变异的精细定位。
Nat Genet. 2022 Mar;54(3):251-262. doi: 10.1038/s41588-022-01025-y. Epub 2022 Mar 14.
9
Navigating the pitfalls of applying machine learning in genomics.在基因组学中应用机器学习的陷阱。
Nat Rev Genet. 2022 Mar;23(3):169-181. doi: 10.1038/s41576-021-00434-9. Epub 2021 Nov 26.
10
Effective gene expression prediction from sequence by integrating long-range interactions.通过整合长程相互作用,从序列中有效预测基因表达。
Nat Methods. 2021 Oct;18(10):1196-1203. doi: 10.1038/s41592-021-01252-x. Epub 2021 Oct 4.