• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过大语言模型预测TET和DNMT3基因敲除突变体中差异甲基化的胞嘧啶

Predicting differentially methylated cytosines in TET and DNMT3 knockout mutants via a large language model.

作者信息

Sereshki Saleh, Lonardi Stefano

机构信息

Department of Computer Science and Engineering, University of California, Riverside, 900 University Ave, Riverside, CA 92521, United States.

出版信息

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf092.

DOI:10.1093/bib/bbaf092
PMID:40079264
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11904404/
Abstract

DNA methylation is an epigenetic marker that directly or indirectly regulates several critical cellular processes. While cytosines in mammalian genomes generally maintain stable methylation patterns over time, other cytosines that belong to specific regulatory regions, such as promoters and enhancers, can exhibit dynamic changes. These changes in methylation are driven by a complex cellular machinery, in which the enzymes DNMT3 and TET play key roles. The objective of this study is to design a machine learning model capable of accurately predicting which cytosines have a fluctuating methylation level [hereafter called differentially methylated cytosines (DMCs)] from the surrounding DNA sequence. Here, we introduce L-MAP, a transformer-based large language model that is trained on DNMT3-knockout and TET-knockout data in human and mouse embryonic stem cells. Our extensive experimental results demonstrate the high accuracy of L-MAP in predicting DMCs. Our experiments also explore whether a classifier trained on human knockout data could predict DMCs in the mouse genome (and vice versa), and whether a classifier trained on DNMT3 knockout data could predict DMCs in TET knockouts (and vice versa). L-MAP enables the identification of sequence motifs associated with the enzymatic activity of DNMT3 and TET, which include known motifs but also novel binding sites that could provide new insights into DNA methylation in stem cells. L-MAP is available at https://github.com/ucrbioinfo/dmc_prediction.

摘要

DNA甲基化是一种表观遗传标记,可直接或间接调节多个关键细胞过程。虽然哺乳动物基因组中的胞嘧啶通常随时间保持稳定的甲基化模式,但属于特定调控区域(如启动子和增强子)的其他胞嘧啶可表现出动态变化。这些甲基化变化由复杂的细胞机制驱动,其中DNMT3和TET酶发挥关键作用。本研究的目的是设计一种机器学习模型,能够根据周围的DNA序列准确预测哪些胞嘧啶具有波动的甲基化水平(以下称为差异甲基化胞嘧啶,DMCs)。在此,我们介绍L-MAP,这是一种基于Transformer的大语言模型,它在人类和小鼠胚胎干细胞的DNMT3基因敲除和TET基因敲除数据上进行训练。我们广泛的实验结果证明了L-MAP在预测DMCs方面的高准确性。我们的实验还探讨了在人类基因敲除数据上训练的分类器是否可以预测小鼠基因组中的DMCs(反之亦然),以及在DNMT3基因敲除数据上训练的分类器是否可以预测TET基因敲除中的DMCs(反之亦然)。L-MAP能够识别与DNMT3和TET酶活性相关的序列基序,其中包括已知基序以及可能为干细胞中的DNA甲基化提供新见解的新结合位点。可通过https://github.com/ucrbioinfo/dmc_prediction获取L-MAP。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a73/11904404/e72d9383c096/bbaf092f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a73/11904404/cf8114339747/bbaf092f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a73/11904404/f231855314e4/bbaf092f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a73/11904404/a6edabc799cf/bbaf092f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a73/11904404/e72d9383c096/bbaf092f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a73/11904404/cf8114339747/bbaf092f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a73/11904404/f231855314e4/bbaf092f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a73/11904404/a6edabc799cf/bbaf092f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a73/11904404/e72d9383c096/bbaf092f4.jpg

相似文献

1
Predicting differentially methylated cytosines in TET and DNMT3 knockout mutants via a large language model.通过大语言模型预测TET和DNMT3基因敲除突变体中差异甲基化的胞嘧啶
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf092.
2
Predicting Differentially Methylated Cytosines in TET and DNMT3 Knockout Mutants via a Large Language Model.通过大语言模型预测TET和DNMT3基因敲除突变体中的差异甲基化胞嘧啶
bioRxiv. 2024 Sep 4:2024.05.02.592257. doi: 10.1101/2024.05.02.592257.
3
TETs compete with DNMT3 activity in pluripotent cells at thousands of methylated somatic enhancers.TETs 在多能细胞中与 DNMT3 活性在数千个甲基化的体基因增强子上竞争。
Nat Genet. 2020 Aug;52(8):819-827. doi: 10.1038/s41588-020-0639-9. Epub 2020 Jun 8.
4
Epigenetic engineering of yeast reveals dynamic molecular adaptation to methylation stress and genetic modulators of specific DNMT3 family members.酵母的表观遗传学工程揭示了对甲基化应激和特定 DNMT3 家族成员的遗传调节剂的动态分子适应。
Nucleic Acids Res. 2020 May 7;48(8):4081-4099. doi: 10.1093/nar/gkaa161.
5
DNMT3A and TET1 cooperate to regulate promoter epigenetic landscapes in mouse embryonic stem cells.DNMT3A 和 TET1 合作调控小鼠胚胎干细胞启动子的表观遗传景观。
Genome Biol. 2018 Jul 12;19(1):88. doi: 10.1186/s13059-018-1464-7.
6
DNMT1 regulates the timing of DNA methylation by DNMT3 in an enzymatic activity-dependent manner in mouse embryonic stem cells.DNMT1 通过酶活性依赖性方式调节小鼠胚胎干细胞中 DNA 甲基化的时间。
PLoS One. 2022 Jan 5;17(1):e0262277. doi: 10.1371/journal.pone.0262277. eCollection 2022.
7
A genome-scale map of DNA methylation turnover identifies site-specific dependencies of DNMT and TET activity.全基因组 DNA 甲基化周转率图谱鉴定出 DNMT 和 TET 活性的位点特异性依赖性。
Nat Commun. 2020 May 29;11(1):2680. doi: 10.1038/s41467-020-16354-x.
8
Distinct roles of DNMT1-dependent and DNMT1-independent methylation patterns in the genome of mouse embryonic stem cells.DNA甲基转移酶1(DNMT1)依赖性和非DNMT1依赖性甲基化模式在小鼠胚胎干细胞基因组中的不同作用。
Genome Biol. 2015 Jun 2;16(1):115. doi: 10.1186/s13059-015-0685-2.
9
Vitamin C induces Tet-dependent DNA demethylation and a blastocyst-like state in ES cells.维生素 C 诱导胚胎干细胞中的 Tet 依赖性 DNA 去甲基化和类囊胚状态。
Nature. 2013 Aug 8;500(7461):222-6. doi: 10.1038/nature12362. Epub 2013 Jun 30.
10
The inactive Dnmt3b3 isoform preferentially enhances Dnmt3b-mediated DNA methylation.无活性的 Dnmt3b3 异构体优先增强 Dnmt3b 介导的 DNA 甲基化。
Genes Dev. 2020 Nov 1;34(21-22):1546-1558. doi: 10.1101/gad.341925.120. Epub 2020 Oct 1.

本文引用的文献

1
Development of a novel representation of drug 3D structures and enhancement of the TSR-based method for probing drug and target interactions.开发一种新的药物 3D 结构表示方法,并增强基于 TSR 的方法以探测药物与靶标相互作用。
Comput Biol Chem. 2024 Oct;112:108117. doi: 10.1016/j.compbiolchem.2024.108117. Epub 2024 Jun 4.
2
Deciphering the TET3 interactome in primary thymic developing T cells.解析原发性胸腺发育中T细胞的TET3相互作用组。
iScience. 2024 Apr 18;27(5):109782. doi: 10.1016/j.isci.2024.109782. eCollection 2024 May 17.
3
ZFP281 controls transcriptional and epigenetic changes promoting mouse pluripotent state transitions via DNMT3 and TET1.
ZFP281 通过 DNMT3 和 TET1 控制转录和表观遗传变化,促进小鼠多能状态的转变。
Dev Cell. 2024 Feb 26;59(4):465-481.e6. doi: 10.1016/j.devcel.2023.12.018. Epub 2024 Jan 17.
4
TET Enzymes and 5hmC Levels in Carcinogenesis and Progression of Breast Cancer: Potential Therapeutic Targets.TET 酶和 5hmC 水平在乳腺癌发生和进展中的作用:潜在的治疗靶点。
Int J Mol Sci. 2023 Dec 24;25(1):272. doi: 10.3390/ijms25010272.
5
A genome-wide screen reveals new regulators of the 2-cell-like cell state.全基因组筛选揭示了 2 细胞样细胞状态的新调控因子。
Nat Struct Mol Biol. 2023 Aug;30(8):1105-1118. doi: 10.1038/s41594-023-01038-z. Epub 2023 Jul 24.
6
TET2 and TET3 loss disrupts small intestine differentiation and homeostasis.TET2 和 TET3 的缺失会破坏小肠的分化和稳态。
Nat Commun. 2023 Jul 6;14(1):4005. doi: 10.1038/s41467-023-39512-3.
7
On the prediction of non-CG DNA methylation using machine learning.基于机器学习的非CG DNA甲基化预测研究
NAR Genom Bioinform. 2023 May 17;5(2):lqad045. doi: 10.1093/nargab/lqad045. eCollection 2023 Jun.
8
Evaluate the developmental competence of human 8-cell embryos by single-cell RNA sequencing.通过单细胞 RNA 测序评估人类 8 细胞胚胎的发育能力。
Reprod Fertil. 2023 Apr 19;4(2). doi: 10.1530/RAF-22-0119. Print 2023 Apr 1.
9
ZNF320 is a hypomethylated prognostic biomarker involved in immune infiltration of hepatocellular carcinoma and associated with cell cycle.ZNF320 是一种低甲基化的预后生物标志物,参与肝癌的免疫浸润,与细胞周期相关。
Aging (Albany NY). 2022 Oct 26;14(20):8411-8436. doi: 10.18632/aging.204350.
10
Pronounced sequence specificity of the TET enzyme catalytic domain guides its cellular function.TET 酶催化结构域的明显序列特异性指导其细胞功能。
Sci Adv. 2022 Sep 9;8(36):eabm2427. doi: 10.1126/sciadv.abm2427. Epub 2022 Sep 7.