• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多模态学习解码染色质相关蛋白的全局结合图谱。

Multimodal learning decodes the global binding landscape of chromatin-associated proteins.

作者信息

Tan Jimin, Fu Xi, Ling Xinyu, Mo Shentong, Bai Jiangshan, Rabadán Raúl, Fenyö David, Boeke Jef D, Tsirigos Aristotelis, Xia Bo

机构信息

Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.

出版信息

bioRxiv. 2025 Aug 17:2025.08.17.670761. doi: 10.1101/2025.08.17.670761.

DOI:10.1101/2025.08.17.670761
PMID:40832322
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12363962/
Abstract

Chromatin-associated proteins (CAPs), including over 1,600 transcription factors, bind directly or indirectly to the genomic DNA to regulate gene expression and determine a myriad of cell types. Mapping their genome-wide binding and co-binding landscape is essential towards a mechanistic understanding of their functions in gene regulation and resulting cellular phenotypes. However, due to the lack of techniques that effectively scale across proteins and biological samples, their genome-wide binding profiles remain challenging to obtain, particularly in primary cells. Here we present Chromnitron, a multimodal foundation model that accurately predicts CAP binding landscapes across hundreds of proteins in unseen cell types. Via perturbation experiments, we show that the model learned principles of CAP binding from multimodal features including DNA sequence motifs, chromatin accessibility levels, and protein functional domains. Applying Chromnitron to study cell fate transitions, we discovered novel CAPs regulating the T cell exhaustion process. Furthermore, Chromnitron can predict the dynamic CAP binding landscapes during development, revealing the global orchestration of protein and regulatory element activities in neurogenesis. We expect Chromnitron to accelerate discovery and engineering in regulatory genomics, particularly in human primary cells, and empower future therapeutic opportunities.

摘要

染色质相关蛋白(CAPs),包括1600多种转录因子,直接或间接与基因组DNA结合,以调节基因表达并决定多种细胞类型。绘制它们在全基因组范围内的结合和共结合图谱,对于从机制上理解它们在基因调控和由此产生的细胞表型中的功能至关重要。然而,由于缺乏能有效适用于多种蛋白质和生物样本的技术,获取它们在全基因组范围内的结合图谱仍然具有挑战性,尤其是在原代细胞中。在此,我们展示了Chromnitron,这是一种多模态基础模型,能够准确预测在未见过的细胞类型中数百种蛋白质的CAP结合图谱。通过扰动实验,我们表明该模型从包括DNA序列基序、染色质可及性水平和蛋白质功能域在内的多模态特征中学习到了CAP结合的原理。将Chromnitron应用于研究细胞命运转变,我们发现了调节T细胞耗竭过程的新型CAPs。此外,Chromnitron可以预测发育过程中动态的CAP结合图谱,揭示神经发生过程中蛋白质和调控元件活动的全局协调。我们期望Chromnitron能加速调控基因组学的发现和工程研究,尤其是在人类原代细胞中,并为未来的治疗机会提供助力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/cea6e78e644c/nihpp-2025.08.17.670761v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/f02f9eaf6c56/nihpp-2025.08.17.670761v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/77a96aa140d0/nihpp-2025.08.17.670761v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/15d8af2f5ef2/nihpp-2025.08.17.670761v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/95203bacefdd/nihpp-2025.08.17.670761v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/2c52b0db85bd/nihpp-2025.08.17.670761v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/cea6e78e644c/nihpp-2025.08.17.670761v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/f02f9eaf6c56/nihpp-2025.08.17.670761v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/77a96aa140d0/nihpp-2025.08.17.670761v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/15d8af2f5ef2/nihpp-2025.08.17.670761v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/95203bacefdd/nihpp-2025.08.17.670761v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/2c52b0db85bd/nihpp-2025.08.17.670761v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/12363962/cea6e78e644c/nihpp-2025.08.17.670761v1-f0006.jpg

相似文献

1
Multimodal learning decodes the global binding landscape of chromatin-associated proteins.多模态学习解码染色质相关蛋白的全局结合图谱。
bioRxiv. 2025 Aug 17:2025.08.17.670761. doi: 10.1101/2025.08.17.670761.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
DNA methylation affects gene expression but not global chromatin structure in .DNA甲基化影响基因表达,但不影响……中的整体染色质结构。 (原文句子不完整,缺少具体研究对象等关键信息)
J Bacteriol. 2025 Jul 14:e0054024. doi: 10.1128/jb.00540-24.
4
Chromosome organization by one-sided and two-sided loop extrusion.染色体通过单侧和双侧环挤压进行组织。
Elife. 2020 Apr 6;9:e53558. doi: 10.7554/eLife.53558.
5
Short-Term Memory Impairment短期记忆障碍
6
Multiple long-range interactions generate CTCF insulator-dependent viral chromatin domains in quiescent HSV-1 genomes.多种长程相互作用在静止的单纯疱疹病毒1型(HSV-1)基因组中产生依赖CCCTC结合因子(CTCF)绝缘子的病毒染色质结构域。
mBio. 2025 Aug 28:e0163825. doi: 10.1128/mbio.01638-25.
7
Functional genomic analysis of non-canonical DNA regulatory elements of the aryl hydrocarbon receptor.芳烃受体非经典DNA调控元件的功能基因组分析
bioRxiv. 2025 May 22:2023.05.01.538985. doi: 10.1101/2023.05.01.538985.
8
Δ133p53α and Δ160p53α isoforms of the tumor suppressor protein p53 exert dominant-negative effect primarily by co-aggregation.肿瘤抑制蛋白p53的Δ133p53α和Δ160p53α亚型主要通过共聚集发挥显性负效应。
Elife. 2025 Jul 21;14:RP106469. doi: 10.7554/eLife.106469.
9
Interplay between tumor mutation burden and the tumor microenvironment predicts the prognosis of pan-cancer anti-PD-1/PD-L1 therapy.肿瘤突变负荷与肿瘤微环境之间的相互作用可预测泛癌抗PD-1/PD-L1治疗的预后。
Front Immunol. 2025 Jul 24;16:1557461. doi: 10.3389/fimmu.2025.1557461. eCollection 2025.
10
The landscape of long non-coding RNA during cSCC progression.皮肤鳞状细胞癌进展过程中长链非编码RNA的情况
Br J Dermatol. 2025 Mar 27. doi: 10.1093/bjd/ljaf108.

本文引用的文献

1
Chromatin-dependent motif syntax defines differentiation trajectories.染色质依赖性基序语法定义分化轨迹。
Mol Cell. 2025 Aug 7;85(15):2900-2918.e16. doi: 10.1016/j.molcel.2025.07.005.
2
Multiscale footprints reveal the organization of cis-regulatory elements.多尺度足迹揭示顺式调控元件的组织方式。
Nature. 2025 Feb;638(8051):779-786. doi: 10.1038/s41586-024-08443-4. Epub 2025 Jan 22.
3
Simulating 500 million years of evolution with a language model.用语言模型模拟5亿年的进化历程。
Science. 2025 Feb 21;387(6736):850-858. doi: 10.1126/science.ads0018. Epub 2025 Jan 16.
4
Predicting RNA-seq coverage from DNA sequence as a unifying model of gene regulation.将DNA序列预测RNA测序覆盖度作为基因调控的统一模型。
Nat Genet. 2025 Apr;57(4):949-961. doi: 10.1038/s41588-024-02053-6. Epub 2025 Jan 8.
5
A foundation model of transcription across human cell types.一种跨人类细胞类型的转录基础模型。
Nature. 2025 Jan;637(8047):965-973. doi: 10.1038/s41586-024-08391-z. Epub 2025 Jan 8.
6
Single-molecule states link transcription factor binding to gene expression.单分子状态将转录因子结合与基因表达联系起来。
Nature. 2024 Dec;636(8043):745-754. doi: 10.1038/s41586-024-08219-w. Epub 2024 Nov 20.
7
InterPro: the protein sequence classification resource in 2025.InterPro:2025年的蛋白质序列分类资源。
Nucleic Acids Res. 2025 Jan 6;53(D1):D444-D456. doi: 10.1093/nar/gkae1082.
8
Sequence modeling and design from molecular to genome scale with Evo.基于 Evo 在从分子到基因组尺度上进行序列建模和设计。
Science. 2024 Nov 15;386(6723):eado9336. doi: 10.1126/science.ado9336.
9
Degradation of IKZF1 prevents epigenetic progression of T cell exhaustion in an antigen-specific assay.IKZF1 降解可防止抗原特异性测定中 T 细胞耗竭的表观遗传进展。
Cell Rep Med. 2024 Nov 19;5(11):101804. doi: 10.1016/j.xcrm.2024.101804. Epub 2024 Oct 31.
10
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.