• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

FoldMark:通过分布和进化水印保护蛋白质结构生成模型

FoldMark: Safeguarding Protein Structure Generative Models with Distributional and Evolutionary Watermarking.

作者信息

Zhang Zaixi, Jin Ruofan, Xu Guangxue, Wang Xiaotong, Zitnik Marinka, Cong Le, Wang Mengdi

机构信息

Princeton University, Princeton, NJ, USA.

Stanford University, Stanford, CA, USA.

出版信息

bioRxiv. 2025 Jun 4:2024.10.23.619960. doi: 10.1101/2024.10.23.619960.

DOI:10.1101/2024.10.23.619960
PMID:39554012
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11565776/
Abstract

Proteins are the principal architects of life, fueling advances in bioengineering, drug discovery, and synthetic biology. The integration of generative AI with computational protein science has revolutionized protein design while also posing dual-use risks, such as enabling the creation of pandemic-capable proteins that require strong biosecurity safeguards. Here, we introduce FoldMark, a first-of-its-kind watermarking strategy leveraging distributional and evolutionary principles tailored for protein generative models, balancing watermark capacity and structural quality. FoldMark achieves over 95% watermark bit accuracy at 32 bits with minimal impact on structural quality (>0.9 scTM scores) for leading models including AlphaFold3, ESMFold, RFDiffusion, and RFDiffusionAA. For user tracing, FoldMark can successfully trace up to users. To validate FoldMark in wet lab, we applied it to structure-based design of EGFP and CRISPR-Cas13, showing wildtype-level function (98% fluorescence, 95% editing efficiency) and >90% watermark detection, demonstrating its practical utility for safeguarding AI-driven protein research.

摘要

蛋白质是生命的主要构建者,推动着生物工程、药物研发和合成生物学的发展。生成式人工智能与计算蛋白质科学的整合彻底改变了蛋白质设计,但同时也带来了两用风险,比如可能催生需要严格生物安全防护措施的具备大流行能力的蛋白质。在此,我们介绍FoldMark,这是一种首创的水印策略,它利用为蛋白质生成模型量身定制的分布和进化原理,平衡水印容量和结构质量。对于包括AlphaFold3、ESMFold、RFDiffusion和RFDiffusionAA在内的领先模型,FoldMark在32位时水印比特准确率超过95%,对结构质量的影响极小(结构相似性得分>0.9)。对于用户追踪,FoldMark最多可成功追踪 用户。为了在湿实验室中验证FoldMark,我们将其应用于基于结构的增强绿色荧光蛋白(EGFP)和CRISPR-Cas13设计,结果显示其具有野生型水平的功能(98%的荧光、95%的编辑效率)以及>90%的水印检测率,证明了它在保障人工智能驱动的蛋白质研究方面的实际效用。 (原文中“To validate FoldMark in wet lab, we applied it to structure-based design of EGFP and CRISPR-Cas13, showing wildtype-level function (98% fluorescence, 95% editing efficiency) and >90% watermark detection, demonstrating its practical utility for safeguarding AI-driven protein research.”部分括号内的内容翻译时疑似原文有误,98% fluorescence应改为98% fluorescence intensity,95% editing efficiency应改为95% editing accuracy,但按照要求未做修改)

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/c9f87874953c/nihpp-2024.10.23.619960v7-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/1fae7ac0eeeb/nihpp-2024.10.23.619960v7-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/16a4e4c6ef06/nihpp-2024.10.23.619960v7-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/44bfead7ea2a/nihpp-2024.10.23.619960v7-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/8b2b342f0f7b/nihpp-2024.10.23.619960v7-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/f9c26f0fe64c/nihpp-2024.10.23.619960v7-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/c9f87874953c/nihpp-2024.10.23.619960v7-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/1fae7ac0eeeb/nihpp-2024.10.23.619960v7-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/16a4e4c6ef06/nihpp-2024.10.23.619960v7-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/44bfead7ea2a/nihpp-2024.10.23.619960v7-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/8b2b342f0f7b/nihpp-2024.10.23.619960v7-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/f9c26f0fe64c/nihpp-2024.10.23.619960v7-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8171/12218785/c9f87874953c/nihpp-2024.10.23.619960v7-f0006.jpg

相似文献

1
FoldMark: Safeguarding Protein Structure Generative Models with Distributional and Evolutionary Watermarking.FoldMark:通过分布和进化水印保护蛋白质结构生成模型
bioRxiv. 2025 Jun 4:2024.10.23.619960. doi: 10.1101/2024.10.23.619960.
2
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
3
Linking transcriptome and morphology in bone cells at cellular resolution with generative AI.利用生成式人工智能在细胞分辨率下将骨细胞中的转录组与形态学联系起来。
J Bone Miner Res. 2024 Dec 31;40(1):20-26. doi: 10.1093/jbmr/zjae151.
4
AI-Driven Antimicrobial Peptide Discovery: Mining and Generation.人工智能驱动的抗菌肽发现:挖掘与生成
Acc Chem Res. 2025 Jun 17;58(12):1831-1846. doi: 10.1021/acs.accounts.0c00594. Epub 2025 Jun 3.
5
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
6
Safety and User Experience of a Generative Artificial Intelligence Digital Mental Health Intervention: Exploratory Randomized Controlled Trial.生成式人工智能数字心理健康干预的安全性与用户体验:探索性随机对照试验
J Med Internet Res. 2025 May 23;27:e67365. doi: 10.2196/67365.
7
Incentives for preventing smoking in children and adolescents.预防儿童和青少年吸烟的激励措施。
Cochrane Database Syst Rev. 2017 Jun 6;6(6):CD008645. doi: 10.1002/14651858.CD008645.pub3.
8
Rapid molecular tests for tuberculosis and tuberculosis drug resistance: a qualitative evidence synthesis of recipient and provider views.快速分子检测结核分枝杆菌和结核分枝杆菌耐药性:受检者和提供者观点的定性证据综合评价。
Cochrane Database Syst Rev. 2022 Apr 26;4(4):CD014877. doi: 10.1002/14651858.CD014877.pub2.
9
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
10
Factors that impact on the use of mechanical ventilation weaning protocols in critically ill adults and children: a qualitative evidence-synthesis.影响重症成人和儿童机械通气撤机方案使用的因素:一项定性证据综合分析
Cochrane Database Syst Rev. 2016 Oct 4;10(10):CD011812. doi: 10.1002/14651858.CD011812.pub2.

本文引用的文献

1
A call for built-in biosecurity safeguards for generative AI tools.呼吁为生成式人工智能工具设置内置生物安全保障措施。
Nat Biotechnol. 2025 Apr 28. doi: 10.1038/s41587-025-02650-8.
2
Scalable watermarking for identifying large language model outputs.可扩展的水印技术用于识别大型语言模型输出。
Nature. 2024 Oct;634(8035):818-823. doi: 10.1038/s41586-024-08025-4. Epub 2024 Oct 23.
3
De novo protein design with a denoising diffusion network independent of pretrained structure prediction models.基于去噪扩散网络的从头蛋白质设计,无需预先训练的结构预测模型。
Nat Methods. 2024 Nov;21(11):2107-2116. doi: 10.1038/s41592-024-02437-w. Epub 2024 Oct 9.
4
AI and biosecurity: The need for governance.人工智能与生物安全:治理的必要性。
Science. 2024 Aug 23;385(6711):831-833. doi: 10.1126/science.adq1977. Epub 2024 Aug 22.
5
Affinity gaps among B cells in germinal centers drive the selection of MPER precursors.生发中心 B 细胞之间的亲和力差距驱动 MPER 前体的选择。
Nat Immunol. 2024 Jun;25(6):1083-1096. doi: 10.1038/s41590-024-01844-7. Epub 2024 May 30.
6
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
7
Structures, mechanisms and applications of RNA-centric CRISPR-Cas13.以 RNA 为中心的 CRISPR-Cas13 的结构、机制与应用。
Nat Chem Biol. 2024 Jun;20(6):673-688. doi: 10.1038/s41589-024-01593-6. Epub 2024 May 3.
8
Generalized biomolecular modeling and design with RoseTTAFold All-Atom.基于 RoseTTAFold All-Atom 的广义生物分子建模与设计。
Science. 2024 Apr 19;384(6693):eadl2528. doi: 10.1126/science.adl2528.
9
PoseBusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences.PoseBusters:基于人工智能的对接方法无法生成符合物理原理的构象,也无法推广到新序列。
Chem Sci. 2023 Dec 13;15(9):3130-3139. doi: 10.1039/d3sc04185a. eCollection 2024 Feb 28.
10
DynamicBind: predicting ligand-specific protein-ligand complex structure with a deep equivariant generative model.动态绑定:使用深度等变生成模型预测配体特异性蛋白质-配体复合物结构。
Nat Commun. 2024 Feb 5;15(1):1071. doi: 10.1038/s41467-024-45461-2.