Aligning large language models and geometric deep models for protein representation.

Authors

Shu Dong, Duan Bingbing, Guo Kai, Zhou Kaixiong, Tang Jiliang, Du Mengnan

Affiliations

Northwestern University, Computer Science Department, Evanston, IL 60201, USA.

University of Pittsburgh, Biological Sciences Department, Pittsburgh, PA 15260, USA.

Publication information

Patterns (N Y). 2025 Apr 11;6(5):101227. doi: 10.1016/j.patter.2025.101227. eCollection 2025 May 9.

DOI:10.1016/j.patter.2025.101227

PMID:40486971

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12142629/

Abstract

In this study, we explore the alignment of multimodal representations between large language models (LLMs) and geometric deep models (GDMs) in the protein domain. We comprehensively evaluate three LLMs with four protein-specialized GDMs. Our work examines alignment factors from both model and protein perspectives, identifying challenges in current alignment methodologies and proposing strategies to improve the alignment process. Experimental results reveal that GDMs incorporating both graph and 3D structural information align better with LLMs, larger LLMs demonstrate improved alignment capabilities, and protein rarity significantly impacts alignment performance. We also find that increasing GDM embedding dimensions, using two-layer projection heads, and fine-tuning LLMs on protein-specific data substantially enhance alignment quality. Last, we demonstrate that improved alignment correlates with better downstream performance and reduced hallucination in protein-focused multimodal LLMs.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc32/12142629/fc77e39f2146/gr1.jpg
