• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从非结构化生物医学文本数据中提取蛋白质-蛋白质相互作用并从中学习的新进展。

New advances in extracting and learning from protein-protein interactions within unstructured biomedical text data.

作者信息

Caufield J Harry, Ping Peipei

机构信息

The NIH BD2K Center of Excellence in Biomedical Computing, University of California at Los Angeles, Los Angeles, CA 90095, U.S.A.

Department of Physiology, University of California at Los Angeles, Los Angeles, CA 90095, U.S.A.

出版信息

Emerg Top Life Sci. 2019 Aug 16;3(4):357-369. doi: 10.1042/ETLS20190003.

DOI:10.1042/ETLS20190003
PMID:33523203
Abstract

Protein-protein interactions, or PPIs, constitute a basic unit of our understanding of protein function. Though substantial effort has been made to organize PPI knowledge into structured databases, maintenance of these resources requires careful manual curation. Even then, many PPIs remain uncurated within unstructured text data. Extracting PPIs from experimental research supports assembly of PPI networks and highlights relationships crucial to elucidating protein functions. Isolating specific protein-protein relationships from numerous documents is technically demanding by both manual and automated means. Recent advances in the design of these methods have leveraged emerging computational developments and have demonstrated impressive results on test datasets. In this review, we discuss recent developments in PPI extraction from unstructured biomedical text. We explore the historical context of these developments, recent strategies for integrating and comparing PPI data, and their application to advancing the understanding of protein function. Finally, we describe the challenges facing the application of PPI mining to the text concerning protein families, using the multifunctional 14-3-3 protein family as an example.

摘要

蛋白质-蛋白质相互作用(PPIs)是我们理解蛋白质功能的基本单元。尽管已经付出了巨大努力将PPIs知识整理到结构化数据库中,但维护这些资源需要仔细的人工编目。即便如此,许多PPIs在非结构化文本数据中仍未得到编目。从实验研究中提取PPIs有助于构建PPI网络,并突出对于阐明蛋白质功能至关重要的关系。通过人工和自动化手段从众多文档中分离特定的蛋白质-蛋白质关系在技术上都具有挑战性。这些方法设计上的最新进展利用了新兴的计算技术,并在测试数据集上取得了令人瞩目的成果。在本综述中,我们讨论了从非结构化生物医学文本中提取PPIs的最新进展。我们探讨了这些进展的历史背景、整合和比较PPI数据的最新策略,以及它们在推进对蛋白质功能理解方面的应用。最后,我们以多功能的14-3-3蛋白质家族为例,描述了将PPI挖掘应用于蛋白质家族文本时面临的挑战。

相似文献

1
New advances in extracting and learning from protein-protein interactions within unstructured biomedical text data.从非结构化生物医学文本数据中提取蛋白质-蛋白质相互作用并从中学习的新进展。
Emerg Top Life Sci. 2019 Aug 16;3(4):357-369. doi: 10.1042/ETLS20190003.
2
Large-scale protein-protein post-translational modification extraction with distant supervision and confidence calibrated BioBERT.利用远距离监督和置信度校准的 BioBERT 进行大规模蛋白质 - 蛋白质翻译后修饰提取。
BMC Bioinformatics. 2022 Jan 4;23(1):4. doi: 10.1186/s12859-021-04504-x.
3
Towards Extracting Supporting Information About Predicted Protein-Protein Interactions.关于提取预测蛋白-蛋白相互作用的支持信息的研究。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Jul-Aug;15(4):1239-1246. doi: 10.1109/TCBB.2015.2505278. Epub 2015 Dec 7.
4
Learning the Structure of Biomedical Relationships from Unstructured Text.从非结构化文本中学习生物医学关系的结构
PLoS Comput Biol. 2015 Jul 28;11(7):e1004216. doi: 10.1371/journal.pcbi.1004216. eCollection 2015 Jul.
5
Text Mining and Machine Learning Protocol for Extracting Human-Related Protein Phosphorylation Information from PubMed.从 PubMed 中提取与人相关的蛋白质磷酸化信息的文本挖掘和机器学习协议。
Methods Mol Biol. 2022;2496:159-177. doi: 10.1007/978-1-0716-2305-3_9.
6
Facts from text: can text mining help to scale-up high-quality manual curation of gene products with ontologies?文本中的事实:文本挖掘能否助力利用本体对基因产物进行大规模高质量人工编目?
Brief Bioinform. 2008 Nov;9(6):466-78. doi: 10.1093/bib/bbn043. Epub 2008 Dec 6.
7
The eFIP system for text mining of protein interaction networks of phosphorylated proteins.基于磷酸化蛋白质相互作用网络的文本挖掘的 eFIP 系统。
Database (Oxford). 2012 Dec 5;2012:bas044. doi: 10.1093/database/bas044. Print 2012.
8
Extraction of protein interaction information from unstructured text using a context-free grammar.使用上下文无关语法从非结构化文本中提取蛋白质相互作用信息。
Bioinformatics. 2003 Nov 1;19(16):2046-53. doi: 10.1093/bioinformatics/btg279.
9
Construction of biological networks from unstructured information based on a semi-automated curation workflow.基于半自动编目工作流程从非结构化信息构建生物网络。
Database (Oxford). 2015 Jun 17;2015:bav057. doi: 10.1093/database/bav057.
10
Designing an openEHR-Based Pipeline for Extracting and Standardizing Unstructured Clinical Data Using Natural Language Processing.设计一个基于 openEHR 的管道,使用自然语言处理提取和标准化非结构化临床数据。
Methods Inf Med. 2020 Dec;59(S 02):e64-e78. doi: 10.1055/s-0040-1716403. Epub 2020 Oct 14.

引用本文的文献

1
Text mining for modeling of protein complexes enhanced by machine learning.基于机器学习的蛋白质复合物建模的文本挖掘。
Bioinformatics. 2021 May 1;37(4):497-505. doi: 10.1093/bioinformatics/btaa823.