• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从大型文本语料库中高效挖掘蛋白质相互作用关系。

Efficiently mining protein interaction dependencies from large text corpora.

机构信息

Genome Informatics, Institute of Human Genetics, Faculty of Medicine, University of Duisburg-Essen, Essen, Germany.

出版信息

Integr Biol (Camb). 2012 Jul;4(7):805-12. doi: 10.1039/c2ib00126h. Epub 2012 Jun 15.

DOI:10.1039/c2ib00126h
PMID:22706334
Abstract

Biochemical research has yielded an extensive amount of information about dependencies between protein interactions, as generated by allosteric regulations, steric hindrance and other mechanisms. Collectively, this information is valuable for understanding large intracellular protein networks. However, this information is sparsely distributed among millions of publications and documented as freely styled text meant for manual reading. Here we develop a computational approach for extracting information about interaction dependencies from large numbers of publications. First, keyword-based tokenization reduces full papers to short strings, facilitating an efficient search for patterns that are likely to indicate descriptions of interaction dependencies. Sentences that match such patterns are extracted, thereby reducing the amount of text to be read by human curators. Application of this approach to the integrin adhesome network extracted from 59,933 papers 208 short statements, close to half of which indeed describe interaction dependencies. We visualize the obtained hypernetwork of dependencies and illustrate that these dependencies confine the feasible mechanisms of adhesion sites assembly and generate testable hypotheses about their switchability.

摘要

生化研究产生了大量关于蛋白质相互作用之间的依赖关系的信息,这些依赖关系是由变构调节、空间位阻和其他机制产生的。这些信息对于理解大型细胞内蛋白质网络非常有价值。然而,这些信息在数百万篇文献中分布稀疏,并以自由风格的文本形式记录,以便人工阅读。在这里,我们开发了一种从大量文献中提取相互作用依赖关系信息的计算方法。首先,基于关键字的标记化将全文简化为短字符串,从而可以有效地搜索可能表示相互作用依赖关系描述的模式。提取与这些模式匹配的句子,从而减少了人类编辑者需要阅读的文本量。将这种方法应用于从 59933 篇论文中提取的整合素黏着斑网络,得到了 208 个简短的陈述,其中近一半确实描述了相互作用的依赖关系。我们可视化了获得的依赖关系超网络,并说明了这些依赖关系限制了黏着斑组装的可行机制,并生成了关于其可切换性的可测试假设。

相似文献

1
Efficiently mining protein interaction dependencies from large text corpora.从大型文本语料库中高效挖掘蛋白质相互作用关系。
Integr Biol (Camb). 2012 Jul;4(7):805-12. doi: 10.1039/c2ib00126h. Epub 2012 Jun 15.
2
Modeling and simulating networks of interdependent protein interactions.相互依赖的蛋白质相互作用网络的建模与模拟
Integr Biol (Camb). 2018 May 21;10(5):290-305. doi: 10.1039/c8ib00012c.
3
Text mining and visualisation of Protein-Protein Interactions.蛋白质-蛋白质相互作用的文本挖掘与可视化
Int J Comput Biol Drug Des. 2011;4(3):239-44. doi: 10.1504/IJCBDD.2011.041412. Epub 2011 Jul 21.
4
A multi-layered approach to protein data integration for diabetes research.一种用于糖尿病研究的蛋白质数据整合的多层方法。
Artif Intell Med. 2007 Oct;41(2):129-43. doi: 10.1016/j.artmed.2007.07.009. Epub 2007 Sep 14.
5
Visualization and analysis of a cardio vascular disease- and MUPP1-related biological network combining text mining and data warehouse approaches.结合文本挖掘和数据仓库方法的心血管疾病与MUPP1相关生物网络的可视化与分析
J Integr Bioinform. 2010 Nov 11;7(1):148. doi: 10.2390/biecoll-jib-2010-148.
6
The protein interaction network mediated by human SH3 domains.人类 SH3 结构域介导的蛋白质相互作用网络。
Biotechnol Adv. 2012 Jan-Feb;30(1):4-15. doi: 10.1016/j.biotechadv.2011.06.012. Epub 2011 Jun 29.
7
Hash subgraph pairwise kernel for protein-protein interaction extraction.基于哈希子图的成对核函数用于蛋白质-蛋白质相互作用提取。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1190-202. doi: 10.1109/TCBB.2012.50.
8
The Gene Interaction Miner: a new tool for data mining contextual information for protein-protein interaction analysis.基因交互挖掘器:一种新的工具,用于挖掘蛋白质-蛋白质相互作用分析的上下文信息。
Bioinformatics. 2010 Jan 15;26(2):283-4. doi: 10.1093/bioinformatics/btp652. Epub 2009 Dec 4.
9
Manual annotation of protein interactions.蛋白质相互作用的人工注释。
Methods Mol Biol. 2009;563:75-95. doi: 10.1007/978-1-60761-175-2_5.
10
Finding the evidence for protein-protein interactions from PubMed abstracts.从PubMed摘要中寻找蛋白质-蛋白质相互作用的证据。
Bioinformatics. 2006 Jul 15;22(14):e220-6. doi: 10.1093/bioinformatics/btl203.

引用本文的文献

1
The integrin adhesome network at a glance.整合素黏附体网络概览。
J Cell Sci. 2016 Nov 15;129(22):4159-4163. doi: 10.1242/jcs.192054. Epub 2016 Oct 31.
2
Integrative systems and synthetic biology of cell-matrix adhesion sites.细胞-基质粘附位点的整合系统与合成生物学
Cell Adh Migr. 2016 Sep 2;10(5):451-460. doi: 10.1080/19336918.2016.1148865. Epub 2016 Feb 6.
3
Symmetric exchange of multi-protein building blocks between stationary focal adhesions and the cytosol.固定的粘着斑与细胞质之间多蛋白构建模块的对称交换。
Elife. 2014 Jun 3;3:e02257. doi: 10.7554/eLife.02257.