• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于行走权重的蛋白质相互作用提取子序列核方法。

Walk-weighted subsequence kernels for protein-protein interaction extraction.

机构信息

Department of Computer Science, Sogang University, Seoul, Korea.

出版信息

BMC Bioinformatics. 2010 Feb 25;11:107. doi: 10.1186/1471-2105-11-107.

DOI:10.1186/1471-2105-11-107
PMID:20184736
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2844389/
Abstract

BACKGROUND

The construction of interaction networks between proteins is central to understanding the underlying biological processes. However, since many useful relations are excluded in databases and remain hidden in raw text, a study on automatic interaction extraction from text is important in bioinformatics field.

RESULTS

Here, we suggest two kinds of kernel methods for genic interaction extraction, considering the structural aspects of sentences. First, we improve our prior dependency kernel by modifying the kernel function so that it can involve various substructures in terms of (1) e-walks, (2) partial match, (3) non-contiguous paths, and (4) different significance of substructures. Second, we propose the walk-weighted subsequence kernel to parameterize non-contiguous syntactic structures as well as semantic roles and lexical features, which makes learning structural aspects from a small amount of training data effective. Furthermore, we distinguish the significances of parameters such as syntactic locality, semantic roles, and lexical features by varying their weights.

CONCLUSIONS

We addressed the genic interaction problem with various dependency kernels and suggested various structural kernel scenarios based on the directed shortest dependency path connecting two entities. Consequently, we obtained promising results over genic interaction data sets with the walk-weighted subsequence kernel. The results are compared using automatically parsed third party protein-protein interaction (PPI) data as well as perfectly syntactic labeled PPI data.

摘要

背景

蛋白质间相互作用网络的构建对于理解潜在的生物过程至关重要。然而,由于许多有用的关系在数据库中被排除在外,并且隐藏在原始文本中,因此从文本中自动提取相互作用的研究在生物信息学领域非常重要。

结果

在这里,我们考虑句子的结构方面,提出了两种用于基因相互作用提取的核方法。首先,我们通过修改核函数来改进我们先前的依赖核,以便它可以涉及(1)e-游走、(2)部分匹配、(3)非连续路径和(4)不同子结构的重要性等各种子结构。其次,我们提出了游走加权子序列核,以便将非连续的句法结构以及语义角色和词汇特征参数化,从而使从少量训练数据中学习结构方面变得有效。此外,我们通过改变它们的权重来区分句法局部性、语义角色和词汇特征等参数的重要性。

结论

我们使用各种依赖核解决了基因相互作用问题,并根据连接两个实体的有向最短依赖路径提出了各种结构核方案。因此,我们使用游走加权子序列核在基因相互作用数据集上获得了有希望的结果。结果与自动解析的第三方蛋白质-蛋白质相互作用(PPI)数据以及完全句法标记的 PPI 数据进行了比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/cb2e5023cb26/1471-2105-11-107-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/702266fded4a/1471-2105-11-107-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/d09bf2a6f951/1471-2105-11-107-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/506b0f6bb78b/1471-2105-11-107-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/8bbdd8a93b60/1471-2105-11-107-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/4344b0301271/1471-2105-11-107-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/c5cb57d3095d/1471-2105-11-107-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/cb2e5023cb26/1471-2105-11-107-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/702266fded4a/1471-2105-11-107-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/d09bf2a6f951/1471-2105-11-107-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/506b0f6bb78b/1471-2105-11-107-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/8bbdd8a93b60/1471-2105-11-107-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/4344b0301271/1471-2105-11-107-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/c5cb57d3095d/1471-2105-11-107-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42ba/2844389/cb2e5023cb26/1471-2105-11-107-7.jpg

相似文献

1
Walk-weighted subsequence kernels for protein-protein interaction extraction.基于行走权重的蛋白质相互作用提取子序列核方法。
BMC Bioinformatics. 2010 Feb 25;11:107. doi: 10.1186/1471-2105-11-107.
2
Kernel approaches for genic interaction extraction.用于基因相互作用提取的核方法。
Bioinformatics. 2008 Jan 1;24(1):118-26. doi: 10.1093/bioinformatics/btm544. Epub 2007 Nov 14.
3
Distributed smoothed tree kernel for protein-protein interaction extraction from the biomedical literature.用于从生物医学文献中提取蛋白质-蛋白质相互作用的分布式平滑树核
PLoS One. 2017 Nov 3;12(11):e0187379. doi: 10.1371/journal.pone.0187379. eCollection 2017.
4
Hash subgraph pairwise kernel for protein-protein interaction extraction.基于哈希子图的成对核函数用于蛋白质-蛋白质相互作用提取。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1190-202. doi: 10.1109/TCBB.2012.50.
5
Tree kernel-based protein-protein interaction extraction from biomedical literature.基于树核的生物医学文献中蛋白质-蛋白质相互作用提取。
J Biomed Inform. 2012 Jun;45(3):535-43. doi: 10.1016/j.jbi.2012.02.004. Epub 2012 Feb 25.
6
Exploiting graph kernels for high performance biomedical relation extraction.利用图核进行高性能生物医学关系提取。
J Biomed Semantics. 2018 Jan 30;9(1):7. doi: 10.1186/s13326-017-0168-3.
7
Protein-protein interaction extraction by leveraging multiple kernels and parsers.利用多种内核和解析器进行蛋白质-蛋白质相互作用提取。
Int J Med Inform. 2009 Dec;78(12):e39-46. doi: 10.1016/j.ijmedinf.2009.04.010. Epub 2009 Jun 4.
8
Linguistic feature analysis for protein interaction extraction.语言特征分析在蛋白质相互作用提取中的应用。
BMC Bioinformatics. 2009 Nov 12;10:374. doi: 10.1186/1471-2105-10-374.
9
Leveraging syntactic and semantic graph kernels to extract pharmacokinetic drug drug interactions from biomedical literature.利用句法和语义图核从生物医学文献中提取药代动力学药物相互作用。
BMC Syst Biol. 2016 Aug 26;10 Suppl 3(Suppl 3):67. doi: 10.1186/s12918-016-0311-2.
10
Neighborhood hash graph kernel for protein-protein interaction extraction.基于邻近哈希图核的蛋白质-蛋白质相互作用提取方法。
J Biomed Inform. 2011 Dec;44(6):1086-92. doi: 10.1016/j.jbi.2011.08.011. Epub 2011 Aug 23.

引用本文的文献

1
Large language model based framework for automated extraction of genetic interactions from unstructured data.基于大型语言模型的框架,用于从非结构化数据中自动提取遗传相互作用。
PLoS One. 2024 May 21;19(5):e0303231. doi: 10.1371/journal.pone.0303231. eCollection 2024.
2
Extraction of causal relations based on SBEL and BERT model.基于 SBEL 和 BERT 模型的因果关系提取。
Database (Oxford). 2021 Feb 18;2021. doi: 10.1093/database/baab005.
3
A Graph Convolutional Network-Based Method for Chemical-Protein Interaction Extraction: Algorithm Development.

本文引用的文献

1
Protein-protein interaction extraction by leveraging multiple kernels and parsers.利用多种内核和解析器进行蛋白质-蛋白质相互作用提取。
Int J Med Inform. 2009 Dec;78(12):e39-46. doi: 10.1016/j.ijmedinf.2009.04.010. Epub 2009 Jun 4.
2
Evaluating contributions of natural language parsers to protein-protein interaction extraction.评估自然语言解析器对蛋白质-蛋白质相互作用提取的贡献。
Bioinformatics. 2009 Feb 1;25(3):394-400. doi: 10.1093/bioinformatics/btn631. Epub 2008 Dec 9.
3
All-paths graph kernel for protein-protein interaction extraction with evaluation of cross-corpus learning.
一种基于图卷积网络的化学-蛋白质相互作用提取方法:算法开发
JMIR Med Inform. 2020 May 19;8(5):e17643. doi: 10.2196/17643.
4
Chemical-protein interaction extraction via contextualized word representations and multihead attention.基于上下文化词表示和多头注意力机制的化学-蛋白质相互作用提取。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz054.
5
Exploring semi-supervised variational autoencoders for biomedical relation extraction.探索半监督变分自动编码器在生物医学关系抽取中的应用。
Methods. 2019 Aug 15;166:112-119. doi: 10.1016/j.ymeth.2019.02.021. Epub 2019 Feb 27.
6
Identification of conclusive association entities in biomedical articles.生物医学文章中确凿关联实体的识别。
J Biomed Semantics. 2019 Jan 7;10(1):1. doi: 10.1186/s13326-018-0194-9.
7
Semisupervised Learning Based Disease-Symptom and Symptom-Therapeutic Substance Relation Extraction from Biomedical Literature.基于半监督学习的生物医学文献中疾病-症状及症状-治疗物质关系提取
Biomed Res Int. 2016;2016:3594937. doi: 10.1155/2016/3594937. Epub 2016 Oct 16.
8
DTMiner: identification of potential disease targets through biomedical literature mining.DTMiner:通过生物医学文献挖掘识别潜在疾病靶点。
Bioinformatics. 2016 Dec 1;32(23):3619-3626. doi: 10.1093/bioinformatics/btw503. Epub 2016 Aug 9.
9
A Shortest Dependency Path Based Convolutional Neural Network for Protein-Protein Relation Extraction.基于最短依赖路径的卷积神经网络在蛋白质-蛋白质关系抽取中的应用。
Biomed Res Int. 2016;2016:8479587. doi: 10.1155/2016/8479587. Epub 2016 Jul 14.
10
A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set.一种使用混合特征集的生物医学文本关系提取框架。
Comput Math Methods Med. 2015;2015:910423. doi: 10.1155/2015/910423. Epub 2015 Aug 10.
用于蛋白质-蛋白质相互作用提取的全路径图核以及跨语料库学习评估
BMC Bioinformatics. 2008 Nov 19;9 Suppl 11(Suppl 11):S2. doi: 10.1186/1471-2105-9-S11-S2.
4
Comparative analysis of five protein-protein interaction corpora.五个蛋白质-蛋白质相互作用语料库的比较分析。
BMC Bioinformatics. 2008 Apr 11;9 Suppl 3(Suppl 3):S6. doi: 10.1186/1471-2105-9-S3-S6.
5
Kernel approaches for genic interaction extraction.用于基因相互作用提取的核方法。
Bioinformatics. 2008 Jan 1;24(1):118-26. doi: 10.1093/bioinformatics/btm544. Epub 2007 Nov 14.
6
RelEx--relation extraction using dependency parse trees.RelEx——使用依存句法分析树进行关系抽取。
Bioinformatics. 2007 Feb 1;23(3):365-71. doi: 10.1093/bioinformatics/btl616. Epub 2006 Dec 1.
7
The Universal Protein Resource (UniProt).通用蛋白质资源(UniProt)。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D154-9. doi: 10.1093/nar/gki070.
8
Discovering patterns to extract protein-protein interactions from full texts.从全文中发现提取蛋白质-蛋白质相互作用的模式。
Bioinformatics. 2004 Dec 12;20(18):3604-12. doi: 10.1093/bioinformatics/bth451. Epub 2004 Jul 29.
9
The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003.2003年的SWISS-PROT蛋白质知识库及其补充TrEMBL。
Nucleic Acids Res. 2003 Jan 1;31(1):365-70. doi: 10.1093/nar/gkg095.
10
BIND: the Biomolecular Interaction Network Database.BIND:生物分子相互作用网络数据库。
Nucleic Acids Res. 2003 Jan 1;31(1):248-50. doi: 10.1093/nar/gkg056.