• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

新型功能关系发现的搜索方法。

Novel search method for the discovery of functional relationships.

机构信息

Max Planck Institute for Informatics, Campus E1.4, 66123 Saarbrücken, Germany.

出版信息

Bioinformatics. 2012 Jan 15;28(2):269-76. doi: 10.1093/bioinformatics/btr631. Epub 2011 Dec 16.

DOI:10.1093/bioinformatics/btr631
PMID:22180409
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3259435/
Abstract

MOTIVATION

Numerous annotations are available that functionally characterize genes and proteins with regard to molecular process, cellular localization, tissue expression, protein domain composition, protein interaction, disease association and other properties. Searching this steadily growing amount of information can lead to the discovery of new biological relationships between genes and proteins. To facilitate the searches, methods are required that measure the annotation similarity of genes and proteins. However, most current similarity methods are focused only on annotations from the Gene Ontology (GO) and do not take other annotation sources into account.

RESULTS

We introduce the new method BioSim that incorporates multiple sources of annotations to quantify the functional similarity of genes and proteins. We compared the performance of our method with four other well-known methods adapted to use multiple annotation sources. We evaluated the methods by searching for known functional relationships using annotations based only on GO or on our large data warehouse BioMyn. This warehouse integrates many diverse annotation sources of human genes and proteins. We observed that the search performance improved substantially for almost all methods when multiple annotation sources were included. In particular, our method outperformed the other methods in terms of recall and average precision.

摘要

动机

有许多注释可用于从分子过程、细胞定位、组织表达、蛋白质结构域组成、蛋白质相互作用、疾病关联和其他特性等方面对基因和蛋白质进行功能描述。搜索这些不断增长的信息可以发现基因和蛋白质之间新的生物学关系。为了便于搜索,需要能够衡量基因和蛋白质注释相似性的方法。然而,目前大多数相似性方法仅关注来自基因本体论(GO)的注释,而不考虑其他注释来源。

结果

我们引入了新的方法 BioSim,该方法结合了多种注释来源,以量化基因和蛋白质的功能相似性。我们将我们的方法与另外四个适用于使用多种注释来源的知名方法进行了比较。我们通过仅使用基于 GO 的注释或我们的大型数据仓库 BioMyn 进行注释来搜索已知的功能关系,以此来评估这些方法。该仓库整合了人类基因和蛋白质的许多不同注释来源。我们观察到,当包含多种注释来源时,几乎所有方法的搜索性能都有了显著提高。特别是,我们的方法在召回率和平均精度方面优于其他方法。

相似文献

1
Novel search method for the discovery of functional relationships.新型功能关系发现的搜索方法。
Bioinformatics. 2012 Jan 15;28(2):269-76. doi: 10.1093/bioinformatics/btr631. Epub 2011 Dec 16.
2
IntelliGO: a new vector-based semantic similarity measure including annotation origin.IntelliGO:一种新的基于向量的语义相似性度量方法,包含注释来源。
BMC Bioinformatics. 2010 Dec 1;11:588. doi: 10.1186/1471-2105-11-588.
3
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks.通过结合基因本体注释和基因共功能网络来测量语义相似性。
BMC Bioinformatics. 2015 Feb 14;16:44. doi: 10.1186/s12859-015-0474-7.
4
A relation based measure of semantic similarity for Gene Ontology annotations.一种基于关系的基因本体注释语义相似度度量方法。
BMC Bioinformatics. 2008 Nov 4;9:468. doi: 10.1186/1471-2105-9-468.
5
Automatic extraction of gene ontology annotation and its correlation with clusters in protein networks.基因本体注释的自动提取及其与蛋白质网络中聚类的相关性。
BMC Bioinformatics. 2007 Jul 10;8:243. doi: 10.1186/1471-2105-8-243.
6
GO functional similarity clustering depends on similarity measure, clustering method, and annotation completeness.GO 功能相似性聚类取决于相似性度量、聚类方法和注释完整性。
BMC Bioinformatics. 2019 Mar 27;20(1):155. doi: 10.1186/s12859-019-2752-2.
7
TopoICSim: a new semantic similarity measure based on gene ontology.TopoICSim:一种基于基因本体论的新语义相似性度量方法。
BMC Bioinformatics. 2016 Jul 29;17(1):296. doi: 10.1186/s12859-016-1160-0.
8
Cross-organism learning method to discover new gene functionalities.跨生物学习方法发现新基因功能。
Comput Methods Programs Biomed. 2016 Apr;126:20-34. doi: 10.1016/j.cmpb.2015.12.002. Epub 2015 Dec 17.
9
Measuring gene functional similarity based on group-wise comparison of GO terms.基于 GO 术语的组间比较来衡量基因功能相似性。
Bioinformatics. 2013 Jun 1;29(11):1424-32. doi: 10.1093/bioinformatics/btt160. Epub 2013 Apr 9.
10
Computational algorithms to predict Gene Ontology annotations.预测基因本体注释的计算算法。
BMC Bioinformatics. 2015;16 Suppl 6(Suppl 6):S4. doi: 10.1186/1471-2105-16-S6-S4. Epub 2015 Apr 17.

引用本文的文献

1
Ranking Cancer Proteins by Integrating PPI Network and Protein Expression Profiles.基于 PPI 网络和蛋白表达谱整合对癌症相关蛋白进行排序
Biomed Res Int. 2019 Jan 6;2019:3907195. doi: 10.1155/2019/3907195. eCollection 2019.
2
Use of DAVID algorithms for clustering custom annotated gene lists in a non-model organism, rainbow trout.使用DAVID算法对非模式生物虹鳟鱼中自定义注释的基因列表进行聚类。
BMC Res Notes. 2018 Jan 23;11(1):63. doi: 10.1186/s13104-018-3154-7.
3
ProteINSIDE to Easily Investigate Proteomics Data from Ruminants: Application to Mine Proteome of Adipose and Muscle Tissues in Bovine Foetuses.

本文引用的文献

1
Interactome networks and human disease.相互作用组网络与人类疾病。
Cell. 2011 Mar 18;144(6):986-98. doi: 10.1016/j.cell.2011.02.016.
2
IntelliGO: a new vector-based semantic similarity measure including annotation origin.IntelliGO:一种新的基于向量的语义相似性度量方法,包含注释来源。
BMC Bioinformatics. 2010 Dec 1;11:588. doi: 10.1186/1471-2105-11-588.
3
The Mouse Genome Database (MGD): premier model organism resource for mammalian genomics and genetics.小鼠基因组数据库(MGD):哺乳动物基因组学和遗传学的首要模式生物资源。
ProteINSIDE:轻松研究反刍动物蛋白质组学数据——在挖掘牛胎儿脂肪和肌肉组织蛋白质组中的应用
PLoS One. 2015 May 22;10(5):e0128086. doi: 10.1371/journal.pone.0128086. eCollection 2015.
4
NetworkPrioritizer: a versatile tool for network-based prioritization of candidate disease genes or other molecules.网络优先级排序工具:一个基于网络的候选疾病基因或其他分子优先级排序的多功能工具。
Bioinformatics. 2013 Jun 1;29(11):1471-3. doi: 10.1093/bioinformatics/btt164. Epub 2013 Apr 16.
5
Subtypes of primary colorectal tumors correlate with response to targeted treatment in colorectal cell lines.原发性结直肠肿瘤的亚型与结直肠细胞系中靶向治疗的反应相关。
BMC Med Genomics. 2012 Dec 31;5:66. doi: 10.1186/1755-8794-5-66.
Nucleic Acids Res. 2011 Jan;39(Database issue):D842-8. doi: 10.1093/nar/gkq1008. Epub 2010 Nov 3.
4
genenames.org: the HGNC resources in 2011.基因名称组织:2011年的HGNC资源。
Nucleic Acids Res. 2011 Jan;39(Database issue):D514-9. doi: 10.1093/nar/gkq892. Epub 2010 Oct 6.
5
Improving disease gene prioritization using the semantic similarity of Gene Ontology terms.利用基因本体论术语的语义相似性提高疾病基因优先级。
Bioinformatics. 2010 Sep 15;26(18):i561-7. doi: 10.1093/bioinformatics/btq384.
6
It's the machine that matters: Predicting gene function and phenotype from protein networks.关键在于机器:从蛋白质网络预测基因功能和表型。
J Proteomics. 2010 Oct 10;73(11):2277-89. doi: 10.1016/j.jprot.2010.07.005. Epub 2010 Jul 15.
7
The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function.GeneMANIA 预测服务器:用于基因优先级排序和预测基因功能的生物网络集成。
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W214-20. doi: 10.1093/nar/gkq537.
8
Revealing and avoiding bias in semantic similarity scores for protein pairs.揭示和避免蛋白质对语义相似性评分中的偏差。
BMC Bioinformatics. 2010 May 28;11:290. doi: 10.1186/1471-2105-11-290.
9
Annotation error in public databases: misannotation of molecular function in enzyme superfamilies.公共数据库中的注释错误:酶超家族中分子功能的错误注释。
PLoS Comput Biol. 2009 Dec;5(12):e1000605. doi: 10.1371/journal.pcbi.1000605. Epub 2009 Dec 11.
10
Recent advances and method development for drug target identification.近年来药物靶点鉴定的技术进展和方法发展。
Trends Pharmacol Sci. 2010 Feb;31(2):82-8. doi: 10.1016/j.tips.2009.11.002. Epub 2009 Dec 7.