• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于基因产物相似性的基因本体模糊测度。

Fuzzy measures on the Gene Ontology for gene product similarity.

作者信息

Popescu Mihail, Keller James M, Mitchell Joyce A

机构信息

Health Management and Informatics Department, University of Missouri, Columbia, MO 65211, USA.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2006 Jul-Sep;3(3):263-74. doi: 10.1109/TCBB.2006.37.

DOI:10.1109/TCBB.2006.37
PMID:17048464
Abstract

One of the most important objects in bioinformatics is a gene product (protein or RNA). For many gene products, functional information is summarized in a set of Gene Ontology (GO) annotations. For these genes, it is reasonable to include similarity measures based on the terms found in the GO or other taxonomy. In this paper, we introduce several novel measures for computing the similarity of two gene products annotated with GO terms. The fuzzy measure similarity (FMS) has the advantage that it takes into consideration the context of both complete sets of annotation terms when computing the similarity between two gene products. When the two gene products are not annotated by common taxonomy terms, we propose a method that avoids a zero similarity result. To account for the variations in the annotation reliability, we propose a similarity measure based on the Choquet integral. These similarity measures provide extra tools for the biologist in search of functional information for gene products. The initial testing on a group of 194 sequences representing three proteins families shows a higher correlation of the FMS and Choquet similarities to the BLAST sequence similarities than the traditional similarity measures such as pairwise average or pairwise maximum.

摘要

生物信息学中最重要的对象之一是基因产物(蛋白质或RNA)。对于许多基因产物,功能信息总结在一组基因本体论(GO)注释中。对于这些基因,基于GO或其他分类法中发现的术语纳入相似性度量是合理的。在本文中,我们引入了几种用于计算两个带有GO术语注释的基因产物相似性的新度量。模糊度量相似性(FMS)的优点在于,在计算两个基因产物之间的相似性时,它考虑了注释术语全集的上下文。当两个基因产物没有被共同的分类法术语注释时,我们提出了一种避免相似性结果为零的方法。为了考虑注释可靠性的差异,我们提出了一种基于Choquet积分的相似性度量。这些相似性度量为寻求基因产物功能信息的生物学家提供了额外的工具。对代表三个蛋白质家族的194个序列进行的初步测试表明,与传统相似性度量(如成对平均值或成对最大值)相比,FMS和Choquet相似性与BLAST序列相似性具有更高的相关性。

相似文献

1
Fuzzy measures on the Gene Ontology for gene product similarity.用于基因产物相似性的基因本体模糊测度。
IEEE/ACM Trans Comput Biol Bioinform. 2006 Jul-Sep;3(3):263-74. doi: 10.1109/TCBB.2006.37.
2
A genetic similarity algorithm for searching the Gene Ontology terms and annotating anonymous protein sequences.一种用于搜索基因本体术语和注释匿名蛋白质序列的遗传相似性算法。
J Biomed Inform. 2008 Feb;41(1):65-81. doi: 10.1016/j.jbi.2007.05.010. Epub 2007 Jun 27.
3
A relation based measure of semantic similarity for Gene Ontology annotations.一种基于关系的基因本体注释语义相似度度量方法。
BMC Bioinformatics. 2008 Nov 4;9:468. doi: 10.1186/1471-2105-9-468.
4
A new similarity measure among protein sequences.一种蛋白质序列间新的相似性度量方法。
Proc IEEE Comput Soc Bioinform Conf. 2003;2:347-52.
5
Protein superfamily classification using fuzzy rule-based classifier.使用基于模糊规则的分类器进行蛋白质超家族分类。
IEEE Trans Nanobioscience. 2009 Mar;8(1):92-9. doi: 10.1109/TNB.2009.2016484. Epub 2009 Mar 21.
6
Multi-label learning with fuzzy hypergraph regularization for protein subcellular location prediction.基于模糊超图正则化的多标签学习用于蛋白质亚细胞定位预测
IEEE Trans Nanobioscience. 2014 Dec;13(4):438-47. doi: 10.1109/TNB.2014.2341111. Epub 2014 Jul 31.
7
SimShift: identifying structural similarities from NMR chemical shifts.SimShift:从核磁共振化学位移中识别结构相似性。
Bioinformatics. 2006 Feb 15;22(4):460-5. doi: 10.1093/bioinformatics/bti805. Epub 2005 Nov 29.
8
PairProSVM: protein subcellular localization based on local pairwise profile alignment and SVM.PairProSVM:基于局部两两轮廓比对和支持向量机的蛋白质亚细胞定位
IEEE/ACM Trans Comput Biol Bioinform. 2008 Jul-Sep;5(3):416-22. doi: 10.1109/TCBB.2007.70256.
9
On the quality of tree-based protein classification.论基于树的蛋白质分类的质量。
Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.
10
The relationship between protein sequences and their gene ontology functions.蛋白质序列与其基因本体功能之间的关系。
BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S11. doi: 10.1186/1471-2105-7-S4-S11.

引用本文的文献

1
Unravelling the genetic variability of host resilience to endo- and ectoparasites in Nellore commercial herds.揭示内寄生虫和外寄生虫对 Nellore 商业牛群宿主抗性的遗传变异性。
Genet Sel Evol. 2023 Nov 21;55(1):81. doi: 10.1186/s12711-023-00844-9.
2
MicroRNA and mRNA analysis of angiotensin II-induced renal artery endothelial cell dysfunction.血管紧张素II诱导的肾动脉内皮细胞功能障碍的微小RNA和信使核糖核酸分析
Exp Ther Med. 2020 Jun;19(6):3723-3737. doi: 10.3892/etm.2020.8613. Epub 2020 Mar 19.
3
Extended Multitarget Pharmacology of Anticancer Drugs.
抗癌药物的扩展多靶点药理学。
J Chem Inf Model. 2019 Jun 24;59(6):3006-3017. doi: 10.1021/acs.jcim.9b00031. Epub 2019 May 3.
4
MGOGP: a gene module-based heuristic algorithm for cancer-related gene prioritization.MGOGP:基于基因模块的癌症相关基因优先级启发式算法。
BMC Bioinformatics. 2018 Jun 5;19(1):215. doi: 10.1186/s12859-018-2216-0.
5
Developing a similarity searching module for patient safety event reporting system using semantic similarity measures.开发一个使用语义相似度测量的患者安全事件报告系统的相似性搜索模块。
BMC Med Inform Decis Mak. 2017 Jul 5;17(Suppl 2):75. doi: 10.1186/s12911-017-0467-8.
6
A Novel Schema to Enhance Data Quality of Patient Safety Event Reports.一种提高患者安全事件报告数据质量的新颖模式。
AMIA Annu Symp Proc. 2017 Feb 10;2016:1840-1849. eCollection 2016.
7
The inferred cardiogenic gene regulatory network in the mammalian heart.哺乳动物心脏中推断出的心脏发生基因调控网络。
PLoS One. 2014 Jun 27;9(6):e100842. doi: 10.1371/journal.pone.0100842. eCollection 2014.
8
Discovering pathway cross-talks based on functional relations between pathways.基于通路间的功能关系发现通路串扰。
BMC Genomics. 2012;13 Suppl 7(Suppl 7):S25. doi: 10.1186/1471-2164-13-S7-S25. Epub 2012 Dec 13.
9
Novel search method for the discovery of functional relationships.新型功能关系发现的搜索方法。
Bioinformatics. 2012 Jan 15;28(2):269-76. doi: 10.1093/bioinformatics/btr631. Epub 2011 Dec 16.
10
Phosphoproteomics identifies oncogenic Ras signaling targets and their involvement in lung adenocarcinomas.磷酸化蛋白质组学鉴定致癌 Ras 信号靶标及其在肺腺癌中的作用。
PLoS One. 2011;6(5):e20199. doi: 10.1371/journal.pone.0020199. Epub 2011 May 26.