• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因本体论语义相似性的新见解。

A novel insight into Gene Ontology semantic similarity.

机构信息

School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, PR China.

出版信息

Genomics. 2013 Jun;101(6):368-75. doi: 10.1016/j.ygeno.2013.04.010. Epub 2013 Apr 26.

DOI:10.1016/j.ygeno.2013.04.010
PMID:23628645
Abstract

Existing methods for computing the semantic similarity between Gene Ontology (GO) terms are often based on external datasets and, therefore are not intrinsic to GO. Furthermore, they not only fail to handle identical annotations but also show a strong bias toward well-annotated proteins when being used for measuring similarity of proteins. Inspired by the concept of cellular differentiation and dedifferentiation in developmental biology, we propose a shortest semantic differentiation distance (SSDD) based on the concept of semantic totipotency to measure the semantic similarity of GO terms and further compare the functional similarity of proteins. Using human ratings and a benchmark dataset, SSDD was found to improve upon existing methods for computing the semantic similarity of GO terms. An in-depth analysis shows that SSDD is able to distinguish identical annotations and does not depend on annotation richness, thus producing more unbiased and reliable results. Online services can be accessed at the Gene Functional Similarity Analysis Tools website (GFSAT: http://nclab.hit.edu.cn/GFSAT).

摘要

现有的计算基因本体论(GO)术语之间语义相似度的方法通常基于外部数据集,因此不是 GO 固有的。此外,当用于测量蛋白质的相似性时,它们不仅无法处理相同的注释,而且对注释良好的蛋白质表现出很强的偏见。受发育生物学中细胞分化和去分化概念的启发,我们提出了一种基于语义全能性概念的最短语义分化距离(SSDD),用于测量 GO 术语的语义相似性,并进一步比较蛋白质的功能相似性。使用人类评分和基准数据集,发现 SSDD 优于现有的计算 GO 术语语义相似性的方法。深入分析表明,SSDD 能够区分相同的注释,并且不依赖于注释丰富度,从而产生更公正和可靠的结果。在线服务可以在基因功能相似性分析工具网站(GFSAT:http://nclab.hit.edu.cn/GFSAT)上访问。

相似文献

1
A novel insight into Gene Ontology semantic similarity.基因本体论语义相似性的新见解。
Genomics. 2013 Jun;101(6):368-75. doi: 10.1016/j.ygeno.2013.04.010. Epub 2013 Apr 26.
2
A relation based measure of semantic similarity for Gene Ontology annotations.一种基于关系的基因本体注释语义相似度度量方法。
BMC Bioinformatics. 2008 Nov 4;9:468. doi: 10.1186/1471-2105-9-468.
3
A new method to measure the semantic similarity of GO terms.一种测量基因本体术语语义相似性的新方法。
Bioinformatics. 2007 May 15;23(10):1274-81. doi: 10.1093/bioinformatics/btm087. Epub 2007 Mar 7.
4
Finding disease similarity based on implicit semantic similarity.基于隐语义相似性的疾病相似性发现。
J Biomed Inform. 2012 Apr;45(2):363-71. doi: 10.1016/j.jbi.2011.11.017. Epub 2011 Dec 7.
5
Measure the Semantic Similarity of GO Terms Using Aggregate Information Content.使用聚合信息内容测量基因本体术语的语义相似性。
IEEE/ACM Trans Comput Biol Bioinform. 2014 May-Jun;11(3):468-76. doi: 10.1109/TCBB.2013.176.
6
TopoICSim: a new semantic similarity measure based on gene ontology.TopoICSim:一种基于基因本体论的新语义相似性度量方法。
BMC Bioinformatics. 2016 Jul 29;17(1):296. doi: 10.1186/s12859-016-1160-0.
7
Measuring gene functional similarity based on group-wise comparison of GO terms.基于 GO 术语的组间比较来衡量基因功能相似性。
Bioinformatics. 2013 Jun 1;29(11):1424-32. doi: 10.1093/bioinformatics/btt160. Epub 2013 Apr 9.
8
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks.通过结合基因本体注释和基因共功能网络来测量语义相似性。
BMC Bioinformatics. 2015 Feb 14;16:44. doi: 10.1186/s12859-015-0474-7.
9
IntelliGO: a new vector-based semantic similarity measure including annotation origin.IntelliGO:一种新的基于向量的语义相似性度量方法,包含注释来源。
BMC Bioinformatics. 2010 Dec 1;11:588. doi: 10.1186/1471-2105-11-588.
10
GOSemSim: an R package for measuring semantic similarity among GO terms and gene products.GO 语义相似度分析:用于测量 GO 术语和基因产物之间语义相似性的 R 包。
Bioinformatics. 2010 Apr 1;26(7):976-8. doi: 10.1093/bioinformatics/btq064. Epub 2010 Feb 23.

引用本文的文献

1
An Effective DNA Methylation Biomarker Screening Mechanism for Amyotrophic Lateral Sclerosis (ALS) Based on Comorbidities and Gene Function Analysis.一种基于合并症和基因功能分析的肌萎缩侧索硬化症(ALS)有效DNA甲基化生物标志物筛选机制
Bioengineering (Basel). 2024 Oct 12;11(10):1020. doi: 10.3390/bioengineering11101020.
2
Unveiling inter-embryo variability in spindle length over time: Towards quantitative phenotype analysis.揭示胚胎间纺锤体长度随时间的变化:迈向定量表型分析。
PLoS Comput Biol. 2024 Sep 5;20(9):e1012330. doi: 10.1371/journal.pcbi.1012330. eCollection 2024 Sep.
3
Computational identification of protein complexes from network interactions: Present state, challenges, and the way forward.
从网络相互作用中进行蛋白质复合物的计算识别:现状、挑战及未来方向。
Comput Struct Biotechnol J. 2022 May 27;20:2699-2712. doi: 10.1016/j.csbj.2022.05.049. eCollection 2022.
4
A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology.一种新的基因功能相似性计算模型,利用基因本体论中术语和关系的特异性。
BMC Bioinformatics. 2022 Jan 20;23(Suppl 1):47. doi: 10.1186/s12859-022-04557-6.
5
BiGAN: LncRNA-disease association prediction based on bidirectional generative adversarial network.BiGAN:基于双向生成对抗网络的 lncRNA 疾病关联预测。
BMC Bioinformatics. 2021 Jun 30;22(1):357. doi: 10.1186/s12859-021-04273-7.
6
GSAn: an alternative to enrichment analysis for annotating gene sets.GSAn:一种用于注释基因集的富集分析替代方法。
NAR Genom Bioinform. 2020 Mar 14;2(2):lqaa017. doi: 10.1093/nargab/lqaa017. eCollection 2020 Jun.
7
A Collection of Benchmark Data Sets for Knowledge Graph-based Similarity in the Biomedical Domain.生物医学领域基于知识图的相似度的基准数据集集合。
Database (Oxford). 2020 Jan 1;2020. doi: 10.1093/database/baaa078.
8
A Literature Review of Gene Function Prediction by Modeling Gene Ontology.基于基因本体建模的基因功能预测文献综述
Front Genet. 2020 Apr 24;11:400. doi: 10.3389/fgene.2020.00400. eCollection 2020.
9
Transcriptome analysis reveals an important candidate gene involved in both nodal metastasis and prognosis in lung adenocarcinoma.转录组分析揭示了一个在肺腺癌淋巴结转移和预后中均起作用的重要候选基因。
Cell Biosci. 2019 Nov 19;9:92. doi: 10.1186/s13578-019-0356-1. eCollection 2019.
10
Dual Convolutional Neural Networks With Attention Mechanisms Based Method for Predicting Disease-Related lncRNA Genes.基于注意力机制的双卷积神经网络预测疾病相关长链非编码RNA基因的方法
Front Genet. 2019 May 3;10:416. doi: 10.3389/fgene.2019.00416. eCollection 2019.