• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多因素基因-基因邻近度度量方法,利用从基因本体论中提取的生物学知识:在基因聚类中的应用。

Multi-Factored Gene-Gene Proximity Measures Exploiting Biological Knowledge Extracted from Gene Ontology: Application in Gene Clustering.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2020 Jan-Feb;17(1):207-219. doi: 10.1109/TCBB.2018.2849362. Epub 2018 Jun 21.

DOI:10.1109/TCBB.2018.2849362
PMID:29994130
Abstract

To describe the cellular functions of proteins and genes, a potential dynamic vocabulary is Gene Ontology (GO), which comprises of three sub-ontologies namely, Biological-process, Cellular-component, and Molecular-function. It has several applications in the field of bioinformatics like annotating/measuring gene-gene or protein-protein semantic similarity, identifying genes/proteins by their GO annotations for disease gene and target discovery, etc. To determine semantic similarity between genes, several semantic measures have been proposed in literature, which involve information content of GO-terms, GO tree structure, or the combination of both. But, most of the existing semantic similarity measures do not consider different topological and information theoretic aspects of GO-terms collectively. Inspired by this fact, in this article, we have first proposed three novel semantic similarity/distance measures for genes covering different aspects of GO-tree. These are further implanted in the frameworks of well-known multi-objective and single-objective based clustering algorithms to determine functionally similar genes. For comparative analysis, 10 popular existing GO based semantic similarity/distance measures and tools are also considered. Experimental results on Mouse genome, Yeast, and Human genome datasets evidently demonstrate the supremacy of multi-objective clustering algorithms in association with proposed multi-factored similarity/distance measures. Clustering outcomes are further validated by conducting some biological/statistical significance tests. Supplementary information is available at https://www.iitp.ac.in/sriparna/journals.html.

摘要

为了描述蛋白质和基因的细胞功能,潜在的动态词汇是基因本体论 (GO),它由三个子本体组成,即生物过程、细胞成分和分子功能。它在生物信息学领域有多种应用,例如注释/测量基因-基因或蛋白质-蛋白质语义相似性、根据 GO 注释识别疾病基因和靶标发现中的基因/蛋白质等。为了确定基因之间的语义相似性,文献中提出了几种语义度量方法,涉及 GO 术语的信息量、GO 树结构或两者的组合。但是,大多数现有的语义相似性度量方法并没有综合考虑 GO 术语的不同拓扑和信息理论方面。受此启发,本文首次提出了三种覆盖 GO 树不同方面的新型基因语义相似性/距离度量方法。这些方法进一步植入了著名的多目标和单目标聚类算法框架中,以确定功能相似的基因。为了进行比较分析,还考虑了 10 种流行的基于 GO 的现有语义相似性/距离度量方法和工具。在小鼠基因组、酵母和人类基因组数据集上的实验结果明显表明,多目标聚类算法与提出的多因素相似性/距离度量方法相结合具有优越性。通过进行一些生物学/统计意义测试来验证聚类结果。补充信息可在 https://www.iitp.ac.in/sriparna/journals.html 获得。

相似文献

1
Multi-Factored Gene-Gene Proximity Measures Exploiting Biological Knowledge Extracted from Gene Ontology: Application in Gene Clustering.多因素基因-基因邻近度度量方法,利用从基因本体论中提取的生物学知识:在基因聚类中的应用。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jan-Feb;17(1):207-219. doi: 10.1109/TCBB.2018.2849362. Epub 2018 Jun 21.
2
TopoICSim: a new semantic similarity measure based on gene ontology.TopoICSim:一种基于基因本体论的新语义相似性度量方法。
BMC Bioinformatics. 2016 Jul 29;17(1):296. doi: 10.1186/s12859-016-1160-0.
3
Novel symmetry-based gene-gene dissimilarity measures utilizing Gene Ontology: Application in gene clustering.基于新型对称的基因-基因相异度度量方法,并利用基因本体论:在基因聚类中的应用。
Gene. 2018 Dec 30;679:341-351. doi: 10.1016/j.gene.2018.08.062. Epub 2018 Sep 2.
4
GO functional similarity clustering depends on similarity measure, clustering method, and annotation completeness.GO 功能相似性聚类取决于相似性度量、聚类方法和注释完整性。
BMC Bioinformatics. 2019 Mar 27;20(1):155. doi: 10.1186/s12859-019-2752-2.
5
IntelliGO: a new vector-based semantic similarity measure including annotation origin.IntelliGO:一种新的基于向量的语义相似性度量方法,包含注释来源。
BMC Bioinformatics. 2010 Dec 1;11:588. doi: 10.1186/1471-2105-11-588.
6
Influence of the go-based semantic similarity measures in multi-objective gene clustering algorithm performance.基于 GO 的语义相似度度量对多目标基因聚类算法性能的影响。
J Bioinform Comput Biol. 2020 Dec;18(6):2050038. doi: 10.1142/S0219720020500389. Epub 2020 Nov 5.
7
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks.通过结合基因本体注释和基因共功能网络来测量语义相似性。
BMC Bioinformatics. 2015 Feb 14;16:44. doi: 10.1186/s12859-015-0474-7.
8
DaGO-Fun: tool for Gene Ontology-based functional analysis using term information content measures.DAGO-Fun:一种基于基因本体论的功能分析工具,使用术语信息内容度量。
BMC Bioinformatics. 2013 Sep 25;14:284. doi: 10.1186/1471-2105-14-284.
9
A-DaGO-Fun: an adaptable Gene Ontology semantic similarity-based functional analysis tool.A-DaGO-Fun:一种基于基因本体语义相似性的适应性功能分析工具。
Bioinformatics. 2016 Feb 1;32(3):477-9. doi: 10.1093/bioinformatics/btv590. Epub 2015 Oct 17.
10
Assessment of Semantic Similarity between Proteins Using Information Content and Topological Properties of the Gene Ontology Graph.使用信息内容和基因本体论图的拓扑属性评估蛋白质之间的语义相似性。
IEEE/ACM Trans Comput Biol Bioinform. 2018 May-Jun;15(3):839-849. doi: 10.1109/TCBB.2017.2689762. Epub 2017 Mar 31.

引用本文的文献

1
MSC-CSMC: A multi-objective semi-supervised clustering algorithm based on constraints selection and multi-source constraints for gene expression data.MSC-CSMC:一种基于约束选择和多源约束的基因表达数据多目标半监督聚类算法。
Front Genet. 2023 Feb 27;14:1135260. doi: 10.3389/fgene.2023.1135260. eCollection 2023.
2
Multi-view feature selection for identifying gene markers: a diversified biological data driven approach.多视角特征选择用于鉴定基因标志物:一种多样化的生物数据驱动方法。
BMC Bioinformatics. 2020 Dec 30;21(Suppl 18):483. doi: 10.1186/s12859-020-03810-0.
3
A consensus multi-view multi-objective gene selection approach for improved sample classification.
一种共识多视角多目标基因选择方法,用于提高样本分类。
BMC Bioinformatics. 2020 Sep 17;21(Suppl 13):386. doi: 10.1186/s12859-020-03681-5.