• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用语义距离和模糊聚类方法对基因进行功能分类:基于参考集的评估与重叠分析

Functional classification of genes using semantic distance and fuzzy clustering approach: evaluation with reference sets and overlap analysis.

作者信息

Devignes Marie-Dominique, Benabderrahmane Sidahmed, Smaïl-Tabbone Malika, Napoli Amedeo, Poch Olivier

机构信息

Lorraine University, Equipe Orpailleur, Campus Scientifique, Vandoeuvre les Nancy cedex, France.

出版信息

Int J Comput Biol Drug Des. 2012;5(3-4):245-60. doi: 10.1504/IJCBDD.2012.049207. Epub 2012 Sep 24.

DOI:10.1504/IJCBDD.2012.049207
PMID:23013652
Abstract

Functional classification aims at grouping genes according to their molecular function or the biological process they participate in. Evaluating the validity of such unsupervised gene classification remains a challenge given the variety of distance measures and classification algorithms that can be used. We evaluate here functional classification of genes with the help of reference sets: KEGG (Kyoto Encyclopaedia of Genes and Genomes) pathways and Pfam clans. These sets represent ground truth for any distance based on GO (Gene Ontology) biological process and molecular function annotations respectively. Overlaps between clusters and reference sets are estimated by the F-score method. We test our previously described IntelliGO semantic distance with hierarchical and fuzzy C-means clustering and we compare results with the state-of-the-art DAVID (Database for Annotation Visualisation and Integrated Discovery) functional classification method. Finally, study of best matching clusters to reference sets leads us to propose a set-difference method for discovering missing information.

摘要

功能分类旨在根据基因的分子功能或它们参与的生物学过程对基因进行分组。鉴于可以使用的多种距离度量和分类算法,评估这种无监督基因分类的有效性仍然是一项挑战。我们在此借助参考集评估基因的功能分类:KEGG(京都基因与基因组百科全书)通路和Pfam家族。这些集合分别代表基于GO(基因本体论)生物学过程和分子功能注释的任何距离的基本事实。通过F分数方法估计聚类与参考集之间的重叠。我们使用层次聚类和模糊C均值聚类测试我们之前描述的IntelliGO语义距离,并将结果与最先进的DAVID(注释可视化与综合发现数据库)功能分类方法进行比较。最后,对与参考集最佳匹配聚类的研究使我们提出一种用于发现缺失信息的集差方法。

相似文献

1
Functional classification of genes using semantic distance and fuzzy clustering approach: evaluation with reference sets and overlap analysis.使用语义距离和模糊聚类方法对基因进行功能分类:基于参考集的评估与重叠分析
Int J Comput Biol Drug Des. 2012;5(3-4):245-60. doi: 10.1504/IJCBDD.2012.049207. Epub 2012 Sep 24.
2
IntelliGO: a new vector-based semantic similarity measure including annotation origin.IntelliGO:一种新的基于向量的语义相似性度量方法,包含注释来源。
BMC Bioinformatics. 2010 Dec 1;11:588. doi: 10.1186/1471-2105-11-588.
3
Detecting clusters of different geometrical shapes in microarray gene expression data.在微阵列基因表达数据中检测不同几何形状的聚类。
Bioinformatics. 2005 May 1;21(9):1927-34. doi: 10.1093/bioinformatics/bti251. Epub 2005 Jan 12.
4
Fuzzy-rough supervised attribute clustering algorithm and classification of microarray data.模糊粗糙监督属性聚类算法与微阵列数据分类
IEEE Trans Syst Man Cybern B Cybern. 2011 Feb;41(1):222-33. doi: 10.1109/TSMCB.2010.2050684. Epub 2010 Jun 10.
5
Modified fuzzy gap statistic for estimating preferable number of clusters in fuzzy k-means clustering.用于估计模糊k均值聚类中最优聚类数的改进模糊间隙统计量
J Biosci Bioeng. 2008 Mar;105(3):273-81. doi: 10.1263/jbb.105.273.
6
Fuzzy ensemble clustering based on random projections for DNA microarray data analysis.基于随机投影的模糊集成聚类用于DNA微阵列数据分析
Artif Intell Med. 2009 Feb-Mar;45(2-3):173-83. doi: 10.1016/j.artmed.2008.07.014. Epub 2008 Sep 17.
7
Analysis of a Gibbs sampler method for model-based clustering of gene expression data.一种基于模型的基因表达数据聚类的吉布斯采样器方法分析。
Bioinformatics. 2008 Jan 15;24(2):176-83. doi: 10.1093/bioinformatics/btm562. Epub 2007 Nov 22.
8
GO functional similarity clustering depends on similarity measure, clustering method, and annotation completeness.GO 功能相似性聚类取决于相似性度量、聚类方法和注释完整性。
BMC Bioinformatics. 2019 Mar 27;20(1):155. doi: 10.1186/s12859-019-2752-2.
9
Comparing algorithms for clustering of expression data: how to assess gene clusters.比较用于表达数据聚类的算法:如何评估基因簇。
Methods Mol Biol. 2009;541:479-509. doi: 10.1007/978-1-59745-243-4_21.
10
Rough-fuzzy clustering for grouping functionally similar genes from microarray data.基于粗糙模糊聚类的基因功能相似性分组方法研究
IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):286-99. doi: 10.1109/TCBB.2012.103.

引用本文的文献

1
Sulfatase 2 Is Associated with Steroid Resistance in Childhood Nephrotic Syndrome.硫酸酯酶2与儿童肾病综合征的类固醇抵抗有关。
J Clin Med. 2021 Feb 2;10(3):523. doi: 10.3390/jcm10030523.
2
Discovering associations between adverse drug events using pattern structures and ontologies.利用模式结构和本体论发现药物不良事件之间的关联。
J Biomed Semantics. 2017 Aug 22;8(1):29. doi: 10.1186/s13326-017-0137-x.