• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因本体友好型表达谱双聚类分析

Gene Ontology friendly biclustering of expression profiles.

作者信息

Liu Jinze, Wang Wei, Yang Jiong

机构信息

Department of Computer Science, University of North Carolina, Chapel Hill, 27599, USA.

出版信息

Proc IEEE Comput Syst Bioinform Conf. 2004:436-47.

PMID:16448036
Abstract

The soundness of clustering in the analysis of gene expression profiles and gene function prediction is based on the hypothesis that genes with similar expression profiles may imply strong correlations with their functions in the biological activities. Gene Ontology (GO) has become a well accepted standard in organizing gene function categories. Different gene function categories in GO can have very sophisticated relationships, such as 'part of' and 'overlapping'. Until now, no clustering algorithm can generate gene clusters within which the relationships can naturally reflect those of gene function categories in the GO hierarchy. The failure in resembling the relationships may reduce the confidence of clustering in gene function prediction. In this paper, we present a new clustering technique, Smart Hierarchical Tendency Preserving clustering (SHTP-clustering), based on a bicluster model, Tendency Preserving cluster (TP-Cluster). By directly incorporating Gene Ontology information into the clustering process, the SHTP-clustering algorithm yields a TP-cluster tree within which any subtree can be well mapped to a part of the GO hierarchy. Our experiments on yeast cell cycle data demonstrate that this method is efficient and effective in generating the biological relevant TP-Clusters.

摘要

在基因表达谱分析和基因功能预测中,聚类的合理性基于这样一个假设:具有相似表达谱的基因可能在生物活动中与其功能存在强相关性。基因本体论(GO)已成为组织基因功能类别的公认标准。GO中的不同基因功能类别可能具有非常复杂的关系,例如“部分属于”和“重叠”。到目前为止,没有聚类算法能够生成基因簇,其中基因簇内的关系能够自然地反映GO层次结构中基因功能类别的关系。无法反映这些关系可能会降低基因功能预测中聚类的可信度。在本文中,我们提出了一种基于双聚类模型——趋势保留聚类(TP-Cluster)的新聚类技术,即智能层次趋势保留聚类(SHTP-聚类)。通过将基因本体论信息直接纳入聚类过程,SHTP-聚类算法生成了一个TP-聚类树,其中任何子树都可以很好地映射到GO层次结构的一部分。我们对酵母细胞周期数据的实验表明,该方法在生成与生物学相关的TP-聚类方面是高效且有效的。

相似文献

1
Gene Ontology friendly biclustering of expression profiles.基因本体友好型表达谱双聚类分析
Proc IEEE Comput Syst Bioinform Conf. 2004:436-47.
2
GOurmet: a tool for quantitative comparison and visualization of gene expression profiles based on gene ontology (GO) distributions.GOurmet:一种基于基因本体(GO)分布进行基因表达谱定量比较和可视化的工具。
BMC Bioinformatics. 2006 Mar 17;7:151. doi: 10.1186/1471-2105-7-151.
3
Text mining biomedical literature for discovering gene-to-gene relationships: a comparative study of algorithms.挖掘生物医学文献以发现基因与基因之间的关系:算法的比较研究
IEEE/ACM Trans Comput Biol Bioinform. 2005 Jan-Mar;2(1):62-76. doi: 10.1109/TCBB.2005.14.
4
Comparative analysis of gene sets in the Gene Ontology space under the multiple hypothesis testing framework.在多重假设检验框架下对基因本体空间中基因集的比较分析。
Proc IEEE Comput Syst Bioinform Conf. 2004:425-35. doi: 10.1109/csb.2004.1332455.
5
BicAT: a biclustering analysis toolbox.BicAT:一个双聚类分析工具箱。
Bioinformatics. 2006 May 15;22(10):1282-3. doi: 10.1093/bioinformatics/btl099. Epub 2006 Mar 21.
6
Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes.使用功能类别参考集评估基因表达数据聚类算法的方法。
BMC Bioinformatics. 2006 Aug 31;7:397. doi: 10.1186/1471-2105-7-397.
7
GOChase: correcting errors from Gene Ontology-based annotations for gene products.GOChase:纠正基于基因本体论的基因产物注释中的错误。
Bioinformatics. 2005 Mar;21(6):829-31. doi: 10.1093/bioinformatics/bti106. Epub 2004 Oct 28.
8
Co-clustering and visualization of gene expression data and gene ontology terms for Saccharomyces cerevisiae using self-organizing maps.使用自组织映射对酿酒酵母的基因表达数据和基因本体术语进行共聚类和可视化。
J Biomed Inform. 2007 Apr;40(2):160-73. doi: 10.1016/j.jbi.2006.05.001. Epub 2006 May 20.
9
Attribute clustering for grouping, selection, and classification of gene expression data.用于基因表达数据分组、选择和分类的属性聚类
IEEE/ACM Trans Comput Biol Bioinform. 2005 Apr-Jun;2(2):83-101. doi: 10.1109/TCBB.2005.17.
10
Ontologizing gene-expression microarray data: characterizing clusters with Gene Ontology.基因表达微阵列数据的本体化:用基因本体论表征聚类
Bioinformatics. 2004 Apr 12;20(6):979-81. doi: 10.1093/bioinformatics/bth040. Epub 2004 Feb 5.

引用本文的文献

1
Biclustering methods: biological relevance and application in gene expression analysis.双聚类方法:生物学相关性及其在基因表达分析中的应用
PLoS One. 2014 Mar 20;9(3):e90801. doi: 10.1371/journal.pone.0090801. eCollection 2014.
2
A polynomial time biclustering algorithm for finding approximate expression patterns in gene expression time series.一种用于在基因表达时间序列中寻找近似表达模式的多项式时间双聚类算法。
Algorithms Mol Biol. 2009 Jun 4;4:8. doi: 10.1186/1748-7188-4-8.
3
Biomedical ontologies in action: role in knowledge management, data integration and decision support.
生物医学本体的应用:在知识管理、数据集成和决策支持中的作用。
Yearb Med Inform. 2008:67-79.