Hierarchical tree snipping: clustering guided by prior knowledge.

Authors

Dotan-Cohen Dikla, Melkman Avraham A, Kasif Simon

Affiliation

Department of Computer Science, Ben Gurion University, Beer Sheva 84105, Israel.

Publication

Bioinformatics. 2007 Dec 15;23(24):3335-42. doi: 10.1093/bioinformatics/btm526. Epub 2007 Nov 7.

Abstract

MOTIVATION

Hierarchical clustering is widely used to cluster genes into groups based on their expression similarity. This method first constructs a tree. Next, this tree is partitioned into subtrees by cutting all edges at some level, thereby inducing a clustering. Unfortunately, the resulting clusters often do not exhibit significant functional coherence.
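
The conventional procedure described above (build a tree, then cut all edges at one level) can be sketched as follows. This is a minimal illustration in plain Python, not code from the paper: the gene names, the one-dimensional expression values, the choice of single linkage, and the function names `build_tree` and `cut_at_level` are all assumptions made for the example.

```python
from itertools import combinations

# Hypothetical 1-D "expression" values for six genes (illustrative only).
EXPR = {"g1": 0.0, "g2": 0.2, "g3": 1.0, "g4": 5.0, "g5": 5.1, "g6": 9.0}


def build_tree(values):
    """Agglomerative clustering with single linkage (an assumed linkage).

    Each node is a tuple (height, members, children); leaves have height 0.0
    and no children. Returns the root node.
    """
    nodes = [(0.0, (name,), ()) for name in sorted(values)]

    def dist(a, b):
        # Single linkage: distance between the closest pair of members.
        return min(abs(values[x] - values[y]) for x in a[1] for y in b[1])

    while len(nodes) > 1:
        # Merge the two closest clusters into a new internal node.
        a, b = min(combinations(nodes, 2), key=lambda pair: dist(*pair))
        nodes.remove(a)
        nodes.remove(b)
        nodes.append((dist(a, b), a[1] + b[1], (a, b)))
    return nodes[0]


def cut_at_level(node, level):
    """Cut every edge above `level`; each remaining subtree is one cluster."""
    height, members, children = node
    if height <= level or not children:
        return [sorted(members)]
    return [c for child in children for c in cut_at_level(child, level)]


if __name__ == "__main__":
    root = build_tree(EXPR)
    print(sorted(cut_at_level(root, 1.0)))
```

Note that a single `level` applies to the whole tree, so lowering it to split one loose cluster also splits every other cluster merged above that height.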

RESULTS

To improve the biological significance of the clustering, we develop a new framework of partitioning by snipping: cutting selected edges at variable levels. The snipped edges are selected to induce clusters that are maximally consistent with partially available background knowledge such as functional classifications. Algorithms for two key applications are presented: functional prediction of genes, and discovery of functionally enriched clusters of co-expressed genes. Simulation results and cross-validation tests indicate that the algorithms perform well even when the actual number of clusters differs considerably from the requested number. Performance is improved compared with a previously proposed algorithm.
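
To make the idea of cutting edges at variable levels concrete, here is a small sketch under stated assumptions. It is not the authors' algorithm, whose objective and details the abstract does not give. The sketch assumes that every cluster is a complete subtree, that some genes carry a known functional label and others do not (`None`), and that a partition is scored by the number of labeled genes that agree with the majority label of their cluster. The toy tree and the names `majority_count` and `best_snip` are hypothetical.

```python
from collections import Counter

# Hypothetical toy dendrogram. A leaf is (gene_name, label_or_None);
# an internal node is a pair (left_subtree, right_subtree).
TREE = (
    (("g1", "A"), (("g2", "A"), ("g3", None))),
    ((("g4", "B"), ("g5", "B")), (("g6", "A"), ("g7", "B"))),
)


def is_leaf(node):
    return isinstance(node[0], str)


def leaves(node):
    if is_leaf(node):
        return [node]
    return leaves(node[0]) + leaves(node[1])


def majority_count(node):
    """Number of labeled leaves under `node` that carry its majority label."""
    counts = Counter(label for _, label in leaves(node) if label is not None)
    return max(counts.values(), default=0)


def best_snip(node, k):
    """Best split of the leaves under `node` into exactly k subtree clusters.

    Returns (score, clusters), or None if k clusters are impossible.
    Assumed score: labeled genes matching their cluster's majority label.
    """
    if k == 1:
        return majority_count(node), [[name for name, _ in leaves(node)]]
    if is_leaf(node):
        return None  # a single leaf cannot be split further
    best = None
    for i in range(1, k):
        # Snip the two edges below this node, then distribute the
        # k clusters between the left and right subtrees.
        left = best_snip(node[0], i)
        right = best_snip(node[1], k - i)
        if left is None or right is None:
            continue
        candidate = (left[0] + right[0], left[1] + right[1])
        if best is None or candidate[0] > best[0]:
            best = candidate
    return best


if __name__ == "__main__":
    for k in (1, 2, 4):
        print(k, best_snip(TREE, k))
```

With four requested clusters, this toy example keeps `g1`, `g2`, `g3` together near the root while separating `g6` from `g7` at the deepest level, a partition that no single horizontal cut of the tree would need to produce. The recursion is exponential in the worst case and is meant only for small illustrative trees.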

AVAILABILITY

A Java package is available at http://www.cs.bgu.ac.il/~dotna/TreeSnipping.
