• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用多种聚类方法探索微阵列数据的共识框架。

Consensus framework for exploring microarray data using multiple clustering methods.

作者信息

Laderas Ted, McWeeney Shannon

机构信息

Informatics Shared Resource, OHSU Cancer Institute, Portland, Oregon 97201, USA.

出版信息

OMICS. 2007 Spring;11(1):116-28. doi: 10.1089/omi.2006.0008.

DOI:10.1089/omi.2006.0008
PMID:17411399
Abstract

The large variety of clustering algorithms and their variants can be daunting to researchers wishing to explore patterns within their microarray datasets. Furthermore, each clustering method has distinct biases in finding patterns within the data, and clusterings may not be reproducible across different algorithms. A consensus approach utilizing multiple algorithms can show where the various methods agree and expose robust patterns within the data. In this paper, we present a software package - Consense, written for R/Bioconductor - that utilizes such an approach to explore microarray datasets. Consense produces clustering results for each of the clustering methods and produces a report of metrics comparing the individual clusterings. A feature of Consense is identification of genes that cluster consistently with an index gene across methods. Utilizing simulated microarray data, sensitivity of the metrics to the biases of the different clustering algorithms is explored. The framework is easily extensible, allowing this tool to be used by other functional genomic data types, as well as other high-throughput OMICS data types generated from metabolomic and proteomic experiments. It also provides a flexible environment to benchmark new clustering algorithms. Consense is currently available as an installable R/Bioconductor package (http://www.ohsucancer.com/isrdev/consense/).

摘要

对于希望在其微阵列数据集中探索模式的研究人员而言,种类繁多的聚类算法及其变体可能令人望而生畏。此外,每种聚类方法在数据中寻找模式时都有明显的偏差,并且不同算法之间的聚类结果可能无法重现。利用多种算法的共识方法可以显示各种方法的一致之处,并揭示数据中的稳健模式。在本文中,我们展示了一个为R/Bioconductor编写的软件包——Consense,它利用这种方法来探索微阵列数据集。Consense为每种聚类方法生成聚类结果,并生成一份比较各个聚类的指标报告。Consense的一个特点是识别出在各种方法中与索引基因一致聚类的基因。利用模拟微阵列数据,探索了这些指标对不同聚类算法偏差的敏感性。该框架易于扩展,允许此工具用于其他功能基因组数据类型,以及代谢组学和蛋白质组学实验产生的其他高通量组学数据类型。它还提供了一个灵活的环境来对新的聚类算法进行基准测试。Consense目前可作为一个可安装的R/Bioconductor包获取(http://www.ohsucancer.com/isrdev/consense/)。

相似文献

1
Consensus framework for exploring microarray data using multiple clustering methods.使用多种聚类方法探索微阵列数据的共识框架。
OMICS. 2007 Spring;11(1):116-28. doi: 10.1089/omi.2006.0008.
2
Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm.使用一致性算法对大型DNA微阵列数据集进行稳健的多尺度聚类
Bioinformatics. 2006 Jan 1;22(1):58-67. doi: 10.1093/bioinformatics/bti746. Epub 2005 Oct 27.
3
A mathematical and computational framework for quantitative comparison and integration of large-scale gene expression data.用于大规模基因表达数据定量比较与整合的数学和计算框架。
Nucleic Acids Res. 2005 May 10;33(8):2580-94. doi: 10.1093/nar/gki536. Print 2005.
4
Quadratic regression analysis for gene discovery and pattern recognition for non-cyclic short time-course microarray experiments.用于非循环短时间进程微阵列实验的基因发现和模式识别的二次回归分析。
BMC Bioinformatics. 2005 Apr 25;6:106. doi: 10.1186/1471-2105-6-106.
5
clusterExperiment and RSEC: A Bioconductor package and framework for clustering of single-cell and other large gene expression datasets.clusterExperiment 和 RSEC:一个用于单细胞和其他大型基因表达数据集聚类的 Bioconductor 包和框架。
PLoS Comput Biol. 2018 Sep 4;14(9):e1006378. doi: 10.1371/journal.pcbi.1006378. eCollection 2018 Sep.
6
Clustering microarray gene expression data using weighted Chinese restaurant process.使用加权中国餐馆过程对微阵列基因表达数据进行聚类
Bioinformatics. 2006 Aug 15;22(16):1988-97. doi: 10.1093/bioinformatics/btl284. Epub 2006 Jun 9.
7
AMDA: an R package for the automated microarray data analysis.AMDA:一个用于自动微阵列数据分析的R软件包。
BMC Bioinformatics. 2006 Jul 6;7:335. doi: 10.1186/1471-2105-7-335.
8
Inferential clustering approach for microarray experiments with replicated measurements.具有重复测量的微阵列实验的推断聚类方法。
IEEE/ACM Trans Comput Biol Bioinform. 2009 Oct-Dec;6(4):594-604. doi: 10.1109/TCBB.2008.106.
9
goCluster integrates statistical analysis and functional interpretation of microarray expression data.goCluster整合了微阵列表达数据的统计分析和功能解释。
Bioinformatics. 2005 Sep 1;21(17):3575-7. doi: 10.1093/bioinformatics/bti574. Epub 2005 Jul 14.
10
Mining coherent dense subgraphs across massive biological networks for functional discovery.在海量生物网络中挖掘连贯密集子图以进行功能发现。
Bioinformatics. 2005 Jun;21 Suppl 1:i213-21. doi: 10.1093/bioinformatics/bti1049.

引用本文的文献

1
A novel approach identifies the first transcriptome networks in bats: a new genetic model for vocal communication.一种新方法识别出蝙蝠中的首个转录组网络:用于声音交流的新遗传模型。
BMC Genomics. 2015 Oct 22;16:836. doi: 10.1186/s12864-015-2068-1.
2
Coral: an integrated suite of visualizations for comparing clusterings.珊瑚:用于比较聚类的集成可视化套件。
BMC Bioinformatics. 2012 Oct 29;13:276. doi: 10.1186/1471-2105-13-276.
3
A systematic comparison of genome-scale clustering algorithms.基于基因组规模的聚类算法的系统比较。
BMC Bioinformatics. 2012 Jun 25;13 Suppl 10(Suppl 10):S7. doi: 10.1186/1471-2105-13-S10-S7.
4
Comparative analysis of acute and chronic corticosteroid pharmacogenomic effects in rat liver: transcriptional dynamics and regulatory structures.大鼠肝中急性和慢性皮质甾类药物基因组药理学效应的比较分析:转录动力学和调控结构。
BMC Bioinformatics. 2010 Oct 14;11:515. doi: 10.1186/1471-2105-11-515.
5
Lung cancer gene expression database analysis incorporating prior knowledge with support vector machine-based classification method.肺癌基因表达数据库分析,结合了支持向量机分类方法和先验知识。
J Exp Clin Cancer Res. 2009 Jul 18;28(1):103. doi: 10.1186/1756-9966-28-103.