聚类微阵列数据中的数据融合：平衡发现和可解释性。

Data-fusion in clustering microarray data: balancing discovery and interpretability.

机构信息

Department of Public Health Sciences, University of Toronto, Health Sciences Building, Toronto, Ontario, Canada.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2010 Jan-Mar;7(1):50-63. doi: 10.1109/TCBB.2007.70267.

DOI:10.1109/TCBB.2007.70267

PMID:20150668

Abstract

While clustering genes remains one of the most popular exploratory tools for expression data, it often results in a highly variable and biologically uninformative clusters. This paper explores a data fusion approach to clustering microarray data. Our method, which combined expression data and Gene Ontology (GO)-derived information, is applied on a real data set to perform genome-wide clustering. A set of novel tools is proposed to validate the clustering results and pick a fair value of infusion coefficient. These tools measure stability, biological relevance, and distance from the expression-only clustering solution. Our results indicate that a data-fusion clustering leads to more stable, biologically relevant clusters that are still representative of the experimental data.

摘要

虽然聚类基因仍然是表达数据最常用的探索工具之一，但它通常会产生高度可变且生物学上无信息的聚类。本文探讨了一种用于聚类微阵列数据的数据融合方法。我们的方法结合了表达数据和基因本体论（GO）衍生的信息，应用于真实数据集进行全基因组聚类。提出了一组新的工具来验证聚类结果并选择合理的融合系数值。这些工具用于衡量稳定性、生物学相关性以及与仅表达聚类解决方案的距离。我们的结果表明，数据融合聚类可以产生更稳定、更具生物学相关性的聚类，同时仍然代表实验数据。

相似文献

Data-fusion in clustering microarray data: balancing discovery and interpretability.聚类微阵列数据中的数据融合：平衡发现和可解释性。

IEEE/ACM Trans Comput Biol Bioinform. 2010 Jan-Mar;7(1):50-63. doi: 10.1109/TCBB.2007.70267.

Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes.使用功能类别参考集评估基因表达数据聚类算法的方法。

BMC Bioinformatics. 2006 Aug 31;7:397. doi: 10.1186/1471-2105-7-397.

Comparing algorithms for clustering of expression data: how to assess gene clusters.比较用于表达数据聚类的算法：如何评估基因簇。

Methods Mol Biol. 2009;541:479-509. doi: 10.1007/978-1-59745-243-4_21.

An iterative data mining approach for mining overlapping coexpression patterns in noisy gene expression data.一种用于在嘈杂基因表达数据中挖掘重叠共表达模式的迭代数据挖掘方法。

IEEE Trans Nanobioscience. 2009 Sep;8(3):252-8. doi: 10.1109/TNB.2009.2026747. Epub 2009 Jul 14.

Clustering of gene expression data: performance and similarity analysis.基因表达数据的聚类：性能与相似性分析

BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S19. doi: 10.1186/1471-2105-7-S4-S19.

Associative clustering for exploring dependencies between functional genomics data sets.用于探索功能基因组学数据集之间依赖性的关联聚类

IEEE/ACM Trans Comput Biol Bioinform. 2005 Jul-Sep;2(3):203-16. doi: 10.1109/TCBB.2005.32.

Analysis of a Gibbs sampler method for model-based clustering of gene expression data.一种基于模型的基因表达数据聚类的吉布斯采样器方法分析。

Bioinformatics. 2008 Jan 15;24(2):176-83. doi: 10.1093/bioinformatics/btm562. Epub 2007 Nov 22.

Novel symmetry-based gene-gene dissimilarity measures utilizing Gene Ontology: Application in gene clustering.基于新型对称的基因-基因相异度度量方法，并利用基因本体论：在基因聚类中的应用。

Gene. 2018 Dec 30;679:341-351. doi: 10.1016/j.gene.2018.08.062. Epub 2018 Sep 2.

Clustering and re-clustering for pattern discovery in gene expression data.用于基因表达数据中模式发现的聚类和再聚类。

J Bioinform Comput Biol. 2005 Apr;3(2):281-301. doi: 10.1142/s0219720005001053.

Detecting clusters of different geometrical shapes in microarray gene expression data.在微阵列基因表达数据中检测不同几何形状的聚类。

Bioinformatics. 2005 May 1;21(9):1927-34. doi: 10.1093/bioinformatics/bti251. Epub 2005 Jan 12.

引用本文的文献

The ABC recommendations for validation of supervised machine learning results in biomedical sciences.生物医学科学中监督式机器学习结果验证的ABC建议。

Front Big Data. 2022 Sep 27;5:979465. doi: 10.3389/fdata.2022.979465. eCollection 2022.

THD-Module Extractor: An Application for CEN Module Extraction and Interesting Gene Identification for Alzheimer's Disease.THD 模块提取器：一种用于 CEN 模块提取和阿尔茨海默病相关基因鉴定的应用。

Sci Rep. 2016 Nov 30;6:38046. doi: 10.1038/srep38046.

Identifying Significant Features in Cancer Methylation Data Using Gene Pathway Segmentation.利用基因通路分割识别癌症甲基化数据中的显著特征

Cancer Inform. 2016 Sep 20;15:189-98. doi: 10.4137/CIN.S39859. eCollection 2016.

A Review of Feature Selection and Feature Extraction Methods Applied on Microarray Data.应用于微阵列数据的特征选择与特征提取方法综述

Adv Bioinformatics. 2015;2015:198363. doi: 10.1155/2015/198363. Epub 2015 Jun 11.

Improving clustering with metabolic pathway data.利用代谢途径数据改进聚类。

BMC Bioinformatics. 2014 Apr 10;15:101. doi: 10.1186/1471-2105-15-101.

An algorithm for finding biologically significant features in microarray data based on a priori manifold learning.一种基于先验流形学习在微阵列数据中寻找生物学显著特征的算法。

PLoS One. 2014 Mar 3;9(3):e90562. doi: 10.1371/journal.pone.0090562. eCollection 2014.

Seeing the forest for the trees: using the Gene Ontology to restructure hierarchical clustering.见林又见树：利用基因本体论重组层次聚类

Bioinformatics. 2009 Jul 15;25(14):1789-95. doi: 10.1093/bioinformatics/btp327. Epub 2009 Jun 3.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

聚类微阵列数据中的数据融合：平衡发现和可解释性。

Data-fusion in clustering microarray data: balancing discovery and interpretability.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献