• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于生物数据分析的双聚类算法:一项综述。

Biclustering algorithms for biological data analysis: a survey.

作者信息

Madeira Sara C, Oliveira Arlindo L

机构信息

University of Beira Interior, Rua Marquês D'Avila e Bolama, Covilhã, Portugal.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2004 Jan-Mar;1(1):24-45. doi: 10.1109/TCBB.2004.2.

DOI:10.1109/TCBB.2004.2
PMID:17048406
Abstract

A large number of clustering approaches have been proposed for the analysis of gene expression data obtained from microarray experiments. However, the results from the application of standard clustering methods to genes are limited. This limitation is imposed by the existence of a number of experimental conditions where the activity of genes is uncorrelated. A similar limitation exists when clustering of conditions is performed. For this reason, a number of algorithms that perform simultaneous clustering on the row and column dimensions of the data matrix has been proposed. The goal is to find submatrices, that is, subgroups of genes and subgroups of conditions, where the genes exhibit highly correlated activities for every condition. In this paper, we refer to this class of algorithms as biclustering. Biclustering is also referred in the literature as coclustering and direct clustering, among others names, and has also been used in fields such as information retrieval and data mining. In this comprehensive survey, we analyze a large number of existing approaches to biclustering, and classify them in accordance with the type of biclusters they can find, the patterns of biclusters that are discovered, the methods used to perform the search, the approaches used to evaluate the solution, and the target applications.

摘要

已经提出了大量聚类方法用于分析从微阵列实验获得的基因表达数据。然而,将标准聚类方法应用于基因的结果是有限的。这种局限性是由许多基因活性不相关的实验条件的存在所导致的。在进行条件聚类时也存在类似的局限性。因此,已经提出了许多对数据矩阵的行和列维度同时进行聚类的算法。目标是找到子矩阵,即基因子组和条件子组,其中基因在每个条件下都表现出高度相关的活性。在本文中,我们将这类算法称为双聚类。双聚类在文献中也被称为协同聚类和直接聚类等,并且也已用于信息检索和数据挖掘等领域。在这项全面的综述中,我们分析了大量现有的双聚类方法,并根据它们能够找到的双聚类类型、发现的双聚类模式、执行搜索所使用的方法、评估解决方案所使用的方法以及目标应用对它们进行分类。

相似文献

1
Biclustering algorithms for biological data analysis: a survey.用于生物数据分析的双聚类算法:一项综述。
IEEE/ACM Trans Comput Biol Bioinform. 2004 Jan-Mar;1(1):24-45. doi: 10.1109/TCBB.2004.2.
2
Parallelized evolutionary learning for detection of biclusters in gene expression data.并行进化学习在基因表达数据中的双聚类检测。
IEEE/ACM Trans Comput Biol Bioinform. 2012;9(2):560-70. doi: 10.1109/TCBB.2011.53. Epub 2011 Mar 3.
3
Discovering biclusters in gene expression data based on high-dimensional linear geometries.基于高维线性几何在基因表达数据中发现双簇。
BMC Bioinformatics. 2008 Apr 23;9:209. doi: 10.1186/1471-2105-9-209.
4
Finding multiple coherent biclusters in microarray data using variable string length multiobjective genetic algorithm.使用可变字符串长度多目标遗传算法在微阵列数据中寻找多个相干双聚类
IEEE Trans Inf Technol Biomed. 2009 Nov;13(6):969-75. doi: 10.1109/TITB.2009.2017527. Epub 2009 Mar 16.
5
Biclustering via optimal re-ordering of data matrices in systems biology: rigorous methods and comparative studies.系统生物学中通过数据矩阵的最优重排进行双聚类分析:严格方法与比较研究。
BMC Bioinformatics. 2008 Oct 27;9:458. doi: 10.1186/1471-2105-9-458.
6
A new geometric biclustering algorithm based on the Hough transform for analysis of large-scale microarray data.一种基于霍夫变换的新型几何双聚类算法,用于大规模微阵列数据分析。
J Theor Biol. 2008 Mar 21;251(2):264-74. doi: 10.1016/j.jtbi.2007.11.030. Epub 2007 Dec 4.
7
VisBicluster: A Matrix-Based Bicluster Visualization of Expression Data.VisBicluster:一种基于矩阵的表达数据关联聚类可视化方法。
J Comput Biol. 2020 Sep;27(9):1384-1396. doi: 10.1089/cmb.2019.0385. Epub 2020 Feb 7.
8
Possibilistic approach for biclustering microarray data.用于双聚类微阵列数据的可能性方法。
Comput Biol Med. 2007 Oct;37(10):1426-36. doi: 10.1016/j.compbiomed.2007.01.005. Epub 2007 Mar 8.
9
KMeans greedy search hybrid algorithm for biclustering gene expression data.用于基因表达数据的分聚类的 KMeans 贪婪搜索混合算法。
Adv Exp Med Biol. 2010;680:181-8. doi: 10.1007/978-1-4419-5913-3_21.
10
Pattern-driven neighborhood search for biclustering of microarray data.基于模式驱动的基因表达数据子矩阵聚类邻域搜索算法。
BMC Bioinformatics. 2012 May 8;13 Suppl 7(Suppl 7):S11. doi: 10.1186/1471-2105-13-S7-S11.

引用本文的文献

1
Robust hierarchical co-clustering for exploring toxicogenomic biomarkers and their chemical regulators.用于探索毒理基因组生物标志物及其化学调节剂的稳健分层共聚类
Sci Rep. 2025 May 14;15(1):16676. doi: 10.1038/s41598-025-99568-7.
2
Optimal variable clustering for high-dimensional matrix valued data.高维矩阵值数据的最优变量聚类
Inf inference. 2025 Mar 12;14(1):iaaf001. doi: 10.1093/imaiai/iaaf001. eCollection 2025 Mar.
3
Robust convex biclustering with a tuning-free method.一种无需调优方法的稳健凸双聚类
J Appl Stat. 2024 Jun 17;52(2):271-286. doi: 10.1080/02664763.2024.2367143. eCollection 2025.
4
funBIalign: a hierachical algorithm for functional motif discovery based on mean squared residue scores.funBIalign:一种基于均方残基分数的用于功能基序发现的分层算法。
Stat Comput. 2025;35(1):11. doi: 10.1007/s11222-024-10537-y. Epub 2024 Dec 10.
5
Detecting Boolean Asymmetric Relationships with a Loop Counting Technique and its Implications for Analyzing Heterogeneity within Gene Expression Datasets.使用循环计数技术检测布尔不对称关系及其对分析基因表达数据集中异质性的意义。
IEEE/ACM Trans Comput Biol Bioinform. 2024 Oct 29;PP. doi: 10.1109/TCBB.2024.3487434.
6
Biclustering Multivariate Longitudinal Data with Application to Recovery Trajectories of White Matter After Sport-Related Concussion.双聚类多元纵向数据及其在运动相关脑震荡后白质恢复轨迹中的应用
Data Sci Sci. 2024;3(1). doi: 10.1080/26941899.2024.2376535. Epub 2024 Jul 16.
7
Evolutionary Mechanism Based Conserved Gene Expression Biclustering Module Analysis for Breast Cancer Genomics.基于进化机制的乳腺癌基因组保守基因表达双聚类模块分析
Biomedicines. 2024 Sep 12;12(9):2086. doi: 10.3390/biomedicines12092086.
8
A parameter free relative density based biclustering method for identifying non-linear feature relations.一种基于无参数相对密度的双聚类方法,用于识别非线性特征关系。
Heliyon. 2024 Jul 20;10(15):e34736. doi: 10.1016/j.heliyon.2024.e34736. eCollection 2024 Aug 15.
9
Biclustering data analysis: a comprehensive survey.双聚类数据分析:全面综述。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae342.
10
Quantitatively Visualizing Bipartite Datasets.定量可视化二分数据集。
Phys Rev X. 2023 Apr-Jun;13(2). doi: 10.1103/physrevx.13.021002. Epub 2023 Apr 4.