Suppr超能文献

COBRAC:一种具有压缩功能的凸双聚类快速实现方法。

COBRAC: a fast implementation of convex biclustering with compression.

作者信息

Yi Haidong, Huang Le, Mishne Gal, Chi Eric C

机构信息

Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.

Department of Genetics, Curriculum in Bioinformatics & Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.

出版信息

Bioinformatics. 2021 Oct 25;37(20):3667-3669. doi: 10.1093/bioinformatics/btab248.

Abstract

SUMMARY

Biclustering is a generalization of clustering used to identify simultaneous grouping patterns in observations (rows) and features (columns) of a data matrix. Recently, the biclustering task has been formulated as a convex optimization problem. While this convex recasting of the problem has attractive properties, existing algorithms do not scale well. To address this problem and make convex biclustering a practical tool for analyzing larger data, we propose an implementation of fast convex biclustering called COBRAC to reduce the computing time by iteratively compressing problem size along with the solution path. We apply COBRAC to several gene expression datasets to demonstrate its effectiveness and efficiency. Besides the standalone version for COBRAC, we also developed a related online web server for online calculation and visualization of the downloadable interactive results.

AVAILABILITY AND IMPLEMENTATION

The source code and test data are available at https://github.com/haidyi/cvxbiclustr or https://zenodo.org/record/4620218. The web server is available at https://cvxbiclustr.ericchi.com.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

摘要

双聚类是聚类的一种推广,用于识别数据矩阵的观测值(行)和特征(列)中的同时分组模式。最近,双聚类任务已被表述为一个凸优化问题。虽然该问题的这种凸形式化具有吸引人的特性,但现有算法扩展性不佳。为解决此问题并使凸双聚类成为分析更大数据的实用工具,我们提出一种名为COBRAC的快速凸双聚类实现方法,通过沿求解路径迭代压缩问题规模来减少计算时间。我们将COBRAC应用于多个基因表达数据集,以证明其有效性和效率。除了COBRAC的独立版本,我们还开发了一个相关的在线网络服务器,用于对可下载的交互式结果进行在线计算和可视化。

可用性与实现

源代码和测试数据可在https://github.com/haidyi/cvxbiclustr或https://zenodo.org/record/4620218获取。网络服务器可在https://cvxbiclustr.ericchi.com访问。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

2
BiCoN: network-constrained biclustering of patients and omics data.BiCoN:患者与组学数据的网络约束双聚类
Bioinformatics. 2021 Aug 25;37(16):2398-2404. doi: 10.1093/bioinformatics/btaa1076.
6
Convex biclustering.凸双聚类
Biometrics. 2017 Mar;73(1):10-19. doi: 10.1111/biom.12540. Epub 2016 May 10.

本文引用的文献

1
Clustering with t-SNE, provably.使用t-SNE进行聚类,可证明。
SIAM J Math Data Sci. 2019;1(2):313-332. doi: 10.1137/18m1216134. Epub 2019 May 28.
3
Convex biclustering.凸双聚类
Biometrics. 2017 Mar;73(1):10-19. doi: 10.1111/biom.12540. Epub 2016 May 10.
4
Splitting Methods for Convex Clustering.凸聚类的分裂方法
J Comput Graph Stat. 2015;24(4):994-1013. doi: 10.1080/10618600.2014.948181. Epub 2015 Dec 10.
5
Convex clustering: an attractive alternative to hierarchical clustering.凸聚类:层次聚类的一种有吸引力的替代方法。
PLoS Comput Biol. 2015 May 12;11(5):e1004228. doi: 10.1371/journal.pcbi.1004228. eCollection 2015 May.
6
Comprehensive molecular portraits of human breast tumours.人类乳腺肿瘤的全面分子特征图谱。
Nature. 2012 Oct 4;490(7418):61-70. doi: 10.1038/nature11412. Epub 2012 Sep 23.
7
Biclustering via sparse singular value decomposition.基于稀疏奇异值分解的双聚类
Biometrics. 2010 Dec;66(4):1087-95. doi: 10.1111/j.1541-0420.2010.01392.x.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验