• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MCLEAN:作为网络的多层次聚类探索

MCLEAN: Multilevel Clustering Exploration As Network.

作者信息

Alcaide Daniel, Aerts Jan

机构信息

Department of Electrical Engineering (ESAT) STADIUS Center for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven, Leuven, Belgium.

imec, KU Leuven, Leuven, Belgium.

出版信息

PeerJ Comput Sci. 2018 Jan 29;4:e145. doi: 10.7717/peerj-cs.145. eCollection 2018.

DOI:10.7717/peerj-cs.145
PMID:33816801
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7924466/
Abstract

Finding useful patterns in datasets has attracted considerable interest in the field of visual analytics. One of the most common tasks is the identification and representation of clusters. However, this is non-trivial in heterogeneous datasets since the data needs to be analyzed from different perspectives. Indeed, highly variable patterns may mask underlying trends in the dataset. Dendrograms are graphical representations resulting from agglomerative hierarchical clustering and provide a framework for viewing the clustering at different levels of detail. However, dendrograms become cluttered when the dataset gets large, and the single cut of the dendrogram to demarcate different clusters can be insufficient in heterogeneous datasets. In this work, we propose a visual analytics methodology called MCLEAN that offers a general approach for guiding the user through the exploration and detection of clusters. Powered by a graph-based transformation of the relational data, it supports a scalable environment for representation of heterogeneous datasets by changing the spatialization. We thereby combine multilevel representations of the clustered dataset with community finding algorithms. Our approach entails displaying the results of the heuristics to users, providing a setting from which to start the exploration and data analysis. To evaluate our proposed approach, we conduct a qualitative user study, where participants are asked to explore a heterogeneous dataset, comparing the results obtained by MCLEAN with the dendrogram. These qualitative results reveal that MCLEAN is an effective way of aiding users in the detection of clusters in heterogeneous datasets. The proposed methodology is implemented in an R package available at https://bitbucket.org/vda-lab/mclean.

摘要

在数据集里寻找有用的模式已经在视觉分析领域引起了相当大的关注。最常见的任务之一是聚类的识别与表示。然而,在异构数据集中这并非易事,因为需要从不同角度分析数据。实际上,高度可变的模式可能会掩盖数据集中的潜在趋势。树形图是凝聚层次聚类产生的图形表示,为查看不同详细程度的聚类提供了一个框架。然而,当数据集变大时,树形图会变得杂乱,并且在异构数据集中,用树形图的单次切割来划分不同聚类可能并不充分。在这项工作中,我们提出了一种名为MCLEAN的视觉分析方法,它提供了一种通用方法来指导用户探索和检测聚类。借助基于关系数据的图形变换,它通过改变空间化支持异构数据集表示的可扩展环境。我们由此将聚类数据集的多级表示与社区发现算法相结合。我们的方法需要向用户展示启发式算法的结果,提供一个开始探索和数据分析的设置。为了评估我们提出的方法,我们进行了一项定性用户研究,要求参与者探索一个异构数据集,将MCLEAN获得的结果与树形图进行比较。这些定性结果表明,MCLEAN是帮助用户在异构数据集中检测聚类的有效方法。所提出的方法在一个R包中实现,可在https://bitbucket.org/vda-lab/mclean获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/6e9486ce5f7b/peerj-cs-04-145-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/7ed6a5d8d78c/peerj-cs-04-145-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/a559659c74a1/peerj-cs-04-145-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/5b62c79b519e/peerj-cs-04-145-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/75691b190f70/peerj-cs-04-145-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/2636391295eb/peerj-cs-04-145-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/071f7c365042/peerj-cs-04-145-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/bd25ab89f24b/peerj-cs-04-145-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/d1f6b842fa75/peerj-cs-04-145-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/0f7c6d98400c/peerj-cs-04-145-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/6e9486ce5f7b/peerj-cs-04-145-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/7ed6a5d8d78c/peerj-cs-04-145-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/a559659c74a1/peerj-cs-04-145-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/5b62c79b519e/peerj-cs-04-145-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/75691b190f70/peerj-cs-04-145-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/2636391295eb/peerj-cs-04-145-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/071f7c365042/peerj-cs-04-145-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/bd25ab89f24b/peerj-cs-04-145-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/d1f6b842fa75/peerj-cs-04-145-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/0f7c6d98400c/peerj-cs-04-145-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b9d/7924466/6e9486ce5f7b/peerj-cs-04-145-g010.jpg

相似文献

1
MCLEAN: Multilevel Clustering Exploration As Network.MCLEAN:作为网络的多层次聚类探索
PeerJ Comput Sci. 2018 Jan 29;4:e145. doi: 10.7717/peerj-cs.145. eCollection 2018.
2
Visual analytics for spatial clustering: using a heuristic approach for guided exploration.空间聚类的可视化分析:使用启发式方法进行引导式探索。
IEEE Trans Vis Comput Graph. 2013 Dec;19(12):2179-88. doi: 10.1109/TVCG.2013.224.
3
dendsort: modular leaf ordering methods for dendrogram representations in R.dendsort:用于R中树形图表示的模块化叶序方法。
F1000Res. 2014 Jul 30;3:177. doi: 10.12688/f1000research.4784.1. eCollection 2014.
4
Clustervision: Visual Supervision of Unsupervised Clustering.聚类视觉监督:无监督聚类的视觉监督。
IEEE Trans Vis Comput Graph. 2018 Jan;24(1):142-151. doi: 10.1109/TVCG.2017.2745085. Epub 2017 Aug 29.
5
HGC: fast hierarchical clustering for large-scale single-cell data.HGC:适用于大规模单细胞数据的快速层次聚类。
Bioinformatics. 2021 Nov 5;37(21):3964-3965. doi: 10.1093/bioinformatics/btab420.
6
How frequently do clusters occur in hierarchical clustering analysis? A graph theoretical approach to studying ties in proximity.在层次聚类分析中,簇出现的频率如何?一种研究邻近关系中联系的图论方法。
J Cheminform. 2016 Jan 25;8:4. doi: 10.1186/s13321-016-0114-x. eCollection 2016.
7
Visual MRI: merging information visualization and non-parametric clustering techniques for MRI dataset analysis.可视化磁共振成像:融合信息可视化与非参数聚类技术用于磁共振成像数据集分析。
Artif Intell Med. 2008 Nov;44(3):183-99. doi: 10.1016/j.artmed.2008.06.006. Epub 2008 Sep 4.
8
Geovisual analytics to enhance spatial scan statistic interpretation: an analysis of U.S. cervical cancer mortality.用于增强空间扫描统计解释的地理可视化分析:美国宫颈癌死亡率分析
Int J Health Geogr. 2008 Nov 7;7:57. doi: 10.1186/1476-072X-7-57.
9
A permutation test for determining significance of clusters with applications to spatial and gene expression data.一种用于确定聚类显著性的置换检验及其在空间和基因表达数据中的应用。
Comput Stat Data Anal. 2009 Oct 1;53(12):4290-4300. doi: 10.1016/j.csda.2009.05.031.
10
DendroX: multi-level multi-cluster selection in dendrograms.DendroX:树状图中的多层次多聚类选择。
BMC Genomics. 2024 Feb 2;25(1):134. doi: 10.1186/s12864-024-10048-0.

引用本文的文献

1
Identification of a Novel Clinical Phenotype of Severe Malaria using a Network-Based Clustering Approach.利用基于网络的聚类方法鉴定严重疟疾的新型临床表型。
Sci Rep. 2018 Aug 27;8(1):12849. doi: 10.1038/s41598-018-31320-w.

本文引用的文献

1
Topological data analysis of biological aggregation models.生物聚集模型的拓扑数据分析
PLoS One. 2015 May 13;10(5):e0126383. doi: 10.1371/journal.pone.0126383. eCollection 2015.
2
Semi-supervised adaptive-height snipping of the hierarchical clustering tree.层次聚类树的半监督自适应高度剪枝
BMC Bioinformatics. 2015 Jan 16;16(1):15. doi: 10.1186/s12859-014-0448-1.
3
dendsort: modular leaf ordering methods for dendrogram representations in R.dendsort:用于R中树形图表示的模块化叶序方法。
F1000Res. 2014 Jul 30;3:177. doi: 10.12688/f1000research.4784.1. eCollection 2014.
4
Spark: a navigational paradigm for genomic data exploration.Spark:一种用于基因组数据探索的导航范例。
Genome Res. 2012 Nov;22(11):2262-9. doi: 10.1101/gr.140665.112. Epub 2012 Sep 7.
5
Constructing overview+detail dendrogram-matrix views.构建概览+详细的树状图-矩阵视图。
IEEE Trans Vis Comput Graph. 2009 Nov-Dec;15(6):889-96. doi: 10.1109/TVCG.2009.130.
6
GrouseFlocks: steerable exploration of graph hierarchy space.松鸡群:图层次结构空间的可控探索
IEEE Trans Vis Comput Graph. 2008 Jul-Aug;14(4):900-13. doi: 10.1109/TVCG.2008.34.
7
Maps of random walks on complex networks reveal community structure.复杂网络上随机游走的图谱揭示了群落结构。
Proc Natl Acad Sci U S A. 2008 Jan 29;105(4):1118-23. doi: 10.1073/pnas.0706851105. Epub 2008 Jan 23.
8
Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R.从层次聚类树定义聚类:用于R的动态树切割软件包。
Bioinformatics. 2008 Mar 1;24(5):719-20. doi: 10.1093/bioinformatics/btm563. Epub 2007 Nov 16.
9
Graph visualization techniques for web clustering engines.用于网络聚类引擎的图形可视化技术。
IEEE Trans Vis Comput Graph. 2007 Mar-Apr;13(2):294-304. doi: 10.1109/TVCG.2007.40.
10
ASK-GraphView: A large scale graph visualization system.ASK-图形视图:一个大规模图形可视化系统。
IEEE Trans Vis Comput Graph. 2006 Sep-Oct;12(5):669-76. doi: 10.1109/TVCG.2006.120.