• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

可视化数据挖掘

Visual data mining.

作者信息

Wegman Edward J

机构信息

Center for Computational Statistics, George Mason University, Fairfax, VA 22030-4444, USA.

出版信息

Stat Med. 2003 May 15;22(9):1383-97. doi: 10.1002/sim.1502.

DOI:10.1002/sim.1502
PMID:12704604
Abstract

Data mining strategies are usually applied to opportunistically collected data and frequently focus on the discovery of structure such as clusters, bumps, trends, periodicities, associations and correlations, quantization and granularity, and other structures for which a visual data analysis is very appropriate and quite likely to yield insight. However, data mining strategies are often applied to massive data sets where visualization may not be very successful because of the limits of both screen resolution, human visual system resolution as well as the limits of available computational resources. In this paper I suggest some strategies for overcoming such limitations and illustrate visual data mining with some examples of successful attacks on high-dimensional and large data sets.

摘要

数据挖掘策略通常应用于机会性收集的数据,并且经常专注于发现诸如聚类、峰值、趋势、周期性、关联和相关性、量化和粒度等结构,以及其他视觉数据分析非常适用且很可能产生见解的结构。然而,数据挖掘策略经常应用于海量数据集,在这些数据集中,由于屏幕分辨率、人类视觉系统分辨率的限制以及可用计算资源的限制,可视化可能不太成功。在本文中,我提出了一些克服此类限制的策略,并通过对高维和大数据集成功进行分析的一些示例来说明可视化数据挖掘。

相似文献

1
Visual data mining.可视化数据挖掘
Stat Med. 2003 May 15;22(9):1383-97. doi: 10.1002/sim.1502.
2
InfVis--platform-independent visual data mining of multidimensional chemical data sets.InfVis——多维化学数据集的平台无关型可视化数据挖掘
J Chem Inf Model. 2005 Sep-Oct;45(5):1456-67. doi: 10.1021/ci050202k.
3
Assessment of hierarchical clustering methodologies for proteomic data mining.蛋白质组学数据挖掘中层次聚类方法的评估
J Proteome Res. 2007 Jan;6(1):358-66. doi: 10.1021/pr060343h.
4
Improving pattern discovery and visualization of SAGE data through poisson-based self-adaptive neural networks.
IEEE Trans Inf Technol Biomed. 2008 Jul;12(4):459-69. doi: 10.1109/TITB.2007.901208.
5
Open-source tools for data mining.用于数据挖掘的开源工具。
Clin Lab Med. 2008 Mar;28(1):37-54, vi. doi: 10.1016/j.cll.2007.10.002.
6
Mining coherent dense subgraphs across massive biological networks for functional discovery.在海量生物网络中挖掘连贯密集子图以进行功能发现。
Bioinformatics. 2005 Jun;21 Suppl 1:i213-21. doi: 10.1093/bioinformatics/bti1049.
7
Bioinformatics and data mining in proteomics.蛋白质组学中的生物信息学与数据挖掘
Expert Rev Proteomics. 2006 Jun;3(3):333-43. doi: 10.1586/14789450.3.3.333.
8
An insight-based methodology for evaluating bioinformatics visualizations.一种基于洞察的生物信息学可视化评估方法。
IEEE Trans Vis Comput Graph. 2005 Jul-Aug;11(4):443-56. doi: 10.1109/TVCG.2005.53.
9
The extraction of information and knowledge from trained neural networks.从经过训练的神经网络中提取信息和知识。
Methods Mol Biol. 2008;458:231-48.
10
SNP-VISTA: an interactive SNP visualization tool.SNP-VISTA:一种交互式单核苷酸多态性可视化工具。
BMC Bioinformatics. 2005 Dec 8;6:292. doi: 10.1186/1471-2105-6-292.

引用本文的文献

1
Statistical challenges of high-dimensional data.高维数据的统计挑战。
Philos Trans A Math Phys Eng Sci. 2009 Nov 13;367(1906):4237-53. doi: 10.1098/rsta.2009.0159.
2
Space-time visualization and analysis in the Cancer Atlas Viewer.癌症图谱浏览器中的时空可视化与分析。
J Geogr Syst. 2005 May;7(1):67-84. doi: 10.1007/s10109-005-0150-y.
3
d-matrix - database exploration, visualization and analysis.d矩阵 - 数据库探索、可视化与分析
BMC Bioinformatics. 2004 Oct 28;5:168. doi: 10.1186/1471-2105-5-168.