Selection Bias Tracking and Detailed Subset Comparison for High-Dimensional Data.

Publication information

IEEE Trans Vis Comput Graph. 2020 Jan;26(1):429-439. doi: 10.1109/TVCG.2019.2934209. Epub 2019 Aug 20.

DOI:10.1109/TVCG.2019.2934209
PMID:31442975
Abstract

The collection of large, complex datasets has become common across a wide variety of domains. Visual analytics tools increasingly play a key role in exploring and answering complex questions about these large datasets. However, many visualizations are not designed to concurrently visualize the large number of dimensions present in complex datasets (e.g. tens of thousands of distinct codes in an electronic health record system). This fact, combined with the ability of many visual analytics systems to enable rapid, ad-hoc specification of groups, or cohorts, of individuals based on a small subset of visualized dimensions, leads to the possibility of introducing selection bias: when the user creates a cohort based on a specified set of dimensions, differences across many other unseen dimensions may also be introduced. These unintended side effects may result in the cohort no longer being representative of the larger population intended to be studied, which can negatively affect the validity of subsequent analyses. We present techniques for selection bias tracking and visualization that can be incorporated into high-dimensional exploratory visual analytics systems, with a focus on medical data with existing data hierarchies. These techniques include: (1) tree-based cohort provenance and visualization, including a user-specified baseline cohort that all other cohorts are compared against, and visual encoding of cohort "drift", which indicates where selection bias may have occurred, and (2) a set of visualizations, including a novel icicle-plot based visualization, to compare in detail the per-dimension differences between the baseline and a user-specified focus cohort. These techniques are integrated into a medical temporal event sequence visual analytics tool. We present example use cases and report findings from domain expert user interviews.
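The idea of comparing a focus cohort against a baseline on every dimension, including those the user never looked at, can be illustrated with a small sketch. The abstract does not say how per-dimension differences or cohort "drift" are computed, so everything below is an assumption made for illustration: each patient is modeled as a set of codes, the per-dimension difference is taken as the Hellinger distance between the two prevalence rates, and drift is taken as the mean of those distances. The function names (`prevalence`, `hellinger_bernoulli`, `compare_cohorts`, `drift`) and the toy data are hypothetical.

```python
import math

def prevalence(cohort, dimension):
    """Fraction of patients in the cohort whose code set contains the dimension."""
    if not cohort:
        return 0.0
    return sum(1 for codes in cohort if dimension in codes) / len(cohort)

def hellinger_bernoulli(p, q):
    """Hellinger distance between Bernoulli(p) and Bernoulli(q); lies in [0, 1].

    Assumption: used here as a stand-in difference measure, not taken from the abstract.
    """
    return math.sqrt(
        ((math.sqrt(p) - math.sqrt(q)) ** 2
         + (math.sqrt(1 - p) - math.sqrt(1 - q)) ** 2) / 2
    )

def compare_cohorts(baseline, focus):
    """Return {dimension: (baseline_rate, focus_rate, distance)} for every code seen."""
    dimensions = set().union(*baseline, *focus)
    report = {}
    for dim in sorted(dimensions):
        p = prevalence(baseline, dim)
        q = prevalence(focus, dim)
        report[dim] = (p, q, hellinger_bernoulli(p, q))
    return report

def drift(report):
    """Hypothetical summary score: mean per-dimension distance (0 means no shift)."""
    if not report:
        return 0.0
    return sum(d for _, _, d in report.values()) / len(report)

# Toy baseline cohort: each patient is a set of (made-up) diagnosis codes.
baseline = [
    {"diabetes", "hypertension"},
    {"hypertension"},
    {"asthma"},
    {"diabetes"},
]

# The user filters on one visible dimension only ...
focus = [codes for codes in baseline if "diabetes" in codes]

# ... yet the comparison also reveals shifts in dimensions that were not filtered on.
report = compare_cohorts(baseline, focus)
for dim, (p, q, dist) in report.items():
    print(f"{dim:13s} baseline={p:.2f} focus={q:.2f} distance={dist:.3f}")
print(f"drift={drift(report):.3f}")
```

In this toy data, filtering on `diabetes` leaves the `hypertension` rate unchanged but removes every `asthma` patient, which is the kind of unintended side effect the abstract describes. A real system would also need to account for the code hierarchy and cohort size, both of which this sketch ignores.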
