Suppr超能文献

医学大数据:前景与挑战

Medical big data: promise and challenges.

作者信息

Lee Choong Ho, Yoon Hyung-Jin

机构信息

Department of Biomedical Engineering, Seoul National University College of Medicine, Seoul, Korea.

出版信息

Kidney Res Clin Pract. 2017 Mar;36(1):3-11. doi: 10.23876/j.krcp.2017.36.1.3. Epub 2017 Mar 31.

Abstract

The concept of big data, commonly characterized by volume, variety, velocity, and veracity, goes far beyond the data type and includes the aspects of data analysis, such as hypothesis-generating, rather than hypothesis-testing. Big data focuses on temporal stability of the association, rather than on causal relationship and underlying probability distribution assumptions are frequently not required. Medical big data as material to be analyzed has various features that are not only distinct from big data of other disciplines, but also distinct from traditional clinical epidemiology. Big data technology has many areas of application in healthcare, such as predictive modeling and clinical decision support, disease or safety surveillance, public health, and research. Big data analytics frequently exploits analytic methods developed in data mining, including classification, clustering, and regression. Medical big data analyses are complicated by many technical issues, such as missing values, curse of dimensionality, and bias control, and share the inherent limitations of observation study, namely the inability to test causality resulting from residual confounding and reverse causation. Recently, propensity score analysis and instrumental variable analysis have been introduced to overcome these limitations, and they have accomplished a great deal. Many challenges, such as the absence of evidence of practical benefits of big data, methodological issues including legal and ethical issues, and clinical integration and utility issues, must be overcome to realize the promise of medical big data as the fuel of a continuous learning healthcare system that will improve patient outcome and reduce waste in areas including nephrology.

摘要

大数据的概念通常以体量、多样性、速度和准确性为特征,其范畴远不止数据类型,还涵盖数据分析的各个方面,比如生成假设,而非检验假设。大数据关注关联的时间稳定性,而非因果关系,且通常无需潜在概率分布假设。作为待分析素材的医学大数据具有多种特性,这些特性不仅有别于其他学科的大数据,也与传统临床流行病学不同。大数据技术在医疗保健领域有诸多应用,如预测建模和临床决策支持、疾病或安全监测、公共卫生及研究。大数据分析常常采用数据挖掘中开发的分析方法,包括分类、聚类和回归。医学大数据分析因诸多技术问题而变得复杂,如缺失值、维度诅咒和偏差控制,并且存在观察性研究固有的局限性,即无法检验由残余混杂和反向因果关系导致的因果性。近来,倾向得分分析和工具变量分析已被引入以克服这些局限性,且成效显著。要实现医学大数据作为持续学习型医疗系统的动力从而改善患者预后并减少包括肾脏病学领域在内的医疗资源浪费这一前景,必须克服诸多挑战,如缺乏大数据实际效益的证据、包括法律和伦理问题在内的方法学问题,以及临床整合和实用性问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0aa1/5331970/e31b409fbc6d/krcp-36-003f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验