Department of Surgery, College of Medicine, University of Florida, Gainesville, Florida.
Department of Epidemiology, College of Public Health & Health Professions and College of Medicine, University of Florida, Gainesville, Florida.
J Surg Res. 2020 Feb;246:599-604. doi: 10.1016/j.jss.2019.09.053. Epub 2019 Oct 22.
As more health systems have converted to electronic health records, the amount of searchable and analyzable data is exploding. This includes not just provider- or laboratory-created data but also data collected by instruments, personal devices, and patients themselves, among others. As a result, more attention is being paid to analyzing these data to answer previously unaddressed questions. This is especially important given the number of therapies previously found to be beneficial in clinical trials that are currently being re-scrutinized. Because these data sets contain orders of magnitude more information, a fundamentally different approach is needed for their processing and analysis and for the generation of knowledge. Health care and medicine are drivers of this phenomenon and will ultimately be the main beneficiaries. Concurrently, many different types of questions can now be asked using these data sets. Research groups have become increasingly active in mining large data sets, including nationwide health care databases, to learn about associations between medication use and various unrelated diseases such as cancer. Given the recent increase in research activity in this area, its promise to radically change clinical research, and the relative lack of widespread knowledge about its potential and advances, we surveyed the available literature to understand the strengths and limitations of these new tools. We also outline new databases and techniques that are available to researchers worldwide, with special focus on work pertaining to the broad and rapid monitoring of drug safety and secondary effects.