Catlow Jamie, Bray Benjamin, Morris Eva, Rutter Matt
Population Health Sciences Institute, Newcastle University, Newcastle upon Tyne, UK.
Gastroenterology, University Hospital of North Tees, Stockton-on-Tees, UK.
Frontline Gastroenterol. 2021 May 28;13(3):237-244. doi: 10.1136/flgastro-2019-101239. eCollection 2022.
Big data is defined as being large, varied or frequently updated, and usually generated from real-world interaction. With the unprecedented availability of big data, comes an obligation to maximise its potential for healthcare improvements in treatment effectiveness, disease prevention and healthcare delivery. We review the opportunities and challenges that big data brings to gastroenterology. We review its sources for healthcare improvement in gastroenterology, including electronic medical records, patient registries and patient-generated data. Big data can complement traditional research methods in hypothesis generation, supporting studies and disseminating findings; and in some cases holds distinct advantages where traditional trials are unfeasible. There is great potential power in patient-level linkage of datasets to help quantify inequalities, identify best practice and improve patient outcomes. We exemplify this with the UK colorectal cancer repository and the potential of linkage using the National Endoscopy Database, the inflammatory bowel disease registry and the National Health Service bowel cancer screening programme. Artificial intelligence and machine learning are increasingly being used to improve diagnostics in gastroenterology, with image analysis entering clinical practice, and the potential of machine learning to improve outcome prediction and diagnostics in other clinical areas. Big data brings issues with large sample sizes, real-world biases, data curation, keeping clinical context at analysis and General Data Protection Regulation compliance. There is a tension between our obligation to use data for the common good and protecting individual patient's data. We emphasise the importance of engaging with our patients to enable them to understand their data usage as fully as they wish.
大数据被定义为规模巨大、种类多样或频繁更新,且通常源自现实世界的交互。随着大数据前所未有的可得性,随之而来的是一项义务,即最大限度地发挥其在提高治疗效果、疾病预防和医疗服务方面改善医疗保健的潜力。我们回顾了大数据给胃肠病学带来的机遇和挑战。我们审视了其在胃肠病学中改善医疗保健的来源,包括电子病历、患者登记册和患者生成的数据。大数据可以在假设生成、支持研究和传播研究结果方面补充传统研究方法;在某些情况下,在传统试验不可行时具有明显优势。数据集在患者层面的关联具有巨大的潜在力量,有助于量化不平等现象、确定最佳实践并改善患者预后。我们以英国结直肠癌储存库以及利用国家内窥镜数据库、炎症性肠病登记册和国民保健服务肠癌筛查计划进行关联的潜力为例进行说明。人工智能和机器学习越来越多地用于改善胃肠病学诊断,图像分析已进入临床实践,机器学习在改善其他临床领域的预后预测和诊断方面也具有潜力。大数据带来了大样本量、现实世界偏差、数据管理、在分析时保留临床背景以及遵守《通用数据保护条例》等问题。我们在将数据用于公共利益的义务与保护个体患者数据之间存在矛盾。我们强调与患者互动的重要性,以使他们能够尽可能充分地了解其数据的使用情况。