Dipartimento di Scienze di Laboratorio e Infettivologiche, Fondazione Policlinico Universitario A. Gemelli IRCCS, Roma, Italy; Dipartimento di Sicurezza e Bioetica, Sezione Malattie Infettive, Università Cattolica S. Cuore, Roma, Italy.
Fondazione Policlinico Universitario A. Gemelli IRCCS, Roma, Italy.
Comput Methods Programs Biomed. 2022 Apr;217:106655. doi: 10.1016/j.cmpb.2022.106655. Epub 2022 Jan 29.
The COVID-19 pandemic affected healthcare systems worldwide. Predictive models developed by Artificial Intelligence (AI) and based on timely, centralized and standardized real world patient data could improve management of COVID-19 to achieve better clinical outcomes. The objectives of this manuscript are to describe the structure and technologies used to construct a COVID-19 Data Mart architecture and to present how a large hospital has tackled the challenge of supporting daily management of COVID-19 pandemic emergency, by creating a strong retrospective knowledge base, a real time environment and integrated information dashboard for daily practice and early identification of critical condition at patient level. This framework is also used as an informative, continuously enriched data lake, which is a base for several on-going predictive studies.
The information technology framework for clinical practice and research was described. It was developed using SAS Institute software analytics tool and SAS® Vyia® environment and Open-Source environment R ® and Python ® for fast prototyping and modeling. The included variables and the source extraction procedures were presented.
The Data Mart covers a retrospective cohort of 5528 patients with SARS-CoV-2 infection. People who died were older, had more comorbidities, reported more frequently dyspnea at onset, had higher d-dimer, C-reactive protein and urea nitrogen. The dashboard was developed to support the management of COVID-19 patients at three levels: hospital, single ward and individual care level.
The COVID-19 Data Mart based on integration of a large collection of clinical data and an AI-based integrated framework has been developed, based on a set of automated procedures for data mining and retrieval, transformation and integration, and has been embedded in the clinical practice to help managing daily care. Benefits from the availability of a Data Mart include the opportunity to build predictive models with a machine learning approach to identify undescribed clinical phenotypes and to foster hospital networks. A real-time updated dashboard built from the Data Mart may represent a valid tool for a better knowledge of epidemiological and clinical features of COVID-19, especially when multiple waves are observed, as well as for epidemic and pandemic events of the same nature (e. g. with critical clinical conditions leading to severe pulmonary inflammation). Therefore, we believe the approach presented in this paper may find several applications in comparable situations even at region or state levels. Finally, models predicting the course of future waves or new pandemics could largely benefit from network of DataMarts.
COVID-19 大流行影响了全球的医疗体系。基于及时、集中和标准化的真实世界患者数据开发的人工智能(AI)预测模型可以改善 COVID-19 的管理,以实现更好的临床结果。本文的目的是描述构建 COVID-19 数据集市架构所使用的结构和技术,并介绍一家大型医院如何通过创建一个强大的回顾性知识库、一个实时环境和一个综合信息仪表板来应对 COVID-19 大流行紧急情况的日常管理挑战,该仪表板用于日常实践和早期识别患者层面的危急情况。该框架还被用作一个信息丰富、不断丰富的数据湖,为几个正在进行的预测研究提供基础。
介绍了临床实践和研究的信息技术框架。它是使用 SAS Institute 软件分析工具和 SAS®Vyia®环境以及开源环境 R®和 Python®开发的,用于快速原型设计和建模。介绍了包含的变量和源提取过程。
数据集市涵盖了 5528 名 SARS-CoV-2 感染患者的回顾性队列。死亡患者年龄较大,合并症较多,发病时呼吸困难较常见,D-二聚体、C 反应蛋白和尿素氮较高。该仪表板是为了支持 COVID-19 患者在三个层面的管理而开发的:医院、单个病房和个体护理层面。
基于对大量临床数据的集成和基于人工智能的综合框架,开发了 COVID-19 数据集市,该框架基于一套用于数据挖掘和检索、转换和集成的自动化程序,并已嵌入临床实践中,以帮助管理日常护理。数据集市的可用性带来了一些好处,包括使用机器学习方法构建预测模型,以识别未描述的临床表型,并促进医院网络。从数据集市构建的实时更新仪表板可能是更好地了解 COVID-19 的流行病学和临床特征的有效工具,尤其是在观察到多个波次时,以及对具有严重肺部炎症等危急临床情况的类似性质的流行和大流行事件。因此,我们认为本文提出的方法在区域或州一级等类似情况下可能会有多种应用。最后,预测未来波次或新大流行的模型可以从数据集市网络中获益良多。