Kwak Gloria Hyunjung, Moukheiber Dana, Moukheiber Mira, Moukheiber Lama, Moukheiber Sulaiman, Butala Neel M, Celi Leo A, Chen Christina W
Harvard Medical School, Boston, MA, USA.
Massachusetts General Hospital, Boston, MA, USA.
Sci Data. 2025 Jul 5;12(1):1153. doi: 10.1038/s41597-025-04849-5.
The EchoNotes Structured Database derived from MIMIC-III (ECHO-NOTE2NUM) is a structured echocardiogram database derived from 43,472 observational notes obtained during echocardiogram studies conducted in the intensive care unit at the Beth Israel Deaconess Medical Center between 2001 and 2012. The database encompasses various aspects of cardiac structure and function, including cavity size, wall thickness, systolic and diastolic function, valve regurgitation and stenosis, as well as pulmonary pressures. To facilitate extensive data analysis, the clinical notes were transformed into a structured numerical format. Within each echocardiogram report sentence, specific words or phrases were identified to describe abnormal findings, and a severity staging system using numeric categories was established. This large publicly accessible database of structured echocardiogram data holds significant potential as a tool to investigate cardiovascular disease in the intensive care unit and as a reference point for future note-based structured databases. Moreover, its structured nature allows for easy integration with other data types in MIMIC, such as images or vital signs, enabling large-scale data analysis and further advancements in this field.
源自MIMIC-III的EchoNotes结构化数据库(ECHO-NOTE2NUM)是一个结构化超声心动图数据库,它源自2001年至2012年期间在贝斯以色列女执事医疗中心重症监护病房进行的超声心动图研究中获得的43472份观察记录。该数据库涵盖心脏结构和功能的各个方面,包括腔室大小、壁厚、收缩和舒张功能、瓣膜反流和狭窄,以及肺压力。为便于进行广泛的数据分析,临床记录被转换为结构化数字格式。在每份超声心动图报告句子中,识别出描述异常发现的特定单词或短语,并建立了使用数字类别的严重程度分级系统。这个大型的可公开访问的结构化超声心动图数据数据库作为研究重症监护病房中心血管疾病的工具以及未来基于记录的结构化数据库的参考点具有巨大潜力。此外,其结构化性质允许轻松与MIMIC中的其他数据类型(如图像或生命体征)集成,从而实现大规模数据分析并推动该领域的进一步发展。