Hosseini Masoud, Meade Jonathan, Schnitzius Jamie, Dixon Brian E
School of Informatics and Computing, Department of BioHealth Informatics, Indiana University Regenstrief Institute, Inc.
CreateIT, Inc., Richmond, Indiana, USA.
J Am Med Inform Assoc. 2016 Mar;23(2):317-23. doi: 10.1093/jamia/ocv084. Epub 2015 Jul 13.
Healthcare providers sometimes receive multiple continuity of care documents (CCDs) for a single patient encompassing the patient's various encounters and medical history recorded in different information systems. It is cumbersome for providers to explore different pages of CCDs to find specific data which can be duplicated or even conflicted. This study describes initial steps toward a modular system that integrates and de-duplicates multiple CCDs into one consolidated document for viewing or processing patient-level data.
The authors developed a prototype system to consolidate and de-duplicate CCDs. The system is engineered to be scalable, extensible, and open source. Using a corpus of 150 de-identified CCDs synthetically generated from a single data source with a common vocabulary to represent 50 unique patients, the authors tested the system's performance and output. Performance was measured based on document throughput and reduction in file size and volume of data. The authors further compared the output of the system with manual consolidation and de-duplication. Testing across multiple vendor systems or implementations was not performed.
All of the input CCDs was successfully consolidated, and no data were lost. De-duplication significantly reduced the number of entries in different sections (49% in Problems, 60.6% in Medications, and 79% in Allergies) and reduced the size of the documents (57.5%) as well as the number of lines in each document (58%). The system executed at a rate of approximately 0.009-0.03 s per rule depending on the complexity of the rule.
Given increasing adoption and use of health information exchange (HIE) to share data and information across the care continuum, duplication of information is inevitable. A novel system designed to support automated consolidation and de-duplication of information across clinical documents as they are exchanged shows promise. Future work is needed to expand the capabilities of the system and further test it using heterogeneous vocabularies across multiple HIE scenarios.
医疗服务提供者有时会收到同一患者的多份连续性医疗文档(CCD),这些文档涵盖了患者在不同信息系统中记录的各种诊疗经历和病史。医疗服务提供者要在不同页面的CCD中查找特定数据既繁琐又麻烦,这些数据可能存在重复甚至冲突的情况。本研究描述了迈向模块化系统的初步步骤,该系统可将多个CCD整合并去重,形成一份综合文档,以便查看或处理患者层面的数据。
作者开发了一个用于整合和去重CCD的原型系统。该系统设计为具有可扩展性、可延伸性且为开源。作者使用从单一数据源合成生成的150份去标识化CCD组成的语料库,这些CCD使用通用词汇表来代表50名不同的患者,以此测试系统的性能和输出。基于文档吞吐量、文件大小的缩减以及数据量来衡量性能。作者还将系统的输出与手动整合和去重进行了比较。未在多个供应商系统或实施中进行测试。
所有输入的CCD均成功整合,且无数据丢失。去重显著减少了不同部分的条目数量(问题部分减少49%,用药部分减少60.6%,过敏部分减少79%),并减小了文档大小(57.5%)以及每份文档的行数(58%)。根据规则的复杂程度,系统每条规则的执行速率约为0.009 - 0.03秒。
鉴于越来越多地采用健康信息交换(HIE)来跨连续医疗过程共享数据和信息,信息重复不可避免。一个旨在支持在临床文档交换时对信息进行自动整合和去重的新型系统显示出了前景。未来需要开展工作来扩展系统功能,并在多个HIE场景中使用异构词汇表进一步对其进行测试。