Dolin R H, Rogers B, Jaffe C
Robert H Dolin, MD, 1368 N Stallion St Orange, CA 92869, USA, E-mail:
Methods Inf Med. 2015;54(1):75-82. doi: 10.3414/ME14-01-0030. Epub 2014 Dec 2.
Describe how the HL7 Clinical Document Architecture (CDA), a foundational standard in US Meaningful Use, contributes to a "big data, incrementally structured" interoperability strategy, whereby data structured incrementally gets large amounts of data flowing faster. We present cases showing how this approach is leveraged for big data analysis.
To support the assertion that semi-structured narrative in CDA format can be a useful adjunct in an overall big data analytic approach, we present two case studies. The first assesses an organization's ability to generate clinical quality reports using coded data alone vs. coded data supplemented by CDA narrative. The second leverages CDA to construct a network model for referral management, from which additional observations can be gleaned.
The first case shows that coded data supplemented by CDA narrative resulted in significant variances in calculated performance scores. In the second case, we found that the constructed network model enables the identification of differences in patient characteristics among different referral work flows.
The CDA approach goes after data indirectly, by focusing first on the flow of narrative, which is then incrementally structured. A quantitative assessment of whether this approach will lead to a greater flow of data and ultimately a greater flow of structured data vs. other approaches is planned as a future exercise.
Along with growing adoption of CDA, we are now seeing the big data community explore the standard, particularly given its potential to supply analytic en- gines with volumes of data previously not possible.
描述美国《有意义使用》中的基础标准HL7临床文档架构(CDA)如何助力“大数据、渐进式结构化”的互操作性策略,即通过渐进式结构化的数据使大量数据流动得更快。我们展示案例来说明这种方法如何用于大数据分析。
为支持CDA格式的半结构化叙述可成为整体大数据分析方法中有用辅助手段这一论断,我们展示两个案例研究。第一个评估一个组织仅使用编码数据与使用CDA叙述补充的编码数据生成临床质量报告的能力。第二个利用CDA构建转诊管理网络模型,从中可收集更多观察结果。
第一个案例表明,用CDA叙述补充的编码数据在计算出的绩效分数上产生了显著差异。在第二个案例中,我们发现构建的网络模型能够识别不同转诊工作流程中患者特征的差异。
CDA方法通过首先关注叙述流来间接处理数据,然后对其进行渐进式结构化。计划在未来进行一项定量评估,以确定与其他方法相比,这种方法是否会带来更大的数据流动,最终带来更大的结构化数据流动。
随着CDA的采用日益增多,我们现在看到大数据界正在探索该标准,特别是考虑到其有潜力为分析引擎提供以前无法获得的大量数据。