Department of Biomedical Informatics, Columbia University, New York, New York, USA.
J Am Med Inform Assoc. 2010 Jan-Feb;17(1):49-53. doi: 10.1197/jamia.M3390.
Although electronic notes have advantages compared to handwritten notes, they take longer to write and promote information redundancy in electronic health records (EHRs). We sought to quantify redundancy in clinical documentation by studying collections of physician notes in an EHR.
We implemented a retrospective design to gather all electronic admission, progress, resident signout and discharge summary notes written during 100 randomly selected patient admissions within a 6 month period. We modified and applied a Levenshtein edit-distance algorithm to align and compare the documents written for each of the 100 admissions. We then identified and measured the amount of text duplicated from previous notes. Finally, we manually reviewed the content that was conserved between note types in a subsample of notes.
We measured the amount of new information in a document, which was calculated as the number of words that did not match with previous documents divided by the length, in words, of the document. Results are reported as the percentage of information in a document that had been duplicated from previously written documents.
Signout and progress notes proved to be particularly redundant, with an average of 78% and 54% information duplicated from previous documents respectively. There was also significant information duplication between document types (eg, from an admission note to a progress note).
The study established the feasibility of exploring redundancy in the narrative record with a known sequence alignment algorithm used frequently in the field of bioinformatics. The findings provide a foundation for studying the usefulness and risks of redundancy in the EHR.
虽然电子病历相较于手写病历具有优势,但电子病历的书写时间更长,且易导致电子病历(EHR)中信息冗余。本研究旨在通过分析 EHR 中的医师病历,量化临床文档中的冗余信息。
我们采用回顾性设计,在 6 个月的时间内,随机选取 100 例患者的所有电子入院记录、病程记录、住院医师交班记录和出院小结,并对其进行收集。我们修改并应用了莱文斯坦编辑距离算法,对 100 例患者的病历进行对齐和比较。然后,我们确定并测量了从之前病历中重复的文本量。最后,我们对笔记类型之间保留的内容进行了子样本手动审查。
我们测量了文档中的新信息量,计算方法是将与之前文档不匹配的字数除以文档的字数,结果以文档中来自之前已写文档的重复信息量的百分比表示。
住院医师交班记录和病程记录的冗余性尤其突出,分别有 78%和 54%的信息来自之前的文档。不同文档类型之间也存在显著的信息重复(例如,入院记录到病程记录)。
本研究采用生物信息学领域常用的已知序列比对算法,证实了探索叙事记录中冗余性的可行性。该研究为研究 EHR 中冗余信息的有用性和风险提供了基础。