Division of Healthcare Policy and Research, Mayo Clinic, 200 First Street Southwest, Rochester, MN 55905, USA.
Int J Qual Health Care. 2010 Jun;22(3):229-35. doi: 10.1093/intqhc/mzq012. Epub 2010 Mar 27.
To determine whether text mining can accurately detect specific follow-up appointment criteria in free-text hospital discharge records.
Cross-sectional study.
Mayo Clinic Rochester hospitals.
Inpatients discharged from general medicine services in 2006 (n = 6481).
Textual hospital dismissal summaries were manually reviewed to determine whether the records contained specific follow-up appointment arrangement elements: date, time and either physician or location for an appointment. The data set was evaluated for the same criteria using SAS Text Miner software. The two assessments were compared to determine the accuracy of text mining for detecting records containing follow-up appointment arrangements.
Agreement of text-mined appointment findings with gold standard (manual abstraction) including sensitivity, specificity, positive predictive and negative predictive values (PPV and NPV).
About 55.2% (3576) of discharge records contained all criteria for follow-up appointment arrangements according to the manual review, 3.2% (113) of which were missed through text mining. Text mining incorrectly identified 3.7% (107) follow-up appointments that were not considered valid through manual review. Therefore, the text mining analysis concurred with the manual review in 96.6% of the appointment findings. Overall sensitivity and specificity were 96.8 and 96.3%, respectively; and PPV and NPV were 97.0 and 96.1%, respectively.
of individual appointment criteria resulted in accuracy rates of 93.5% for date, 97.4% for time, 97.5% for physician and 82.9% for location.
Text mining of unstructured hospital dismissal summaries can accurately detect documentation of follow-up appointment arrangement elements, thus saving considerable resources for performance assessment and quality-related research.
确定文本挖掘是否能准确地从医院出院记录的自由文本中检测到特定的随访预约标准。
横断面研究。
梅奥诊所罗切斯特医院。
2006 年从普通内科出院的住院患者(n=6481)。
手动审查住院患者的出院摘要,以确定记录是否包含特定的随访预约安排要素:日期、时间以及预约的医生或地点。使用 SAS Text Miner 软件评估数据集是否符合相同的标准。比较这两种评估方法,以确定文本挖掘在检测包含随访预约安排的记录方面的准确性。
文本挖掘预约结果与黄金标准(手动提取)的一致性,包括敏感性、特异性、阳性预测值和阴性预测值(PPV 和 NPV)。
根据手动审查,大约 55.2%(3576)的出院记录包含了所有随访预约安排的标准,其中 3.2%(113)的记录未被文本挖掘识别。文本挖掘错误地识别出 3.7%(107)的随访预约,这些预约未被手动审查认为是有效的。因此,文本挖掘分析与手动审查在 96.6%的预约结果上是一致的。总体敏感性和特异性分别为 96.8%和 96.3%,阳性预测值和阴性预测值分别为 97.0%和 96.1%。
对个别预约标准的分析结果表明,日期的准确率为 93.5%,时间的准确率为 97.4%,医生的准确率为 97.5%,地点的准确率为 82.9%。
对非结构化的医院出院摘要进行文本挖掘,可以准确地检测到随访预约安排要素的文档,从而为绩效评估和质量相关研究节省大量资源。