Floyd Sarah B, Almeldien Ahmed G, Smith D Hudson, Judkins Benjamin, Krohn Claire E, Reynolds Zachary Cole, Jeray Kyle, Obeid Jihad S
Department of Public Health Sciences, Clemson University, Clemson, SC, United States.
Biomedical Data Science and Informatics Program, Clemson University, Clemson, SC, United States.
Front Digit Health. 2025 Apr 24;7:1523953. doi: 10.3389/fdgth.2025.1523953. eCollection 2025.
A readily available outcome measure that reflects the success of a patient's treatment is needed to demonstrate the value of orthopaedic interventions. Patient-reported outcome measures (PROMs) are survey-based instruments that collect joint-specific and general health perceptions on symptoms, functioning, and health-related quality of life. PROMs are considered the gold standard outcome measure in orthopaedic medicine, but their use is limited in real-world practice due to challenges with technology integration, the pace of clinic workflows, and patient compliance. Clinical notes generated during each encounter patients have with their physician contain rich information on current disease symptoms, rehabilitation progress, and unexpected complications. Artificial intelligence (AI) methods can be used to identify phrases of treatment success or failure captured in clinical notes and discern an indicator of treatment success for orthopaedic patients.
This was a cross-sectional analysis of clinical notes from a sample of patients with an acute shoulder injury. The study included adult patients presenting to a Level-1 Trauma Center and regional health system for an acute Proximal Humerus Fracture (PHF) between January 1, 2019 and December 31, 2021. We used the progress note from the office visit for PHF-related care (ICD10: S42.2XXX) or shoulder pain (ICD10: M45.2XXX) closest to 1-year after the injury date. Clinical notes were reviewed by an orthopaedic resident and labeled as treatment success or failure. A structured comparative analysis of classifiers including both machine and deep learning algorithms was performed.
The final sample included 868 clinical notes from patients treated by 123 physicians across 35 departments within one regional health system. The study sample was stratified into 465 notes labeled as treatment success and 403 labeled as treatment failure. The Bio-ClinicalBERT model had the highest performance of 87% accuracy (AUC = 0.87 ± 0.04) in correctly distinguishing between treatment success and failure notes.
Our results suggest that text classifiers applied to clinical notes are capable of differentiating patients with successful treatment outcomes with high levels of accuracy. This finding is encouraging, signaling that routinely collected clinical note content may serve as a data source to develop an outcome measure for orthopaedic patients.
需要一种易于获得的、能反映患者治疗成功与否的结果指标,以证明骨科干预措施的价值。患者报告的结果指标(PROMs)是基于调查的工具,用于收集患者对症状、功能以及与健康相关的生活质量的关节特定和总体健康认知。PROMs被认为是骨科医学中的金标准结果指标,但由于技术整合、门诊工作流程节奏以及患者依从性等方面的挑战,其在实际临床实践中的应用受到限制。患者每次与医生就诊时生成的临床记录包含有关当前疾病症状、康复进展和意外并发症的丰富信息。人工智能(AI)方法可用于识别临床记录中所体现的治疗成功或失败的表述,并辨别骨科患者治疗成功的指标。
这是一项对急性肩部损伤患者样本的临床记录进行的横断面分析。该研究纳入了2019年1月1日至2021年12月31日期间因急性肱骨近端骨折(PHF)到一级创伤中心和区域卫生系统就诊的成年患者。我们使用了与PHF相关护理(ICD10:S42.2XXX)或肩部疼痛(ICD10:M45.2XXX)的门诊就诊病程记录,选取最接近受伤日期后1年的记录。临床记录由一名骨科住院医师进行审查,并标记为治疗成功或失败。对包括机器学习和深度学习算法在内的分类器进行了结构化比较分析。
最终样本包括来自一个区域卫生系统内35个科室的123名医生治疗的患者的868份临床记录。研究样本被分为465份标记为治疗成功的记录和403份标记为治疗失败的记录。Bio-ClinicalBERT模型在正确区分治疗成功和失败记录方面表现最佳,准确率为87%(AUC = 0.87 ± 0.04)。
我们的结果表明,应用于临床记录的文本分类器能够高精度地区分治疗结果成功的患者。这一发现令人鼓舞,表明常规收集的临床记录内容可作为为骨科患者开发结果指标的数据源。