Department of Psychiatry & Behavioral Sciences, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
Department of Epidemiology & Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
Qual Life Res. 2019 Jun;28(6):1441-1455. doi: 10.1007/s11136-019-02132-w. Epub 2019 Feb 23.
PURPOSE: As we begin to leverage Big Data in health care settings and particularly in assessing patient-reported outcomes, there is a need for novel analytics to address unique challenges. One such challenge is in coding transcribed interview data, typically free-text entries of statements made during a face-to-face interview. Latent Dirichlet Allocation (LDA) offers statistical rigor and consistency in automating the interpretation of patients' expressed concerns and coping strategies. METHODS: LDA was applied to interview data collected as part of a prospective, longitudinal study of QOL in N = 211 patients undergoing radical cystectomy and urinary diversion for bladder cancer. LDA analyzed personal goal statements to extract the latent topics and themes, stratified by time, and on things patients wanted to accomplish and prevent. Model comparison metrics determined the number of topics to extract. RESULTS: LDA extracted seven latent topics. Prior to surgery, patients' priorities were primarily in cancer surgery and recovery. Six months after the surgery, they were replaced by goals on regaining a sense of normalcy, to resume work, to enjoy life more fully, and to appreciate friends and family more. LDA model parameters showed changing priorities, e.g., immediate concerns on surgery and resuming employment decreased post-surgery and were replaced by concerns over cancer recurrence and a desire to remain healthy and strong. CONCLUSIONS: Novel Big Data analytics such as LDA offer the possibility of summarizing personal goals without the need for conventional fixed-length measures and resource-intensive qualitative data coding.
目的:随着我们开始在医疗保健环境中利用大数据,特别是在评估患者报告的结果方面,需要新的分析方法来解决独特的挑战。其中一个挑战是对转录访谈数据进行编码,这些数据通常是在面对面访谈中陈述的自由文本条目。潜在狄利克雷分配(LDA)为自动解释患者表达的关注点和应对策略提供了统计严谨性和一致性。
方法:LDA 应用于作为膀胱癌根治性膀胱切除术和尿流改道术患者生活质量前瞻性纵向研究的一部分收集的访谈数据。LDA 通过分析个人目标陈述来提取潜在主题和主题,按时间分层,并关注患者想要完成和预防的事情。模型比较指标确定了要提取的主题数量。
结果:LDA 提取了七个潜在主题。在手术前,患者的首要任务主要是癌症手术和康复。手术后六个月,他们的目标被恢复正常感、重新工作、更充分地享受生活以及更欣赏朋友和家人所取代。LDA 模型参数显示了优先级的变化,例如,对手术和恢复工作的直接关注在手术后减少,并被对癌症复发的担忧和保持健康和强壮的愿望所取代。
结论:新型大数据分析方法(如 LDA)提供了无需传统固定长度措施和资源密集型定性数据编码即可总结个人目标的可能性。
Eur Urol. 2002-10
Psychooncology. 2017-2
Health Qual Life Outcomes. 2014-4-7
Front Digit Health. 2025-4-30
Medicine (Baltimore). 2024-11-22
Int J Environ Res Public Health. 2023-2-28
Int J Inf Technol. 2023
Front Public Health. 2022
J Pain Symptom Manage. 2018-2-27
Health Qual Life Outcomes. 2004-3-15