使用新型核k均值机器学习算法进行治疗过程聚类：双相I型障碍保险理赔的回顾性分析

Treatment journey clustering with a novel kernel k-means machine learning algorithm: a retrospective analysis of insurance claims in bipolar I disorder.

作者信息

Littman Matthew, Nguyen Huy-Binh, Campbell Joanna, Keyloun Katelyn R

机构信息

AbbVie, North Chicago, IL, USA.

出版信息

Brain Inform. 2025 May 22;12(1):12. doi: 10.1186/s40708-025-00258-x.

DOI:10.1186/s40708-025-00258-x

PMID:40402327

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12098244/

Abstract

In real-world psychiatric practice, patients may experience complex treatment journeys, including various diagnoses and lines of therapy. Insurance claims databases could potentially provide insight into outcomes of psychiatric treatment processes, but the diversity of event sequences restricts analyses with currently available methods. Here, we developed a novel kernel k-means clustering algorithm for event sequences that can accommodate highly diverse event types and sequence lengths. The approach, Divisive Optimized Clustering using Kernel K-means for Event Sequences (DOCKKES), also leverages a novel performance metric, the transition score, which measures sequence coherence in individual clusters. The performance of DOCKKES was evaluated in the context of bipolar I disorder, which is characterized by heterogeneous treatment journeys. We conducted a retrospective, observational analysis of a large sample (n = 31,578) of patients with bipolar I disorder from the MarketScan® Commercial Database. Using insurance claims, bipolar episode diagnoses and mental health-related lines of therapy were identified as events of interest for patient clustering. The dataset included 202,122 events; 75% of the cohort experienced unique treatment journeys. Based on an optimal run, DOCKKES identified 16 treatment journey clusters, which were evenly split for initial manic/mixed or depressive episodes (8 clusters each) and varied in sequence length and early lines of therapy. Variability across clusters was also observed for demographics, comorbidities, and mental health-related healthcare resource utilization and cost. This proof-of-concept study demonstrated the use of DOCKKES for integrating information from large datasets, enabling comparisons between patient clusters and evaluation of real-world treatment journeys in the context of evidence-based guidelines.

摘要

在现实世界的精神科实践中，患者可能会经历复杂的治疗过程，包括各种诊断和治疗方案。保险理赔数据库有可能提供对精神科治疗过程结果的洞察，但事件序列的多样性限制了使用现有方法进行的分析。在此，我们开发了一种用于事件序列的新型核k均值聚类算法，该算法可以适应高度多样化的事件类型和序列长度。这种方法，即使用核k均值的事件序列分裂优化聚类（DOCKKES），还利用了一种新型性能指标——转换分数，该指标用于衡量各个聚类中的序列连贯性。在以异质治疗过程为特征的双相I型障碍背景下评估了DOCKKES的性能。我们对来自MarketScan®商业数据库的大量双相I型障碍患者样本（n = 31,578）进行了回顾性观察分析。利用保险理赔，将双相情感发作诊断和心理健康相关治疗方案确定为患者聚类的感兴趣事件。该数据集包含202,122个事件；75%的队列经历了独特的治疗过程。基于一次最优运行，DOCKKES识别出16个治疗过程聚类，这些聚类在初始躁狂/混合发作或抑郁发作时平均分配（各8个聚类），并且在序列长度和早期治疗方案方面存在差异。在人口统计学、合并症以及心理健康相关医疗资源利用和成本方面，各聚类之间也观察到了变异性。这项概念验证研究证明了使用DOCKKES整合来自大型数据集的信息，能够在基于证据的指南背景下比较患者聚类并评估现实世界的治疗过程。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/496b/12098244/1b88997facd4/40708_2025_258_Fig1_HTML.jpg

相似文献

Treatment journey clustering with a novel kernel k-means machine learning algorithm: a retrospective analysis of insurance claims in bipolar I disorder.

Brain Inform. 2025 May 22;12(1):12. doi: 10.1186/s40708-025-00258-x.

Letter to the Editor: CONVERGENCES AND DIVERGENCES IN THE ICD-11 VS. DSM-5 CLASSIFICATION OF MOOD DISORDERS.

Turk Psikiyatri Derg. 2021;32(4):293-295. doi: 10.5080/u26899.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

The bird's-eye view: A data-driven approach to understanding patient journeys from claims data.

J Am Med Inform Assoc. 2020 Jul 1;27(7):1037-1045. doi: 10.1093/jamia/ocaa052.

Health Care Resource Use, Costs, and Diagnosis Patterns in Patients With Schizophrenia and Bipolar Disorder: Real-world Evidence From US Claims Databases.

Clin Ther. 2018 Oct;40(10):1670-1682. doi: 10.1016/j.clinthera.2018.08.004. Epub 2018 Sep 5.

Machine learning clustering of adult spinal deformity patients identifies four prognostic phenotypes: a multicenter prospective cohort analysis with single surgeon external validation.

Spine J. 2024 Jun;24(6):1095-1108. doi: 10.1016/j.spinee.2024.02.010. Epub 2024 Feb 15.

Identifying and evaluating clinical subtypes of Alzheimer's disease in care electronic health records using unsupervised machine learning.

BMC Med Inform Decis Mak. 2021 Dec 8;21(1):343. doi: 10.1186/s12911-021-01693-6.

Estimating Determinants of Multiple Treatment Episodes for Substance Abusers.

J Ment Health Policy Econ. 2001 Jun 1;4(2):65-77.

[Antipsychotics in bipolar disorders].

Encephale. 2004 Sep-Oct;30(5):417-24. doi: 10.1016/s0013-7006(04)95456-5.

Temporal phenotyping by mining healthcare data to derive lines of therapy for cancer.

J Biomed Inform. 2019 Dec;100:103335. doi: 10.1016/j.jbi.2019.103335. Epub 2019 Nov 2.

本文引用的文献

A data-driven cluster analysis to explore cognitive reserve and modifiable risk factors in early phases of cognitive decline.

Sci Rep. 2025 Feb 7;15(1):4616. doi: 10.1038/s41598-025-88340-6.

Clustering Methods in Rheumatic and Musculoskeletal Disease Research: An Educational Guide to Best Research Practices.

J Rheumatol. 2024 Dec 1;51(12):1160-1168. doi: 10.3899/jrheum.2024-0519.

M-ClustEHR: A multimodal clustering approach for electronic health records.

Artif Intell Med. 2024 Aug;154:102905. doi: 10.1016/j.artmed.2024.102905. Epub 2024 Jun 6.

Novel Machine Learning Identifies 5 Asthma Phenotypes Using Cluster Analysis of Real-World Data.

J Allergy Clin Immunol Pract. 2024 Aug;12(8):2084-2091.e4. doi: 10.1016/j.jaip.2024.04.035. Epub 2024 Apr 27.

An overview of clustering methods with guidelines for application in mental health research.

Psychiatry Res. 2023 Sep;327:115265. doi: 10.1016/j.psychres.2023.115265. Epub 2023 May 27.

Implicit Annealing in Kernel Spaces: A Strongly Consistent Clustering Approach.

IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):5862-5871. doi: 10.1109/TPAMI.2022.3217137. Epub 2023 Apr 3.

The clinical characterization of the adult patient with bipolar disorder aimed at personalization of management.

World Psychiatry. 2022 Oct;21(3):364-387. doi: 10.1002/wps.20997.

Treatment Patterns Among Patients with Bipolar Disorder in the United States: A Retrospective Claims Database Analysis.

Adv Ther. 2022 Jun;39(6):2578-2595. doi: 10.1007/s12325-022-02112-6. Epub 2022 Apr 6.

Silhouette Analysis for Performance Evaluation in Machine Learning with Applications to Clustering.

Entropy (Basel). 2021 Jun 16;23(6):759. doi: 10.3390/e23060759.

Risk stratification of cardiovascular complications using CHADS-VASc and CHADS scores in chronic atherosclerotic cardiovascular disease.

Int J Cardiol. 2021 Aug 15;337:9-15. doi: 10.1016/j.ijcard.2021.04.067. Epub 2021 May 3.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用新型核k均值机器学习算法进行治疗过程聚类：双相I型障碍保险理赔的回顾性分析

Treatment journey clustering with a novel kernel k-means machine learning algorithm: a retrospective analysis of insurance claims in bipolar I disorder.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献