Development and Validation of Coding Algorithms to Identify Patients with Incident Non-Small Cell Lung Cancer in United States Healthcare Claims Data.

Author information

Beyrer Julie, Nelson David R, Sheffield Kristin M, Huang Yu-Jing, Lau Yiu-Keung, Hincapie Ana L

Affiliations

Eli Lilly and Company, Indianapolis, IN, USA.

University of Cincinnati James L. Winkle College of Pharmacy, Cincinnati, OH, USA.

Publication information

Clin Epidemiol. 2023 Jan 12;15:73-89. doi: 10.2147/CLEP.S389824. eCollection 2023.

DOI:10.2147/CLEP.S389824

PMID:36659903

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC9842515/

Abstract

PURPOSE

We sought to develop and validate an incident non-small cell lung cancer (NSCLC) algorithm for United States (US) healthcare claims data. Diagnoses and procedures, but not medications, were incorporated to support longer-term relevance and reliability.

METHODS

Patients with newly diagnosed NSCLC according to the Surveillance, Epidemiology, and End Results (SEER) program served as cases. Controls included patients with newly diagnosed small-cell lung cancer or other lung cancers, plus two 5% random samples: one of patients with other cancers and one of patients without cancer. Algorithms were derived with logistic regression and machine learning methods, either using the entire sample (Approach A) or starting from a previous algorithm for patients with lung cancer (Approach B). Sensitivity, specificity, positive predictive value (PPV), negative predictive value, and F-score (compared across 1000 bootstrap samples) were calculated. Misclassification was evaluated by calculating the odds of selection by the algorithm among true positives and true negatives.
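The performance measures named in the methods can be sketched in a few lines. The example below is a minimal illustration, not the authors' code: it assumes binary labels (1 = NSCLC case per the reference standard, 0 = control), takes the F-score to be the harmonic mean of sensitivity and PPV, and uses simple resampling with replacement for the bootstrap. The function names and the `seed` parameter are hypothetical.

```python
import random


def classification_metrics(y_true, y_pred):
    """Sensitivity, specificity, PPV, NPV and F-score for binary 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    def ratio(num, den):
        # Undefined ratios (empty denominator) are reported as NaN.
        return num / den if den else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
        # Assumed definition: harmonic mean of sensitivity and PPV.
        "f_score": ratio(2 * tp, 2 * tp + fp + fn),
    }


def bootstrap_f_scores(y_true, y_pred, n_boot=1000, seed=0):
    """F-score on n_boot resamples of the patients, drawn with replacement."""
    rng = random.Random(seed)
    n = len(y_true)
    scores = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        resampled = classification_metrics(
            [y_true[i] for i in idx], [y_pred[i] for i in idx]
        )
        scores.append(resampled["f_score"])
    return scores
```

Comparing two algorithms would then amount to computing `bootstrap_f_scores` for each on the same resamples and examining the distribution of the differences; how the authors carried out that comparison is not stated in the abstract.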

RESULTS

The best-performing algorithm used neural networks (Approach B). A 10-variable point-score algorithm was derived from logistic regression (Approach B); its sensitivity was 77.69% and its PPV was 67.61% (F-score = 72.30%). This algorithm was less sensitive for patients who were ≥80 years old, had Medicare follow-up time <3 months, or were missing SEER data on stage, laterality, or site. It was less specific for patients with a SEER primary site of main bronchus, SEER summary stage 2000 of regional by direct extension only, or pre-index chronic pulmonary disease.
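The reported F-score is consistent with the harmonic mean of the reported sensitivity and PPV. The short check below makes that arithmetic explicit; the function name is illustrative, and the assumption is that "F-score" here means the balanced F1 measure.

```python
def f_score(sensitivity, ppv):
    """Harmonic mean of sensitivity (recall) and PPV (precision)."""
    return 2 * sensitivity * ppv / (sensitivity + ppv)


reported_sensitivity = 0.7769  # 77.69% from the abstract
reported_ppv = 0.6761          # 67.61% from the abstract

f1 = f_score(reported_sensitivity, reported_ppv)
print(f"{f1:.2%}")  # prints 72.30%
```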

CONCLUSION

Our study developed and validated a practical, 10-variable, point-based algorithm for identifying incident NSCLC cases in a US claims database based on a previously validated incident lung cancer algorithm.
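To illustrate what a point-based algorithm derived from logistic regression can look like, the sketch below converts regression coefficients into integer points and applies a cutoff. Everything in it is hypothetical: the abstract does not list the 10 variables, their weights, or the threshold, so the variable names, coefficients, and cutoff are placeholders, and the scaling rule (divide by the smallest absolute coefficient, then round) is one common convention rather than the authors' stated method.

```python
# HYPOTHETICAL coefficients for illustration only; not taken from the study.
HYPOTHETICAL_COEFFICIENTS = {
    "example_dx_lung_cancer": 1.8,
    "example_proc_biopsy": 0.9,
    "example_proc_chest_imaging": 0.45,
    "example_dx_other_cancer": -1.35,
}

HYPOTHETICAL_CUTOFF = 5  # placeholder threshold, not from the study


def coefficients_to_points(coefficients):
    """Scale coefficients by the smallest absolute one and round to integers."""
    base = min(abs(b) for b in coefficients.values() if b != 0)
    return {name: round(b / base) for name, b in coefficients.items()}


def total_points(patient_flags, points):
    """Sum the points of every variable flagged as present for a patient."""
    return sum(pts for name, pts in points.items() if patient_flags.get(name, False))


def is_flagged(patient_flags, points, cutoff=HYPOTHETICAL_CUTOFF):
    """Classify a patient as a likely case when the score reaches the cutoff."""
    return total_points(patient_flags, points) >= cutoff


points = coefficients_to_points(HYPOTHETICAL_COEFFICIENTS)
patient = {"example_dx_lung_cancer": True, "example_proc_biopsy": True}
print(points, total_points(patient, points), is_flagged(patient, points))
```

The appeal of this form is that it can be applied to claims data with simple indicator flags and integer addition, without access to the fitted model.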

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1d1/9842515/ec5ec00fbad3/CLEP-15-73-g0001.jpg
