Suppr超能文献

评估行政索赔中识别软组织肉瘤(STS)算法的准确性。

Evaluation of the accuracy of algorithms to identify soft tissue sarcoma (STS) in administrative claims.

作者信息

Princic Nicole, McMorrow Donna, Chan Philip, Hess Lisa

机构信息

IBM Watson Health, 61 Summer Avenue, Reading, Cambridge, MA 01867 USA.

2Eli Lilly and Company, Indianapolis, IN USA.

出版信息

Clin Sarcoma Res. 2020 May 5;10:8. doi: 10.1186/s13569-020-00130-y. eCollection 2020.

Abstract

BACKGROUND

Lack of using a validated algorithm to select patients is a source of selection bias in oncology studies using administrative claims. The objective of this study to evaluate published algorithms to identify patients with soft tissue sarcoma (STS) in administrative claims and to evaluate new algorithms to improved performance.

METHODS

Two cancer populations including STS cases and non-STS controls were selected from the MarketScan Explorys Linked Claims-Electronic Medical Record (EMR) Database between January 1, 2000 and July 31, 2018. Eligible cases had a diagnosis on a clinical record for STS in the EMR while controls had no evidence of STS on any EMR records. Both cases and controls were enrolled in administrative claims during a period of observation and were aged ≥ 18 years. A split sample was used to test and validate algorithms using data from administrative claims. Values for sensitivity, specificity, and positive predictive value (PPV) were calculated for 14 algorithms. Prior literature validating algorithms in administrative claims across other cancer types report both sensitivity and specificity ranging from as low as 73% to as high as 95%. This was used as a benchmark for defining algorithm success.

RESULTS

There were 784 STS cases and 249,062 non-STS cancer controls eligible for analysis. Requiring at least two claims with an ICD-CM diagnosis code for STS achieved a sensitivity of 67% but had a specificity of 72%. Algorithms that required NCCN-recommended systemic treatment for STS improved the specificity to over 90% but dropped the sensitivity to below 20%. Other combinations of diagnostic tests, symptoms, and procedures did not improve performance.

CONCLUSIONS

The algorithms tested in this study sample did not achieve sufficient performance and suggest the ability to accurately identify the STS population in administrative data is problematic. Difficulties are likely due to the origin of STS in a variety of locations, the non-specific symptoms of STS, and the common diagnostic tests recommended to diagnose the disease. Future research applying machine learning to examine timing and patterns of variables that comprise the diagnostic process may further investigate the ability to accurately identify STS cases in claims databases.

摘要

背景

在使用行政索赔数据的肿瘤学研究中,缺乏使用经过验证的算法来选择患者是选择偏倚的一个来源。本研究的目的是评估已发表的算法,以在行政索赔数据中识别软组织肉瘤(STS)患者,并评估新算法以提高其性能。

方法

从2000年1月1日至2018年7月31日的MarketScan Explorys关联索赔-电子病历(EMR)数据库中选取了两个癌症群体,包括STS病例和非STS对照。符合条件的病例在EMR中有STS的临床记录诊断,而对照在任何EMR记录中均无STS证据。病例和对照均在观察期内纳入行政索赔数据,年龄≥18岁。使用分割样本,利用行政索赔数据测试和验证算法。计算了14种算法的敏感性、特异性和阳性预测值(PPV)。先前在其他癌症类型中验证行政索赔算法的文献报道,敏感性和特异性范围低至73%高至95%。这被用作定义算法成功的基准。

结果

有784例STS病例和249,062例非STS癌症对照符合分析条件。要求至少有两条带有STS的ICD-CM诊断代码的索赔记录,敏感性为67%,但特异性为72%。要求对STS进行NCCN推荐的全身治疗的算法将特异性提高到90%以上,但敏感性降至20%以下。其他诊断测试、症状和程序的组合并未提高性能。

结论

本研究样本中测试的算法未达到足够的性能,表明在行政数据中准确识别STS人群存在问题。困难可能是由于STS起源于多种部位、STS的非特异性症状以及推荐用于诊断该疾病的常见诊断测试。未来应用机器学习来检查构成诊断过程的变量的时间和模式的研究,可能会进一步探究在索赔数据库中准确识别STS病例的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/278d/7199315/bfc6b3ac8463/13569_2020_130_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验