• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用自我报告数据补充索赔数据分析,以开发当前吸烟状况的概率表型模型。

Supplementing claims data analysis using self-reported data to develop a probabilistic phenotype model for current smoking status.

机构信息

Janssen Research and Development, Titusville, NJ, USA.

Erasmus MC, Rotterdam, the Netherlands.

出版信息

J Biomed Inform. 2019 Sep;97:103264. doi: 10.1016/j.jbi.2019.103264. Epub 2019 Aug 3.

DOI:10.1016/j.jbi.2019.103264
PMID:31386904
Abstract

OBJECTIVES

Smoking status is poorly record in US claims data. IBM MarketScan Commercial is a claims database that can be linked to an additional health risk assessment with self-reported smoking status for a subset of 1,966,174 patients. We investigate whether this subset could be used to learn a smoking status phenotype model generalizable to all US claims data that calculates the probability of being a current smoker.

METHODS

251,643 (12.8%) had self-reported their smoking status as 'current smoker'. A regularized logistic regression model, the Current Risk of Smoking Status (CROSS), was trained using the subset of patients with self-reported smoking status. CROSS considered 53,027 candidate covariates including demographics and conditions/drugs/measurements/procedures/observations recorded in the prior 365 days, The CROSS phenotype model was validated across multiple other claims data.

RESULTS

The internal validation showed the CROSS model achieved an area under the receiver operating characteristic curve (AUC) of 0.76 and the calibration plots indicated it was well calibrated. The external validation across three US claims databases obtained AUCs ranging between 0.82 and 0.87 showing the model appears to be transportable across Claims data.

CONCLUSION

CROSS predicts current smoking status based on the claims records in the prior year. CROSS can be readily implemented to any US insurance claims mapped to the OMOP common data model and will be a useful way to impute smoking status when conducting epidemiology studies where smoking is a known confounder but smoking status is not recorded. CROSS is available from https://github.com/OHDSI/StudyProtocolSandbox/tree/master/SmokingModel.

摘要

目的

美国索赔数据中吸烟状况记录不佳。IBM MarketScan 商业是一个索赔数据库,它可以与一小部分(1966174 名患者中的)自我报告的吸烟状况的额外健康风险评估相链接。我们调查了该子集是否可以用于学习吸烟状况表型模型,该模型可推广到所有美国索赔数据,计算成为当前吸烟者的概率。

方法

251643 人(12.8%)自我报告为“当前吸烟者”。使用具有自我报告吸烟状况的患者子集,使用正则化逻辑回归模型(CROSS)训练了 CROSS。CROSS 考虑了 53027 个候选协变量,包括在过去 365 天内记录的人口统计学数据和疾病/药物/测量/程序/观察结果。CROSS 表型模型在多个其他索赔数据中进行了验证。

结果

内部验证表明,CROSS 模型的接受者操作特征曲线(AUC)为 0.76,校准图表明其校准良好。在三个美国索赔数据库的外部验证中,获得的 AUC 范围在 0.82 到 0.87 之间,表明该模型似乎可以在索赔数据中进行传输。

结论

CROSS 根据前一年的索赔记录预测当前吸烟状况。CROSS 可以轻松地实现到任何映射到 OMOP 通用数据模型的美国保险索赔,并且在进行流行病学研究时,这将是一种有用的方法,因为吸烟是已知的混杂因素,但未记录吸烟状况。CROSS 可从 https://github.com/OHDSI/StudyProtocolSandbox/tree/master/SmokingModel 获得。

相似文献

1
Supplementing claims data analysis using self-reported data to develop a probabilistic phenotype model for current smoking status.利用自我报告数据补充索赔数据分析,以开发当前吸烟状况的概率表型模型。
J Biomed Inform. 2019 Sep;97:103264. doi: 10.1016/j.jbi.2019.103264. Epub 2019 Aug 3.
2
Identifying the DEAD: Development and Validation of a Patient-Level Model to Predict Death Status in Population-Level Claims Data.识别死亡患者:基于人群水平理赔数据建立预测患者死亡状态的个体水平预测模型的开发和验证。
Drug Saf. 2019 Nov;42(11):1377-1386. doi: 10.1007/s40264-019-00827-0.
3
Inferring disease severity in rheumatoid arthritis using predictive modeling in administrative claims databases.利用行政索赔数据库中的预测模型推断类风湿关节炎的疾病严重程度。
PLoS One. 2019 Dec 18;14(12):e0226255. doi: 10.1371/journal.pone.0226255. eCollection 2019.
4
Using Machine Learning Applied to Real-World Healthcare Data for Predictive Analytics: An Applied Example in Bariatric Surgery.运用机器学习分析真实世界医疗数据进行预测分析:减重手术中的应用实例。
Value Health. 2019 May;22(5):580-586. doi: 10.1016/j.jval.2019.01.011.
5
Identification of smoking using Medicare data--a validation study of claims-based algorithms.利用医疗保险数据识别吸烟情况——基于索赔算法的验证研究
Pharmacoepidemiol Drug Saf. 2016 Apr;25(4):472-5. doi: 10.1002/pds.3953. Epub 2016 Jan 13.
6
Electronic health record phenotyping improves detection and screening of type 2 diabetes in the general United States population: A cross-sectional, unselected, retrospective study.电子健康记录表型分析改善了美国普通人群中2型糖尿病的检测和筛查:一项横断面、非选择性、回顾性研究。
J Biomed Inform. 2016 Apr;60:162-8. doi: 10.1016/j.jbi.2015.12.006. Epub 2015 Dec 17.
7
Predicting Disability among Community-Dwelling Medicare Beneficiaries Using Claims-Based Indicators.使用基于理赔数据的指标预测社区医保受益人的残疾情况
Health Serv Res. 2016 Feb;51(1):262-81. doi: 10.1111/1475-6773.12316. Epub 2015 May 25.
8
The impact of standardizing the definition of visits on the consistency of multi-database observational health research.规范就诊定义对多数据库观察性健康研究一致性的影响。
BMC Med Res Methodol. 2015 Mar 8;15:13. doi: 10.1186/s12874-015-0001-6.
9
Evaluating the impact of unmeasured confounding with internal validation data: an example cost evaluation in type 2 diabetes.利用内部验证数据评估未测量混杂的影响:2 型糖尿病成本评估实例。
Value Health. 2013 Mar-Apr;16(2):259-66. doi: 10.1016/j.jval.2012.10.012. Epub 2013 Jan 23.
10
Validation of body mass index (BMI)-related ICD-9-CM and ICD-10-CM administrative diagnosis codes recorded in US claims data.验证美国索赔数据中与体重指数(BMI)相关的 ICD-9-CM 和 ICD-10-CM 行政诊断代码。
Pharmacoepidemiol Drug Saf. 2018 Oct;27(10):1092-1100. doi: 10.1002/pds.4617. Epub 2018 Jul 12.

引用本文的文献

1
The validity of electronic health data for measuring smoking status: a systematic review and meta-analysis.电子健康数据测量吸烟状况的有效性:系统评价和荟萃分析。
BMC Med Inform Decis Mak. 2024 Feb 2;24(1):33. doi: 10.1186/s12911-024-02416-3.
2
Deep phenotyping: Embracing complexity and temporality-Towards scalability, portability, and interoperability.深度表型分析:拥抱复杂性和时间性——迈向可扩展性、便携性和互操作性。
J Biomed Inform. 2020 May;105:103433. doi: 10.1016/j.jbi.2020.103433. Epub 2020 Apr 23.