Suppr超能文献

一种用于文件链接以分析临终医疗费用的贝叶斯程序。

A Bayesian Procedure for File Linking to Analyze End-of-Life Medical Costs.

作者信息

Gutman Roee, Afendulis Christopher C, Zaslavsky Alan M

机构信息

Department of Biostatistics, Brown University, Providence, RI 02912.

出版信息

J Am Stat Assoc. 2013 Jan 1;108(501):34-47. doi: 10.1080/01621459.2012.726889.

Abstract

End-of-life medical expenses are a significant proportion of all health care expenditures. These costs were studied using costs of services from Medicare claims and cause of death (CoD) from death certificates. In the absence of a unique identifier linking the two datasets, common variables identified unique matches for only 33% of deaths. The remaining cases formed cells with multiple cases (32% in cells with an equal number of cases from each file and 35% in cells with an unequal number). We sampled from the joint posterior distribution of model parameters and the permutations that link cases from the two files within each cell. The linking models included the regression of location of death on CoD and other parameters, and the regression of cost measures with a monotone missing data pattern on CoD and other demographic characteristics. Permutations were sampled by enumerating the exact distribution for small cells and by the Metropolis algorithm for large cells. Sparse matrix data structures enabled efficient calculations despite the large dataset (≈1.7 million cases). The procedure generates datasets in which the matches between the two files are imputed. The datasets can be analyzed independently and results combined using Rubin's multiple imputation rules. Our approach can be applied in other file linking applications.

摘要

临终医疗费用在所有医疗保健支出中占很大比例。这些费用通过医疗保险理赔的服务成本和死亡证明上的死因(CoD)进行研究。由于缺乏将这两个数据集联系起来的唯一标识符,共同变量仅为33%的死亡病例找到了唯一匹配项。其余病例形成了包含多个病例的单元格(每个文件病例数相等的单元格中占32%,病例数不相等的单元格中占35%)。我们从模型参数的联合后验分布以及每个单元格中链接两个文件病例的排列中进行抽样。链接模型包括死亡地点对死因及其他参数的回归,以及具有单调缺失数据模式的成本度量对死因及其他人口统计学特征的回归。通过枚举小单元格的精确分布和对大单元格使用Metropolis算法对排列进行抽样。尽管数据集很大(约170万个病例),稀疏矩阵数据结构仍能实现高效计算。该程序生成两个文件之间匹配项被插补的数据集。这些数据集可以独立分析,并使用鲁宾多重插补规则合并结果。我们的方法可应用于其他文件链接应用程序。

相似文献

4
Bayesian record linkage with variables in one file.含单个文件中变量的贝叶斯记录链接
Stat Med. 2023 Nov 30;42(27):4931-4951. doi: 10.1002/sim.9894. Epub 2023 Aug 31.

引用本文的文献

7
(Almost) all of entity resolution.(几乎)所有的实体解析。
Sci Adv. 2022 Mar 25;8(12):eabi8021. doi: 10.1126/sciadv.abi8021.

本文引用的文献

1
A Review of Hot Deck Imputation for Survey Non-response.调查无应答的热卡填充法综述
Int Stat Rev. 2010 Apr;78(1):40-64. doi: 10.1111/j.1751-5823.2010.00103.x.
2
Estimating log models: to transform or not to transform?估计对数模型:是否进行变换?
J Health Econ. 2001 Jul;20(4):461-94. doi: 10.1016/s0167-6296(01)00086-8.
4
Health care expenditure in the last months of life.生命最后几个月的医疗保健支出。
J Health Econ. 2000 Sep;19(5):679-95. doi: 10.1016/s0167-6296(00)00039-4.
5
Trends in Medicare payments in the last year of life.临终前一年医疗保险支付情况的趋势。
N Engl J Med. 1993 Apr 15;328(15):1092-6. doi: 10.1056/NEJM199304153281506.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验