Hammill Bradley G, Hernandez Adrian F, Peterson Eric D, Fonarow Gregg C, Schulman Kevin A, Curtis Lesley H
Duke Clinical Research Institute, Duke University School of Medicine, Durham, NC 27715, USA.
Am Heart J. 2009 Jun;157(6):995-1000. doi: 10.1016/j.ahj.2009.04.002.
Inpatient clinical registries generally have limited ability to provide a longitudinal perspective on care beyond the acute episode. We present a method to link hospitalization records from registries with Medicare inpatient claims data, without using direct identifiers, to create a unique data source that pairs rich clinical data with long-term outcome data.
The method takes advantage of the hospital clustering observed in each database by demonstrating that different combinations of indirect identifiers within hospitals yield a large proportion of unique patient records. This high level of uniqueness also allows linking without advance knowledge of the Medicare provider number of each registry hospital. We applied this method to 2 inpatient databases and were able to identify 81% of 39,178 records in a large clinical registry of patients with heart failure and 91% of 6,581 heart failure records from a hospital inpatient database. The quality of the link is high, and reasons for incomplete linkage are explored. Finally, we discuss the unique opportunities afforded by combining claims and clinical data for specific analyses.
In the absence of direct identifiers, it is possible to create a high-quality link between inpatient clinical registry data and Medicare claims data. The method will allow researchers to use existing data to create a linked claims-clinical database that capitalizes on the strengths of both types of data sources.
住院临床登记系统通常在提供急性发作期以外护理的纵向视角方面能力有限。我们提出一种方法,在不使用直接标识符的情况下,将登记系统中的住院记录与医疗保险住院理赔数据相链接,以创建一个独特的数据源,将丰富的临床数据与长期结局数据配对。
该方法利用在每个数据库中观察到的医院聚类情况,证明医院内间接标识符的不同组合可产生很大比例的唯一患者记录。这种高度的唯一性还允许在不预先了解每个登记医院的医疗保险提供者编号的情况下进行链接。我们将此方法应用于两个住院数据库,在一个大型心力衰竭患者临床登记系统中,能够识别出39178条记录中的81%,在一个医院住院数据库中,能够识别出6581条心力衰竭记录中的91%。链接质量很高,并对链接不完整的原因进行了探讨。最后,我们讨论了将理赔数据和临床数据相结合进行特定分析所带来的独特机遇。
在没有直接标识符的情况下,有可能在住院临床登记数据和医疗保险理赔数据之间创建高质量的链接。该方法将使研究人员能够利用现有数据创建一个链接的理赔 - 临床数据库,充分利用这两种数据源的优势。