Sammon Cormac J, Miller Anne, Mahtani Kamal R, Holt Tim A, McHugh Neil J, Luqmani Raashid A, Nightingale Alison L
Department of Pharmacy and Pharmacology, University of Bath, Claverton Down, Bath, BA2 7AY, United Kingdom.
Pharmacoepidemiol Drug Saf. 2015 May;24(5):504-9. doi: 10.1002/pds.3750. Epub 2015 Mar 10.
To investigate whether information from the literature could be used to identify periods of practice data in an electronic healthcare database during which rheumatoid factor (RF) test results are likely to be missing-not-at-random (MNAR).
RF tests recorded in the Clinical Practice Research Datalink (CPRD) were identified and defined as having a positive, negative or missing result. The proportion of positive test results was then calculated based on (i) complete-case analysis (ii) after restriction to tests from practice years with no missing test results and (iii) following multiple imputation of missing test results. The same three analyses were then carried out after excluding practice years with a proportion of positive tests incompatible with the missing completely at random (MCAR) assumption.
We identified 127,969 RF test records, 30.4% of which did not have an associated test result. Among tests with results available, 19% were positive. Both multiple imputation of the 38,867 missing test results and restriction of the study population to the 491 practice years with complete data had little impact on the percentage of positive tests. Following exclusion of the 544 practice years in which data were likely to be MNAR the percentage of positive tests in all analyses decreased to ~7%.
Recording of RF tests and RF test results in the CPRD is incomplete, with data likely to be MNAR in many practices. Exclusion of practice years with a high proportion of positive tests brought the distribution of positive tests in the study in line with the literature.
研究能否利用文献中的信息来识别电子医疗数据库中类风湿因子(RF)检测结果可能存在非随机缺失(MNAR)的执业数据时间段。
识别并确定临床实践研究数据链(CPRD)中记录的RF检测,将其定义为具有阳性、阴性或缺失结果。然后基于以下情况计算阳性检测结果的比例:(i)完全病例分析;(ii)限制为来自无检测结果缺失的执业年份的检测;(iii)对缺失检测结果进行多次插补之后。在排除阳性检测比例与完全随机缺失(MCAR)假设不相符的执业年份后,再进行相同的三项分析。
我们识别出127,969条RF检测记录,其中30.4%没有相关检测结果。在有可用结果的检测中,19%为阳性。对38,867条缺失检测结果进行多次插补以及将研究人群限制为有完整数据的491个执业年份,对阳性检测的百分比影响不大。在排除544个数据可能为MNAR的执业年份后,所有分析中阳性检测的百分比降至约7%。
CPRD中RF检测及RF检测结果的记录不完整,许多执业点的数据可能为MNAR。排除阳性检测比例高的执业年份后,研究中阳性检测的分布与文献相符。