• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

比较校正基于行政数据库诊断代码的分类偏倚的方法。

A comparison of methods to correct for misclassification bias from administrative database diagnostic codes.

机构信息

Departments of Medicine and Epidemiology & Community Medicine, University of Ottawa, ASB1-003 1053, Carling Ave, Ottawa ON, K1Y 4E9, Canada.

出版信息

Int J Epidemiol. 2018 Apr 1;47(2):605-616. doi: 10.1093/ije/dyx253.

DOI:10.1093/ije/dyx253
PMID:29253160
Abstract

BACKGROUND

In administrative database research, misclassification bias can result from diagnostic codes that imperfectly represent the condition being studied. It is unclear how to correct for this bias.

METHODS

Severe renal failure and Colles' fracture status were determined in two distinct cohorts using gold standard methods. True disease prevalence and disease association with other covariables were measured and compared with results when disease status was determined using diagnostic codes. Differences ('misclassification bias') were then adjusted for using two methods: quantitative bias analysis (QBA) with bias parameters (code sensitivity and specificity) of varying accuracy; and disease status imputation using bootstrap methods and disease probability models.

RESULTS

Prevalences of severe renal failure (n = 50 074) and Colles' fracture (n = 5680) were 7.5% and 37.0%, respectively. Compared with true values, important bias resulted when diagnostic codes were used to measure disease prevalence and disease-covariable associations. QBA increased bias when population-based (vs strata-specific) bias parameters were used. QBA's ability to account for misclassification bias was most dependent upon deviations in code specificity. Bootstrap imputation accounted for misclassification bias, but this depended on disease model calibration.

CONCLUSIONS

Extensive bias can result from using inaccurate diagnostic codes to determine disease status. This bias can be addressed with QBA using accurate bias parameter measures, or by bootstrap imputation using well-calibrated disease prediction models.

摘要

背景

在行政数据库研究中,诊断代码可能无法完美地代表所研究的疾病,从而导致分类错误偏倚。目前尚不清楚如何纠正这种偏倚。

方法

使用金标准方法在两个不同队列中确定严重肾衰竭和科雷氏骨折的状态。通过诊断代码确定疾病状态时,测量并比较真实疾病流行率和疾病与其他协变量的相关性。然后使用两种方法对差异(“分类错误偏倚”)进行调整:使用具有不同准确性的偏置参数(代码灵敏度和特异性)的定量偏倚分析(QBA);以及使用 bootstrap 方法和疾病概率模型进行疾病状态推断。

结果

严重肾衰竭(n=50074)和科雷氏骨折(n=5680)的患病率分别为 7.5%和 37.0%。与真实值相比,使用诊断代码来衡量疾病流行率和疾病与协变量的关联会产生重要的偏倚。当使用基于人群的(而非分层特异性)偏置参数时,QBA 会增加偏倚。QBA 纠正分类错误偏倚的能力主要取决于代码特异性的偏差。bootstrap 推断可以纠正分类错误偏倚,但这取决于疾病模型的校准。

结论

使用不准确的诊断代码来确定疾病状态可能会导致严重的偏倚。可以使用准确的偏置参数测量值通过 QBA 来解决这种偏倚,也可以使用经过良好校准的疾病预测模型进行 bootstrap 推断。

相似文献

1
A comparison of methods to correct for misclassification bias from administrative database diagnostic codes.比较校正基于行政数据库诊断代码的分类偏倚的方法。
Int J Epidemiol. 2018 Apr 1;47(2):605-616. doi: 10.1093/ije/dyx253.
2
Improved Correction of Misclassification Bias With Bootstrap Imputation.Bootstrap 插补改进错误分类偏倚的校正。
Med Care. 2018 Jul;56(7):e39-e45. doi: 10.1097/MLR.0000000000000787.
3
Bootstrap imputation with a disease probability model minimized bias from misclassification due to administrative database codes.使用疾病概率模型进行Bootstrap插补可将因行政数据库编码导致的错误分类偏差降至最低。
J Clin Epidemiol. 2017 Apr;84:114-120. doi: 10.1016/j.jclinepi.2017.01.007. Epub 2017 Feb 4.
4
Bootstrap imputation minimized misclassification bias when measuring Colles' fracture prevalence and its associations using health administrative data.使用健康管理数据测量 Colles 骨折患病率及其相关性时,引导插补最小化了分类偏倚。
J Clin Epidemiol. 2018 Apr;96:93-100. doi: 10.1016/j.jclinepi.2017.12.012. Epub 2017 Dec 26.
5
Minimizing misclassification bias with a model to identify acetabular fractures using health administrative data: A cohort study.使用基于健康管理数据的模型最小化髋臼骨折分类偏倚:一项队列研究。
Medicine (Baltimore). 2021 Dec 30;100(52):e28223. doi: 10.1097/MD.0000000000028223.
6
Robustness of prevalence estimates derived from misclassified data from administrative databases.源自行政数据库错误分类数据的患病率估计值的稳健性。
Biometrics. 2007 Mar;63(1):272-9. doi: 10.1111/j.1541-0420.2006.00665.x.
7
Development of the multivariate administrative data cystectomy model and its impact on misclassification bias.多变量行政数据膀胱切除术模型的开发及其对分类偏倚的影响。
BMC Med Res Methodol. 2024 Mar 21;24(1):73. doi: 10.1186/s12874-024-02199-1.
8
Administrative database code accuracy did not vary notably with changes in disease prevalence.管理数据库代码的准确性并未随疾病流行率的变化而显著变化。
J Clin Epidemiol. 2016 Nov;79:86-89. doi: 10.1016/j.jclinepi.2016.05.009. Epub 2016 Jun 3.
9
Bias from outcome misclassification in immunization schedule safety research.免疫接种计划安全性研究中结局错误分类导致的偏倚。
Pharmacoepidemiol Drug Saf. 2018 Feb;27(2):221-228. doi: 10.1002/pds.4374. Epub 2018 Jan 2.
10
Correcting hazard ratio estimates for outcome misclassification using multiple imputation with internal validation data.使用带有内部验证数据的多重填补法校正结局误分类的风险比估计值。
Pharmacoepidemiol Drug Saf. 2017 Aug;26(8):925-934. doi: 10.1002/pds.4223. Epub 2017 May 15.

引用本文的文献

1
A Swiss Cross-Sectional Study on Patients' Perspectives on Chronic Pain Management, Analgesic Treatment, and Genetic Susceptibility.一项关于患者对慢性疼痛管理、镇痛治疗和遗传易感性看法的瑞士横断面研究。
J Pain Res. 2025 Sep 5;18:4575-4587. doi: 10.2147/JPR.S543460. eCollection 2025.
2
The AMerican PREGNANcy Mother-Child CohorT: description and prevalence of baseline outcomes and medication dispensing.美国母婴队列研究:基线结局及药物配给情况的描述与患病率
Front Pharmacol. 2025 Aug 5;16:1608403. doi: 10.3389/fphar.2025.1608403. eCollection 2025.
3
Validation of Codes to Distinguish Between Claudication and Chronic Limb-Threatening Ischemia in Patients Undergoing Peripheral Vascular Intervention Using Medicare-Matched Registry Data.
利用医疗保险匹配登记数据验证区分接受外周血管介入治疗患者间歇性跛行和慢性肢体威胁性缺血的编码
Circ Cardiovasc Qual Outcomes. 2025 Jul;18(7):e011467. doi: 10.1161/CIRCOUTCOMES.124.011467. Epub 2025 May 29.
4
Development of the multivariate administrative data cystectomy model and its impact on misclassification bias.多变量行政数据膀胱切除术模型的开发及其对分类偏倚的影响。
BMC Med Res Methodol. 2024 Mar 21;24(1):73. doi: 10.1186/s12874-024-02199-1.
5
Prediction of Venous Thromboembolism in Diverse Populations Using Machine Learning and Structured Electronic Health Records.利用机器学习和结构化电子健康记录预测不同人群中的静脉血栓栓塞症
Arterioscler Thromb Vasc Biol. 2024 Feb;44(2):491-504. doi: 10.1161/ATVBAHA.123.320331. Epub 2023 Dec 14.
6
Estimating the number of probable new SARS-CoV-2 infections among tested subjects from the number of confirmed cases.从确诊病例数估计检测对象中可能的新 SARS-CoV-2 感染人数。
BMC Med Res Methodol. 2023 Nov 17;23(1):272. doi: 10.1186/s12874-023-02077-2.
7
Identifying encephalopathy in patients admitted to an intensive care unit: Going beyond structured information using natural language processing.识别重症监护病房患者的脑病:利用自然语言处理超越结构化信息。
Front Digit Health. 2023 Jan 23;5:1085602. doi: 10.3389/fdgth.2023.1085602. eCollection 2023.
8
Developing a random forest algorithm to identify patent foramen ovale and atrial septal defects in Ontario administrative databases.开发随机森林算法以识别安大略省行政数据库中的卵圆孔未闭和房间隔缺损。
BMC Med Inform Decis Mak. 2022 Apr 6;22(1):93. doi: 10.1186/s12911-022-01837-2.
9
Common Combinations of Pregestational Diagnosis and Pregnancy Complications.孕前诊断与妊娠并发症的常见组合。
Cureus. 2021 Nov 3;13(11):e19239. doi: 10.7759/cureus.19239. eCollection 2021 Nov.
10
Spinal pain and major depression in a military cohort: bias analysis of dependent misclassification in electronic medical records.军人队列中脊柱疼痛与重度抑郁:电子病历中依赖型错误分类偏倚分析。
Soc Psychiatry Psychiatr Epidemiol. 2022 Mar;57(3):575-581. doi: 10.1007/s00127-021-02160-3. Epub 2021 Aug 10.