• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从流行病学研究的关联行政数据中产生的偏倚:从注册到分析的概念框架。

Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses.

机构信息

MRC/CSO Social and Public Health Sciences Unit, University of Glasgow, Berkeley Square, 99 Berkeley Street, Glasgow, G3 7HR, UK.

UCL Great Ormond Street Institute of Child Health, UCL, London, UK.

出版信息

Eur J Epidemiol. 2022 Dec;37(12):1215-1224. doi: 10.1007/s10654-022-00934-w. Epub 2022 Nov 5.

DOI:10.1007/s10654-022-00934-w
PMID:36333542
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9792414/
Abstract

Linked administrative data offer a rich source of information that can be harnessed to describe patterns of disease, understand their causes and evaluate interventions. However, administrative data are primarily collected for operational reasons such as recording vital events for legal purposes, and planning, provision and monitoring of services. The processes involved in generating and linking administrative datasets may generate sources of bias that are often not adequately considered by researchers. We provide a framework describing these biases, drawing on our experiences of using the 100 Million Brazilian Cohort (100MCohort) which contains records of more than 131 million people whose families applied for social assistance between 2001 and 2018. Datasets for epidemiological research were derived by linking the 100MCohort to health-related databases such as the Mortality Information System and the Hospital Information System. Using the framework, we demonstrate how selection and misclassification biases may be introduced in three different stages: registering and recording of people's life events and use of services, linkage across administrative databases, and cleaning and coding of variables from derived datasets. Finally, we suggest eight recommendations which may reduce biases when analysing data from administrative sources.

摘要

关联行政数据提供了丰富的信息来源,可以用来描述疾病模式,了解其病因,并评估干预措施。然而,行政数据主要是出于操作目的而收集的,例如为法律目的记录重要事件,以及规划、提供和监测服务。在生成和关联行政数据集的过程中,可能会产生研究人员通常没有充分考虑的偏差来源。我们提供了一个描述这些偏差的框架,借鉴了我们使用 1 亿巴西队列(100MCohort)的经验,该队列包含了超过 1.31 亿人的记录,他们的家庭在 2001 年至 2018 年期间申请了社会援助。用于流行病学研究的数据集是通过将 100MCohort 与健康相关的数据库(如死亡率信息系统和医院信息系统)链接而得出的。使用该框架,我们展示了选择和分类错误偏差可能在三个不同阶段引入:记录和记录人们的生命事件和服务的使用,跨行政数据库的链接,以及从派生数据集中清理和编码变量。最后,我们提出了八项建议,这些建议可能会减少分析行政来源数据时的偏差。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e268/9792414/18ffa85731d0/10654_2022_934_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e268/9792414/03186d6b541e/10654_2022_934_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e268/9792414/18ffa85731d0/10654_2022_934_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e268/9792414/03186d6b541e/10654_2022_934_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e268/9792414/18ffa85731d0/10654_2022_934_Fig2_HTML.jpg

相似文献

1
Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses.从流行病学研究的关联行政数据中产生的偏倚:从注册到分析的概念框架。
Eur J Epidemiol. 2022 Dec;37(12):1215-1224. doi: 10.1007/s10654-022-00934-w. Epub 2022 Nov 5.
2
Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort.利用全国性巴西行政数据库检查记录链接过程的质量,以建立一个大型出生队列。
BMC Med Inform Decis Mak. 2020 Jul 25;20(1):173. doi: 10.1186/s12911-020-01192-0.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Describing the linkage between administrative social assistance and health care databases in Ontario, Canada.描述加拿大安大略省行政社会援助数据库与医疗保健数据库之间的联系。
Int J Popul Data Sci. 2022 Mar 3;7(1):1689. doi: 10.23889/ijpds.v6i1.1689. eCollection 2022.
5
Evaluation of record linkage of two large administrative databases in a middle income country: stillbirths and notifications of dengue during pregnancy in Brazil.中等收入国家两个大型行政数据库的记录关联评估:巴西的死产与孕期登革热通报情况
BMC Med Inform Decis Mak. 2017 Jul 17;17(1):108. doi: 10.1186/s12911-017-0506-5.
6
GUILD: GUidance for Information about Linking Data sets.GUILD:数据集链接信息指南。
J Public Health (Oxf). 2018 Mar 1;40(1):191-198. doi: 10.1093/pubmed/fdx037.
7
Utilising identifier error variation in linkage of large administrative data sources.利用大型行政数据源链接中的标识符错误变异。
BMC Med Res Methodol. 2017 Feb 7;17(1):23. doi: 10.1186/s12874-017-0306-8.
8
Linking education and hospital data in England: linkage process and quality.链接英格兰的教育和医院数据:链接过程和质量。
Int J Popul Data Sci. 2021 Sep 16;6(1):1671. doi: 10.23889/ijpds.v6i1.1671. eCollection 2021.
9
Challenges in administrative data linkage for research.研究中行政数据链接的挑战。
Big Data Soc. 2017 Dec 5;4(2):2053951717745678. doi: 10.1177/2053951717745678.
10
Validating linkage of multiple population-based administrative databases in Brazil.验证巴西多个基于人群的行政数据库的关联性。
PLoS One. 2019 Mar 28;14(3):e0214050. doi: 10.1371/journal.pone.0214050. eCollection 2019.

引用本文的文献

1
Validity of administrative health data case definitions for identifying polycystic ovary syndrome: a systematic review and meta-analysis.用于识别多囊卵巢综合征的行政健康数据病例定义的有效性:一项系统评价和荟萃分析。
Hum Reprod. 2025 Aug 1;40(8):1579-1586. doi: 10.1093/humrep/deaf094.
2
Occupational differences in COVID-19 hospital admission and mortality risks between women and men in Scotland: a population-based study using linked administrative data.苏格兰男女在新冠病毒肺炎住院及死亡风险方面的职业差异:一项基于人群的使用关联行政数据的研究
Occup Environ Med. 2025 May 18;82(3):128-137. doi: 10.1136/oemed-2024-109562.
3

本文引用的文献

1
Data linkage in medical research.医学研究中的数据关联
BMJ Med. 2022 Mar 2;1(1):e000087. doi: 10.1136/bmjmed-2021-000087. eCollection 2022.
2
Strategies to record and use ethnicity information in routine health data.在常规健康数据中记录和使用种族信息的策略。
Nat Med. 2022 Jul;28(7):1338-1342. doi: 10.1038/s41591-022-01842-y.
3
Demographic Variation in Health Insurance Coverage:United States, 2020.2020年美国医疗保险覆盖情况的人口统计学差异
Associations between different measures of SARS-CoV-2 infection status and subsequent economic inactivity: A pooled analysis of five longitudinal surveys linked to healthcare records.
严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染状况的不同衡量指标与随后的经济不活动之间的关联:一项与医疗记录相关的五项纵向调查的汇总分析。
PLoS One. 2025 Apr 9;20(4):e0321201. doi: 10.1371/journal.pone.0321201. eCollection 2025.
4
A maternal and child health administrative cohort in Scotland: the utility of linked administrative data for understanding early years' outcomes and inequalities.苏格兰的一个母婴健康管理队列:关联管理数据在理解早期结果和不平等方面的效用。
Int J Popul Data Sci. 2024 Nov 27;9(2):2402. doi: 10.23889/ijpds.v9i2.2402. eCollection 2024.
5
Impact of Primary Health Care Data Quality on Infectious Disease Surveillance in Brazil: Case Study.巴西初级卫生保健数据质量对传染病监测的影响:案例研究
JMIR Public Health Surveill. 2025 Feb 21;11:e67050. doi: 10.2196/67050.
6
Examining the Potential Mediating Role of Maternal Mental Health in the Association Between Socioeconomic Deprivation and Child Development Outcomes.探讨母亲心理健康在社会经济剥夺与儿童发展结果之间关联中的潜在中介作用。
Matern Child Health J. 2025 Mar;29(3):338-348. doi: 10.1007/s10995-025-04050-5. Epub 2025 Feb 7.
7
Epidemiological methods in transition: Minimizing biases in classical and digital approaches.转型中的流行病学方法:最大限度减少传统方法和数字方法中的偏差。
PLOS Digit Health. 2025 Jan 13;4(1):e0000670. doi: 10.1371/journal.pdig.0000670. eCollection 2025 Jan.
8
Leveraging Administrative Health Databases to Address Health Challenges in Farming Populations: Scoping Review and Bibliometric Analysis (1975-2024).利用行政健康数据库应对农业人口的健康挑战:范围综述与文献计量分析(1975 - 2024年)
JMIR Public Health Surveill. 2025 Jan 9;11:e62939. doi: 10.2196/62939.
9
Child welfare worker perspectives on documentation and case recording practices in Canada: A mixed-methods study protocol.加拿大儿童福利工作者对文件记录和案例记录实践的看法:一项混合方法研究方案。
PLoS One. 2025 Jan 7;20(1):e0316238. doi: 10.1371/journal.pone.0316238. eCollection 2025.
10
Tackling algorithmic bias and promoting transparency in health datasets: the STANDING Together consensus recommendations.应对健康数据集中的算法偏差并提高透明度:“携手同行”共识建议
Lancet Digit Health. 2025 Jan;7(1):e64-e88. doi: 10.1016/S2589-7500(24)00224-3. Epub 2024 Dec 18.
Natl Health Stat Report. 2022 Feb(169):1-15.
4
Cohort Profile: The 100 Million Brazilian Cohort.队列简介:巴西一亿人队列。
Int J Epidemiol. 2022 May 9;51(2):e27-e38. doi: 10.1093/ije/dyab213.
5
Conditional cash transfer program and child mortality: A cross-sectional analysis nested within the 100 Million Brazilian Cohort.有条件现金转移计划与儿童死亡率:1 亿巴西队列研究中的嵌套横断面分析。
PLoS Med. 2021 Sep 28;18(9):e1003509. doi: 10.1371/journal.pmed.1003509. eCollection 2021 Sep.
6
The Centre for Data and Knowledge Integration for Health (CIDACS): Linking Health and Social Data in Brazil.巴西健康数据与知识整合中心(CIDACS):连接巴西的健康与社会数据
Int J Popul Data Sci. 2019 Nov 20;4(2):1140. doi: 10.23889/ijpds.v4i2.1140.
7
Evaluation measure for group-based record linkage.基于群组的记录链接的评估指标。
Int J Popul Data Sci. 2019 Nov 29;4(1):1127. doi: 10.23889/ijpds.v4i1.1127.
8
Cohort Profile: Early Pandemic Evaluation and Enhanced Surveillance of COVID-19 (EAVE II) Database.队列简介:新冠疫情早期评估与强化监测(EAVE II)数据库
Int J Epidemiol. 2021 Aug 30;50(4):1064-1074. doi: 10.1093/ije/dyab028.
9
Ethnic bias in data linkage.数据关联中的种族偏见。
Lancet Digit Health. 2021 Jun;3(6):e339. doi: 10.1016/S2589-7500(21)00081-9.
10
The challenges and opportunities of mental health data sharing in the UK.英国心理健康数据共享的挑战与机遇。
Lancet Digit Health. 2021 Jun;3(6):e333-e336. doi: 10.1016/S2589-7500(21)00078-9.