Health indicator recording in UK primary care electronic health records: key implications for handling missing data.

Authors

Petersen Irene, Welch Catherine A, Nazareth Irwin, Walters Kate, Marston Louise, Morris Richard W, Carpenter James R, Morris Tim P, Pham Tra My

Affiliations

Department of Primary Care and Population Health, University College London, London NW3 2PF, UK

Department of Clinical Epidemiology, Aarhus University, 8200 Aarhus N, Denmark

Publication information

Clin Epidemiol. 2019 Feb 11;11:157-167. doi: 10.2147/CLEP.S191437. eCollection 2019.

DOI: 10.2147/CLEP.S191437
PMID: 30809103
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6377050/
Abstract

BACKGROUND

Clinical databases are increasingly used for health research; many of them capture information on common health indicators including height, weight, blood pressure, cholesterol level, smoking status, and alcohol consumption. However, these are often not recorded on a regular basis; missing data are ubiquitous. We described the recording of health indicators in UK primary care and evaluated key implications for handling missing data.

METHODS

We examined the recording of health indicators in The Health Improvement Network (THIN) UK primary care database over time, by demographic variables (age and sex) and chronic diseases (diabetes, myocardial infarction, and stroke). Using weight as an example, we fitted linear and logistic regression models to examine the associations of weight measurements and the probability of having weight recorded with individuals' demographic characteristics and chronic diseases.
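As an illustration of the second model type mentioned above (a logistic regression for the probability of having weight recorded), the following is a minimal sketch, not the authors' code. The counts are hypothetical, the two covariates (female sex, diabetes) stand in for the fuller set of demographic and disease variables, and the fitting routine (`fit_grouped_logistic`, plain gradient ascent on grouped data) is an assumption chosen to keep the example dependency-free.

```python
import math

def fit_grouped_logistic(patterns, lr=1.0, epochs=5000):
    """Fit a logistic regression by gradient ascent on grouped binary data.

    patterns: list of (covariate_row, n_recorded, n_total), one entry per
    distinct covariate combination. Returns the coefficient list.
    """
    p = len(patterns[0][0])
    total = sum(n for _, _, n in patterns)
    beta = [0.0] * p
    for _ in range(epochs):
        grad = [0.0] * p
        for x, n_rec, n in patterns:
            linear = sum(b * v for b, v in zip(beta, x))
            prob = 1.0 / (1.0 + math.exp(-linear))
            # Score contribution: covariate * (observed - expected recorded)
            for j in range(p):
                grad[j] += x[j] * (n_rec - n * prob)
        beta = [b + lr * g / total for b, g in zip(beta, grad)]
    return beta

# Hypothetical counts (NOT from the paper): [intercept, female, diabetes],
# number with weight recorded, number of individuals.
patterns = [
    ([1, 0, 0], 60, 300),
    ([1, 1, 0], 100, 300),
    ([1, 0, 1], 150, 300),
    ([1, 1, 1], 200, 300),
]

beta = fit_grouped_logistic(patterns)
odds_ratio_female = math.exp(beta[1])
odds_ratio_diabetes = math.exp(beta[2])
print(round(odds_ratio_female, 3), round(odds_ratio_diabetes, 3))
```

With these made-up counts, the fitted odds ratios for having weight recorded come out at about 2 for women versus men and about 4 for people with versus without diabetes. In practice a statistics package would be used instead of a hand-written optimizer; the point is only that covariates predicting recording are also candidates for inclusion in an imputation model.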

RESULTS

In total, 6,345,851 individuals aged 18-99 years contributed data to THIN between 2000 and 2015. Women aged 18-65 years were more likely than men of the same age to have health indicators recorded; this gap narrowed after age 65. About 60-80% of individuals had their height, weight, blood pressure, smoking status, and alcohol consumption recorded during the first year of registration. In the years following registration, these proportions fell to 10%-40%. Individuals with chronic diseases were more likely to have health indicators recorded, particularly after the introduction of a General Practitioner incentive scheme. Individuals' demographic characteristics and chronic diseases were associated with both observed weight measurements and missingness in weight.

CONCLUSION

Missing data in common health indicators will affect statistical analysis in health research studies. A single analysis of primary care data using the available information alone may be misleading. Multiple imputation of missing values accounting for demographic characteristics and disease status is recommended but should be considered and implemented carefully. Sensitivity analysis exploring alternative assumptions for missing data should also be evaluated.
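To make the multiple imputation recommendation concrete, the sketch below shows the standard pooling step (Rubin's rules) that combines results from several imputed datasets into one estimate and standard error. It is illustrative only: the abstract does not describe the authors' implementation, the function name `pool_rubin` is hypothetical, and the input numbers are invented.

```python
from math import sqrt
from statistics import mean, variance

def pool_rubin(estimates, variances):
    """Pool per-imputation estimates and squared standard errors (Rubin's rules)."""
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two imputations with matching variances")
    q_bar = mean(estimates)      # pooled point estimate
    w = mean(variances)          # within-imputation variance
    b = variance(estimates)      # between-imputation variance (m - 1 denominator)
    t = w + (1 + 1 / m) * b      # total variance
    return q_bar, sqrt(t)

# Hypothetical regression coefficients from m = 5 imputed datasets
# (illustrative values, NOT results from the paper).
estimates = [1.0, 1.2, 0.8, 1.1, 0.9]
variances = [0.04, 0.04, 0.04, 0.04, 0.04]

pooled, se = pool_rubin(estimates, variances)
print(round(pooled, 3), round(se, 3))
```

Here the pooled standard error (about 0.265) is larger than the per-dataset standard error of 0.2, because the between-imputation spread carries the extra uncertainty due to the missing values. A sensitivity analysis of the kind the conclusion calls for would repeat the imputation under alternative missingness assumptions and compare the pooled results.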

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd1/6377050/712bd0a1f28c/clep-11-157Fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd1/6377050/a1437245993f/clep-11-157Fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd1/6377050/90f7f1a30efc/clep-11-157Fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd1/6377050/3ec0f86aa4f2/clep-11-157Fig4.jpg
