• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

开发一种从英国电子初级保健记录中确定一生中吸烟状况和行为的算法。

Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records.

作者信息

Atkinson Mark D, Kennedy Jonathan I, John Ann, Lewis Keir E, Lyons Ronan A, Brophy Sinead T

机构信息

Farr Institute, Swansea University Medical School, Swansea, SA2 8PP, UK.

Prince Philip Hospital, Hywel Dda Health Board, Llanelli, UK.

出版信息

BMC Med Inform Decis Mak. 2017 Jan 5;17(1):2. doi: 10.1186/s12911-016-0400-6.

DOI:10.1186/s12911-016-0400-6
PMID:28056955
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5217540/
Abstract

BACKGROUND

Patients' smoking status is routinely collected by General Practitioners (GP) in UK primary health care. There is an abundance of Read codes pertaining to smoking, including those relating to smoking cessation therapy, prescription, and administration codes, in addition to the more regularly employed smoking status codes. Large databases of primary care data are increasingly used for epidemiological analysis; smoking status is an important covariate in many such analyses. However, the variable definition is rarely documented in the literature.

METHODS

The Secure Anonymised Information Linkage (SAIL) databank is a repository for a national collection of person-based anonymised health and socio-economic administrative data in Wales, UK. An exploration of GP smoking status data from the SAIL databank was carried out to explore the range of codes available and how they could be used in the identification of different categories of smokers, ex-smokers and never smokers. An algorithm was developed which addresses inconsistencies and changes in smoking status recording across the life course and compared with recorded smoking status as recorded in the Welsh Health Survey (WHS), 2013 and 2014 at individual level. However, the WHS could not be regarded as a "gold standard" for validation.

RESULTS

There were 6836 individuals in the linked dataset. Missing data were more common in GP records (6%) than in WHS (1.1%). Our algorithm assigns ex-smoker status to 34% of never-smokers, and detects 30% more smokers than are declared in the WHS data. When distinguishing between current smokers and non-smokers, the similarity between the WHS and GP data using the nearest date of comparison was κ = 0.78. When temporal conflicts had been accounted for, the similarity was κ = 0.64, showing the importance of addressing conflicts.

CONCLUSIONS

We present an algorithm for the identification of a patient's smoking status using GP self-reported data. We have included sufficient details to allow others to replicate this work, thus increasing the standards of documentation within this research area and assessment of smoking status in routine data.

摘要

背景

在英国初级医疗保健中,全科医生(GP)会定期收集患者的吸烟状况。除了更常用的吸烟状况代码外,还有大量与吸烟相关的瑞德编码,包括与戒烟治疗、处方和用药编码有关的内容。初级保健数据的大型数据库越来越多地用于流行病学分析;吸烟状况是许多此类分析中的一个重要协变量。然而,变量定义在文献中很少有记录。

方法

安全匿名信息链接(SAIL)数据库是英国威尔士一个基于个人的匿名健康和社会经济管理数据的国家集合库。对SAIL数据库中的全科医生吸烟状况数据进行了探索,以研究可用编码的范围以及如何将它们用于识别不同类别的吸烟者、已戒烟者和从不吸烟者。开发了一种算法,该算法解决了整个生命过程中吸烟状况记录的不一致性和变化问题,并在个体层面与2013年和2014年威尔士健康调查(WHS)中记录的吸烟状况进行了比较。然而,WHS不能被视为验证的“金标准”。

结果

链接数据集中有6836人。全科医生记录中的缺失数据(6%)比WHS(1.1%)中更常见。我们的算法将34%的从不吸烟者判定为已戒烟者,并且检测出的吸烟者比WHS数据中申报的多30%。在区分当前吸烟者和非吸烟者时,使用最近比较日期的WHS和全科医生数据之间的相似性为κ = 0.78。在考虑了时间冲突后,相似性为κ = 0.64,这表明解决冲突的重要性。

结论

我们提出了一种使用全科医生自我报告数据识别患者吸烟状况的算法。我们提供了足够的细节,以便其他人能够复制这项工作,从而提高该研究领域的文档标准以及常规数据中吸烟状况的评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f3d/5217540/13708671e1e4/12911_2016_400_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f3d/5217540/224da53363e0/12911_2016_400_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f3d/5217540/b4c04db2b846/12911_2016_400_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f3d/5217540/13708671e1e4/12911_2016_400_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f3d/5217540/224da53363e0/12911_2016_400_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f3d/5217540/b4c04db2b846/12911_2016_400_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f3d/5217540/13708671e1e4/12911_2016_400_Fig3_HTML.jpg

相似文献

1
Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records.开发一种从英国电子初级保健记录中确定一生中吸烟状况和行为的算法。
BMC Med Inform Decis Mak. 2017 Jan 5;17(1):2. doi: 10.1186/s12911-016-0400-6.
2
Case-finding for common mental disorders of anxiety and depression in primary care: an external validation of routinely collected data.基层医疗中焦虑和抑郁等常见精神障碍的病例发现:常规收集数据的外部验证
BMC Med Inform Decis Mak. 2016 Mar 15;16:35. doi: 10.1186/s12911-016-0274-7.
3
Smoker, ex-smoker or non-smoker? The validity of routinely recorded smoking status in UK primary care: a cross-sectional study.吸烟者、曾经吸烟者还是非吸烟者?英国初级医疗中常规记录的吸烟状况的有效性:一项横断面研究。
BMJ Open. 2014 Apr 23;4(4):e004958. doi: 10.1136/bmjopen-2014-004958.
4
Validating epilepsy diagnoses in routinely collected data.验证常规收集数据中的癫痫诊断。
Seizure. 2017 Nov;52:195-198. doi: 10.1016/j.seizure.2017.10.008. Epub 2017 Oct 13.
5
Methods to improve the quality of smoking records in a primary care EMR database: exploring multiple imputation and pattern-matching algorithms.改进初级保健 EMR 数据库中吸烟记录质量的方法:探索多种插补和模式匹配算法。
BMC Med Inform Decis Mak. 2020 Mar 14;20(1):56. doi: 10.1186/s12911-020-1068-5.
6
Measuring follow-up time in routinely-collected health datasets: Challenges and solutions.测量常规收集的健康数据集的随访时间:挑战与解决方案。
PLoS One. 2020 Feb 11;15(2):e0228545. doi: 10.1371/journal.pone.0228545. eCollection 2020.
7
Children and young people's body mass index measures derived from routine data sources: A national data linkage study in Wales.儿童和青少年的体重指数测量值源自常规数据源:威尔士的一项全国性数据链接研究。
PLoS One. 2024 May 10;19(5):e0300221. doi: 10.1371/journal.pone.0300221. eCollection 2024.
8
Socioeconomic variations in access to smoking cessation interventions in UK primary care: insights using the Mosaic classification in a large dataset of primary care records.英国初级保健中戒烟干预措施获取的社会经济差异:在大型初级保健记录数据集使用马赛克分类法的见解。
BMC Public Health. 2013 Jun 5;13:546. doi: 10.1186/1471-2458-13-546.
9
Use of varenicline and nicotine replacement therapy in people with and without general practitioner-recorded dementia: retrospective cohort study of routine electronic medical records.在有和没有全科医生记录的痴呆症患者中使用伐尼克兰和尼古丁替代疗法:对常规电子病历的回顾性队列研究。
BMJ Open. 2019 Aug 30;9(8):e027569. doi: 10.1136/bmjopen-2018-027569.
10
Cohort profile: HealthWise Wales. A research register and population health data platform with linkage to National Health Service data sets in Wales.队列资料简介:HealthWise Wales。一个研究注册和人群健康数据平台,与威尔士国民保健署数据集相链接。
BMJ Open. 2019 Dec 2;9(12):e031705. doi: 10.1136/bmjopen-2019-031705.

引用本文的文献

1
Mediation of socioeconomic inequalities in preterm birth. A cohort analysis of Welsh linked data.早产中社会经济不平等的中介作用。威尔士关联数据的队列分析。
Acta Obstet Gynecol Scand. 2025 Jun;104(6):1081-1091. doi: 10.1111/aogs.15101. Epub 2025 Apr 16.
2
Optimal Pathways to Lung Cancer Screening in Primary Care Settings: A Scoping Review.基层医疗环境中肺癌筛查的最佳途径:一项范围综述
Curr Oncol. 2024 Dec 26;32(1):8. doi: 10.3390/curroncol32010008.
3
The doctor will polygraph you now.医生现在要给你做测谎检查。

本文引用的文献

1
Validity of smoking prevalence estimates from primary care electronic health records compared with national population survey data for England, 2007 to 2011.2007年至2011年英格兰初级保健电子健康记录中吸烟流行率估计值与全国人口调查数据的有效性比较。
Pharmacoepidemiol Drug Saf. 2013 Dec;22(12):1357-61. doi: 10.1002/pds.3537.
2
Data resource profile: cardiovascular disease research using linked bespoke studies and electronic health records (CALIBER).数据资源简介:利用定制研究和电子健康记录进行心血管疾病研究(CALIBER)。
Int J Epidemiol. 2012 Dec;41(6):1625-38. doi: 10.1093/ije/dys188. Epub 2012 Dec 5.
3
The impact of the Quality and Outcomes Framework (QOF) on the recording of smoking targets in primary care medical records: cross-sectional analyses from The Health Improvement Network (THIN) database.
Npj Health Syst. 2024;1(1):1. doi: 10.1038/s44401-024-00001-4. Epub 2024 Dec 5.
4
Incidence of long-term conditions in the Latin American community of London: A validation and retrospective cohort study of 890,922 primary care records, 2005-2022.伦敦拉美裔人群中长期患病状况的发生率:2005-2022 年对 890922 份初级保健记录进行验证和回顾性队列研究。
PLoS One. 2024 Nov 27;19(11):e0312311. doi: 10.1371/journal.pone.0312311. eCollection 2024.
5
Digital Vocal Biomarker of Smoking Status Using Ecological Audio Recordings: Results from the Colive Voice Study.利用生态录音的吸烟状况数字语音生物标志物:Colive Voice研究结果
Digit Biomark. 2024 Aug 28;8(1):159-170. doi: 10.1159/000540327. eCollection 2024 Jan-Dec.
6
The doctor will polygraph you now: ethical concerns with AI for fact-checking patients.医生现在将对你进行测谎:人工智能用于核实患者情况的伦理问题。
ArXiv. 2024 Nov 11:arXiv:2408.07896v2.
7
The validity of electronic health data for measuring smoking status: a systematic review and meta-analysis.电子健康数据测量吸烟状况的有效性:系统评价和荟萃分析。
BMC Med Inform Decis Mak. 2024 Feb 2;24(1):33. doi: 10.1186/s12911-024-02416-3.
8
Assessing, updating and utilising primary care smoking records for lung cancer screening.评估、更新和利用初级保健吸烟记录进行肺癌筛查。
BMC Pulm Med. 2023 Nov 16;23(1):445. doi: 10.1186/s12890-023-02746-4.
9
Personalised lung cancer risk stratification and lung cancer screening: do general practice electronic medical records have a role?个体化肺癌风险分层与肺癌筛查:全科医疗电子病历是否发挥作用?
Br J Cancer. 2023 Dec;129(12):1968-1977. doi: 10.1038/s41416-023-02467-9. Epub 2023 Oct 25.
10
Body mass index and incident cardiometabolic conditions in relation to obesity-related cancer risk: A population-based cohort study in Catalonia, Spain.体质指数与心血管代谢疾病事件及肥胖相关癌症风险的关系:西班牙加泰罗尼亚地区的一项基于人群的队列研究。
Cancer Med. 2023 Oct;12(19):20188-20200. doi: 10.1002/cam4.6603. Epub 2023 Sep 28.
质量和结果框架(QOF)对初级保健医疗记录中吸烟目标记录的影响:来自健康改善网络(THIN)数据库的横断面分析。
BMC Public Health. 2012 Jul 10;12:329. doi: 10.1186/1471-2458-12-329.
4
Can primary care data be used to monitor regional smoking prevalence? An analysis of The Health Improvement Network primary care data.初级保健数据可用于监测地区吸烟流行率吗?对健康改进网络初级保健数据的分析。
BMC Public Health. 2011 Oct 7;11:773. doi: 10.1186/1471-2458-11-773.
5
Can data from primary care medical records be used to monitor national smoking prevalence?基层医疗保健记录中的数据可用于监测全国吸烟流行率吗?
J Epidemiol Community Health. 2012 Sep;66(9):791-5. doi: 10.1136/jech.2010.120154. Epub 2011 May 13.
6
Is smoking status routinely recorded when patients register with a new GP?当患者在新的全科医生处注册时,是否会例行记录吸烟状况?
Fam Pract. 2010 Dec;27(6):673-5. doi: 10.1093/fampra/cmq046. Epub 2010 Jul 11.
7
Validation of The Health Improvement Network (THIN) primary care database for monitoring prescriptions for smoking cessation medications.验证健康改进网络(THIN)初级保健数据库,以监测戒烟药物处方。
Pharmacoepidemiol Drug Saf. 2010 Jun;19(6):586-90. doi: 10.1002/pds.1960.
8
Trends in the epidemiology of smoking recorded in UK general practice.英国普通实践中记录的吸烟流行病学趋势。
Br J Gen Pract. 2010 Mar;60(572):e121-7. doi: 10.3399/bjgp10X483544.
9
Risk of ischemic stroke in patients with Crohn's disease: a population-based nested case-control study.克罗恩病患者缺血性卒中的风险:基于人群的巢式病例对照研究。
Inflamm Bowel Dis. 2010 Aug;16(8):1387-92. doi: 10.1002/ibd.21187.
10
The SAIL Databank: building a national architecture for e-health research and evaluation.SAIL数据库:构建电子健康研究与评估的国家架构。
BMC Health Serv Res. 2009 Sep 4;9:157. doi: 10.1186/1472-6963-9-157.