• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

超越人行横道:职业流行病学中文本自由格式工作描述自动编码后暴露评估的可靠性

Beyond crosswalks: reliability of exposure assessment following automated coding of free-text job descriptions for occupational epidemiology.

作者信息

Burstyn Igor, Slutsky Anton, Lee Derrick G, Singer Alison B, An Yuan, Michael Yvonne L

机构信息

1. Department of Environmental and Occupational Health, School of Public Health, Drexel University, Philadelphia, PA, USA.

出版信息

Ann Occup Hyg. 2014 May;58(4):482-92. doi: 10.1093/annhyg/meu006. Epub 2014 Feb 6.

DOI:10.1093/annhyg/meu006
PMID:24504175
Abstract

Epidemiologists typically collect narrative descriptions of occupational histories because these are less prone than self-reported exposures to recall bias of exposure to a specific hazard. However, the task of coding these narratives can be daunting and prohibitively time-consuming in some settings. The aim of this manuscript is to evaluate the performance of a computer algorithm to translate the narrative description of occupational codes into standard classification of jobs (2010 Standard Occupational Classification) in an epidemiological context. The fundamental question we address is whether exposure assignment resulting from manual (presumed gold standard) coding of the narratives is materially different from that arising from the application of automated coding. We pursued our work through three motivating examples: assessment of physical demands in Women's Health Initiative observational study, evaluation of predictors of exposure to coal tar pitch volatiles in the US Occupational Safety and Health Administration's (OSHA) Integrated Management Information System, and assessment of exposure to agents known to cause occupational asthma in a pregnancy cohort. In these diverse settings, we demonstrate that automated coding of occupations results in assignment of exposures that are in reasonable agreement with results that can be obtained through manual coding. The correlation between physical demand scores based on manual and automated job classification schemes was reasonable (r = 0.5). The agreement between predictive probability of exceeding the OSHA's permissible exposure level for polycyclic aromatic hydrocarbons, using coal tar pitch volatiles as a surrogate, based on manual and automated coding of jobs was modest (Kendall rank correlation = 0.29). In the case of binary assignment of exposure to asthmagens, we observed that fair to excellent agreement in classifications can be reached, depending on presence of ambiguity in assigned job classification (κ = 0.5-0.8). Thus, the success of automated coding appears to depend on the setting and type of exposure that is being assessed. Our overall recommendation is that automated translation of short narrative descriptions of jobs for exposure assessment is feasible in some settings and essential for large cohorts, especially if combined with manual coding to both assess reliability of coding and to further refine the coding algorithm.

摘要

流行病学家通常收集职业史的叙述性描述,因为与自我报告的暴露情况相比,这些描述较不易受到回忆特定危害暴露偏差的影响。然而,在某些情况下,对这些叙述进行编码的任务可能令人生畏且耗时过长。本手稿的目的是评估一种计算机算法在流行病学背景下将职业代码的叙述性描述转换为标准职业分类(2010年标准职业分类)的性能。我们要解决的基本问题是,对叙述进行人工编码(假定为金标准)所产生的暴露赋值与应用自动编码所产生的暴露赋值是否存在实质性差异。我们通过三个具有启发性的例子开展工作:在妇女健康倡议观察性研究中评估体力需求、在美国职业安全与健康管理局(OSHA)的综合管理信息系统中评估接触煤焦油沥青挥发物的预测因素,以及在一个妊娠队列中评估接触已知会导致职业性哮喘的物质的情况。在这些不同的场景中,我们证明职业的自动编码所产生的暴露赋值与通过人工编码获得的结果合理一致。基于人工和自动职业分类方案的体力需求得分之间的相关性合理(r = 0.5)。基于工作的人工和自动编码,以煤焦油沥青挥发物为替代物,超过OSHA多环芳烃允许暴露水平的预测概率之间的一致性一般(肯德尔等级相关 = 0.29)。在哮喘原暴露的二元赋值情况下,我们观察到根据所分配职业分类中是否存在模糊性,分类的一致性可达一般到良好(κ = 0.5 - 0.8)。因此,自动编码的成功似乎取决于所评估的暴露场景和类型。我们的总体建议是,在某些情况下,对用于暴露评估的简短工作叙述性描述进行自动翻译是可行的,对于大型队列来说是必不可少的,特别是如果与人工编码相结合,既能评估编码的可靠性,又能进一步完善编码算法。

相似文献

1
Beyond crosswalks: reliability of exposure assessment following automated coding of free-text job descriptions for occupational epidemiology.超越人行横道:职业流行病学中文本自由格式工作描述自动编码后暴露评估的可靠性
Ann Occup Hyg. 2014 May;58(4):482-92. doi: 10.1093/annhyg/meu006. Epub 2014 Feb 6.
2
JEMs and incompatible occupational coding systems: effect of manual and automatic recoding of job codes on exposure assignment.职业暴露监测系统(JEMs)与不兼容的职业编码系统:工作代码的手动和自动重新编码对暴露赋值的影响。
Ann Occup Hyg. 2013 Jan;57(1):107-14. doi: 10.1093/annhyg/mes046. Epub 2012 Jul 17.
3
Computer-based coding of free-text job descriptions to efficiently identify occupations in epidemiological studies.基于计算机的自由文本职位描述编码,以在流行病学研究中高效识别职业。
Occup Environ Med. 2016 Jun;73(6):417-24. doi: 10.1136/oemed-2015-103152. Epub 2016 Apr 21.
4
Statistical Modeling of Occupational Exposure to Polycyclic Aromatic Hydrocarbons Using OSHA Data.使用职业安全与健康管理局(OSHA)数据对多环芳烃职业暴露进行统计建模
J Occup Environ Hyg. 2015;12(10):729-42. doi: 10.1080/15459624.2015.1043049.
5
Evaluation of Automatically Assigned Job-Specific Interview Modules.自动分配的特定工作面试模块评估
Ann Occup Hyg. 2016 Aug;60(7):885-99. doi: 10.1093/annhyg/mew029. Epub 2016 Jun 1.
6
Automated Coding of Job Descriptions From a General Population Study: Overview of Existing Tools, Their Application and Comparison.从一般人群研究中自动编码工作描述:现有工具概述、应用及比较。
Ann Work Expo Health. 2023 Jun 6;67(5):663-672. doi: 10.1093/annweh/wxad002.
7
Development of a Coding and Crosswalk Tool for Occupations and Industries.职业和行业编码及转换工具的开发。
Ann Work Expo Health. 2018 Aug 13;62(7):796-807. doi: 10.1093/annweh/wxy052.
8
Occupational self-coding and automatic recording (OSCAR): a novel web-based tool to collect and code lifetime job histories in large population-based studies.职业自我编码与自动记录(OSCAR):一种用于在大型基于人群的研究中收集和编码终生工作经历的新型网络工具。
Scand J Work Environ Health. 2017 Mar 1;43(2):181-186. doi: 10.5271/sjweh.3613. Epub 2016 Dec 14.
9
Evaluation of the updated SOCcer v2 algorithm for coding free-text job descriptions in three epidemiologic studies.评估更新后的 SOCcer v2 算法在三项流行病学研究中对自由文本工作描述进行编码的效果。
Ann Work Expo Health. 2023 Jul 6;67(6):772-783. doi: 10.1093/annweh/wxad020.
10
Testing and Validating Semi-automated Approaches to the Occupational Exposure Assessment of Polycyclic Aromatic Hydrocarbons.测试和验证多环芳烃职业暴露评估的半自动方法。
Ann Work Expo Health. 2021 Jul 3;65(6):682-693. doi: 10.1093/annweh/wxab002.

引用本文的文献

1
OPERAS decision support system versus manual job coding: a quantitative analysis on coding time and inter-coder reliability.OPERAS决策支持系统与手工工作编码:编码时间和编码员间信度的定量分析
Occup Environ Med. 2025 Jul 9;82(4):183-190. doi: 10.1136/oemed-2024-109823.
2
Occupation classification model based on DistilKoBERT: using the 5th and 6th Korean Working Condition Surveys.基于DistilKoBERT的职业分类模型:使用韩国第五次和第六次工作条件调查
Ann Occup Environ Med. 2024 Aug 6;36:e19. doi: 10.35371/aoem.2024.36.e19. eCollection 2024.
3
Artificial intelligence exceeds humans in epidemiological job coding.
在流行病学工作编码方面,人工智能超越了人类。
Commun Med (Lond). 2023 Nov 4;3(1):160. doi: 10.1038/s43856-023-00397-4.
4
Occupational groups and lower urinary tract symptoms: A cross-sectional analysis of women in the Boston Area Community Health Study.职业群体与下尿路症状:波士顿地区社区健康研究女性的横断面分析。
Neurourol Urodyn. 2024 Jan;43(1):88-104. doi: 10.1002/nau.25292. Epub 2023 Oct 3.
5
Evaluation of the updated SOCcer v2 algorithm for coding free-text job descriptions in three epidemiologic studies.评估更新后的 SOCcer v2 算法在三项流行病学研究中对自由文本工作描述进行编码的效果。
Ann Work Expo Health. 2023 Jul 6;67(6):772-783. doi: 10.1093/annweh/wxad020.
6
Automated Coding of Job Descriptions From a General Population Study: Overview of Existing Tools, Their Application and Comparison.从一般人群研究中自动编码工作描述:现有工具概述、应用及比较。
Ann Work Expo Health. 2023 Jun 6;67(5):663-672. doi: 10.1093/annweh/wxad002.
7
Asbestos Exposure in Patients with Malignant Pleural Mesothelioma included in the PRIMATE Study, Lombardy, Italy.意大利伦巴第地区 PRIMATE 研究中恶性胸膜间皮瘤患者的石棉暴露情况。
Int J Environ Res Public Health. 2022 Mar 13;19(6):3390. doi: 10.3390/ijerph19063390.
8
Impact of Variability in Job Coding on Reliability in Exposure Estimates Obtained via a Job-Exposure Matrix.工作编码变异性对通过工作暴露矩阵获得的暴露估计可靠性的影响。
Ann Work Expo Health. 2022 Jun 6;66(5):551-562. doi: 10.1093/annweh/wxab106.
9
The relationship between work and mental health outcomes in Black men after serious injury.黑人男性在遭受重伤后,工作与心理健康结果之间的关系。
Injury. 2021 Apr;52(4):750-756. doi: 10.1016/j.injury.2021.02.021. Epub 2021 Feb 14.
10
Occupation Coding of Job Titles: Iterative Development of an Automated Coding Algorithm for the Canadian National Occupation Classification (ACA-NOC).职位名称的职业编码:加拿大国家职业分类(ACA-NOC)自动编码算法的迭代开发
JMIR Form Res. 2020 Aug 5;4(8):e16422. doi: 10.2196/16422.