• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

小心灰熊人:使用手动编码和 NIOSH NIOCCS 机器学习算法比较基于工作和行业的噪声暴露估计。

Beware the Grizzlyman: A comparison of job- and industry-based noise exposure estimates using manual coding and the NIOSH NIOCCS machine learning algorithm.

机构信息

Cardno ChemRisk, Chicago, Illinois.

Department of Environmental Health Sciences, University of Michigan School of Public Health, Ann Arbor, Michigan.

出版信息

J Occup Environ Hyg. 2022 Jul;19(7):437-447. doi: 10.1080/15459624.2022.2076860. Epub 2022 Jun 7.

DOI:10.1080/15459624.2022.2076860
PMID:35537195
Abstract

Recently, the National Institute for Occupational Safety and Health (NIOSH) released an updated version of the NIOSH Industry and Occupation Computerized Coding System (NIOCCS), which uses supervised machine learning to assign industry and occupational codes based on provided free-text information. However, no efforts have been made to externally verify the quality of assigned industry and job titles when the algorithm is provided with inputs of varying quality. This study sought to evaluate whether the NIOCCS algorithm was sufficiently robust with low-quality inputs and how variable quality could impact subsequent job estimated exposures in a large job-exposure matrix for noise (NoiseJEM). Using free-text industry and job descriptions from >700,000 noise measurements in the NoiseJEM, three files were created and input into NIOCCS: (1) N1, "raw" industries and job titles; (2) N2, "refined" industries and "raw" job titles; and (3) N3, "refined" industries and job titles. Standardized industry and occupation codes were output by NIOCCS. Descriptive statistics of performance metrics (e.g., misclassification/discordance of occupation codes) were evaluated for each input relative to the original NoiseJEM dataset (N0). Across major Standardized Occupational Classifications (SOC), total discordance rates for N1, N2, and N3 compared to N0 were 53.6%, 42.3%, and 5.0%, respectively. The impact of discordance on the major SOC group varied and included both over- and under-estimates of average noise exposure compared to N0. N2 had the most accurate noise exposure estimates (i.e., smallest bias) across major SOC groups compared to N1 and N3. Further refinement of job titles in N3 showed little improvement. Some variation in classification efficacy was seen over time, particularly prior to 1985. Machine learning algorithms can systematically and consistently classify data but are highly dependent on the quality and amount of input data. The greatest benefit for an end-user may come from cleaning industry information before applying this method for job classification. Our results highlight the need for standardized classification methods that remain constant over time.

摘要

最近,美国职业安全与健康研究所(NIOSH)发布了 NIOSH 行业和职业计算机编码系统(NIOCCS)的更新版本,该系统使用有监督的机器学习,根据提供的自由文本信息分配行业和职业代码。然而,当算法提供输入质量不同时,没有努力对外验证分配的行业和职位的质量。本研究旨在评估 NIOCCS 算法在低质量输入时是否足够稳健,以及可变质量如何影响噪声大型职业暴露矩阵(NoiseJEM)中的后续职业估计暴露。使用 NoiseJEM 中超过 70 万次噪声测量的自由文本行业和工作描述,创建了三个文件并输入到 NIOCCS 中:(1)N1,“原始”行业和工作标题;(2)N2,“精炼”行业和“原始”工作标题;(3)N3,“精炼”行业和工作标题。NIOSCCS 输出标准化的行业和职业代码。相对于原始 NoiseJEM 数据集(N0),评估了每个输入的性能指标(例如职业代码的分类错误/不一致)的描述性统计数据。在主要标准职业分类(SOC)中,与 N0 相比,N1、N2 和 N3 的总不一致率分别为 53.6%、42.3%和 5.0%。不一致对主要 SOC 群体的影响各不相同,包括与 N0 相比,噪声暴露的高估和低估。与 N1 和 N3 相比,N2 在主要 SOC 群体中具有最准确的噪声暴露估计值(即最小偏差)。在 N3 中进一步细化工作标题几乎没有改善。随着时间的推移,分类效果存在一些变化,尤其是在 1985 年之前。机器学习算法可以系统地、一致地对数据进行分类,但高度依赖输入数据的质量和数量。对于最终用户来说,最大的好处可能是在应用这种方法进行工作分类之前清理行业信息。我们的结果强调了标准化分类方法的必要性,这些方法应随着时间的推移保持不变。

相似文献

1
Beware the Grizzlyman: A comparison of job- and industry-based noise exposure estimates using manual coding and the NIOSH NIOCCS machine learning algorithm.小心灰熊人:使用手动编码和 NIOSH NIOCCS 机器学习算法比较基于工作和行业的噪声暴露估计。
J Occup Environ Hyg. 2022 Jul;19(7):437-447. doi: 10.1080/15459624.2022.2076860. Epub 2022 Jun 7.
2
Computer-based coding of free-text job descriptions to efficiently identify occupations in epidemiological studies.基于计算机的自由文本职位描述编码,以在流行病学研究中高效识别职业。
Occup Environ Med. 2016 Jun;73(6):417-24. doi: 10.1136/oemed-2015-103152. Epub 2016 Apr 21.
3
Industry and Occupation in the Electronic Health Record: An Investigation of the National Institute for Occupational Safety and Health Industry and Occupation Computerized Coding System.电子健康记录中的行业和职业:对国家职业安全与健康研究所行业和职业计算机编码系统的调查。
JMIR Med Inform. 2016 Feb 15;4(1):e5. doi: 10.2196/medinform.4839.
4
Efficiency of autocoding programs for converting job descriptors into standard occupational classification (SOC) codes.自动编码程序将工作描述转换为标准职业分类(SOC)代码的效率。
Am J Ind Med. 2019 Jan;62(1):59-68. doi: 10.1002/ajim.22928. Epub 2018 Dec 5.
5
Systematically extracting metal- and solvent-related occupational information from free-text responses to lifetime occupational history questionnaires.从终身职业史问卷的自由文本回答中系统提取与金属和溶剂相关的职业信息。
Ann Occup Hyg. 2014 Jun;58(5):612-24. doi: 10.1093/annhyg/meu012. Epub 2014 Mar 3.
6
Standard Occupational Classification Codes: Gaps in Federal Data on the Public Health Workforce.标准职业分类代码:公共卫生劳动力联邦数据中的差距。
Am J Public Health. 2024 Jan;114(1):48-56. doi: 10.2105/AJPH.2023.307463.
7
Coding of Central Cancer Registry Industry and Occupation Information: The Texas and Louisiana Experiences.中央癌症登记处行业和职业信息编码:得克萨斯州和路易斯安那州的经验
J Registry Manag. 2015 Fall;42(3):103-10.
8
Imputation of missing values in a large job exposure matrix using hierarchical information.利用分层信息对大型工作暴露矩阵中的缺失值进行推断。
J Expo Sci Environ Epidemiol. 2018 Nov;28(6):615-648. doi: 10.1038/s41370-018-0037-x. Epub 2018 May 23.
9
Evaluation of the updated SOCcer v2 algorithm for coding free-text job descriptions in three epidemiologic studies.评估更新后的 SOCcer v2 算法在三项流行病学研究中对自由文本工作描述进行编码的效果。
Ann Work Expo Health. 2023 Jul 6;67(6):772-783. doi: 10.1093/annweh/wxad020.
10
Evaluating the impact of occupational noise exposure on workplace fatal and nonfatal injuries in the U.S. (2006-2020).评估美国职业噪声暴露对工作场所致命和非致命伤害的影响(2006 - 2020年)
Int J Hyg Environ Health. 2025 Jan;263:114468. doi: 10.1016/j.ijheh.2024.114468. Epub 2024 Sep 26.