• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

来自4200万个非结构化招聘信息的职业模型。

Occupational models from 42 million unstructured job postings.

作者信息

Dixon Nile, Goggins Marcelle, Ho Ethan, Howison Mark, Long Joe, Northcott Emma, Shen Karen, Yeats Carrie

机构信息

Research Improving People's Lives, 1 Park Row, Suite 401, Providence, RI 02903, USA.

National Association of State Workforce Agencies, 444 N. Capitol Street NW, Suite 300, Washington, DC 20001, USA.

出版信息

Patterns (N Y). 2023 May 22;4(7):100757. doi: 10.1016/j.patter.2023.100757. eCollection 2023 Jul 14.

DOI:10.1016/j.patter.2023.100757
PMID:37521040
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10382938/
Abstract

Structuring jobs into occupations is the first step for analysis tasks in many fields of research, including economics and public health, as well as for practical applications like matching job seekers to available jobs. We present a data resource, derived with natural language processing techniques from over 42 million unstructured job postings in the National Labor Exchange, that empirically models the associations between occupation codes (estimated initially by the Standardized Occupation Coding for Computer-assisted Epidemiological Research method), skill keywords, job titles, and full-text job descriptions in the United States during the years 2019 and 2021. We model the probability that a job title is associated with an occupation code and that a job description is associated with skill keywords and occupation codes. Our models are openly available in the python package, which can assign occupation codes to job titles, parse skills from and assign occupation codes to job postings and resumes, and estimate occupational similarity among job postings, resumes, and occupation codes.

摘要

将工作结构化到职业类别中,是包括经济学和公共卫生在内的许多研究领域进行分析任务的第一步,也是诸如将求职者与现有工作进行匹配等实际应用的第一步。我们展示了一种数据资源,它是通过自然语言处理技术从国家劳动力交易所的4200多万个非结构化招聘信息中提取出来的,该数据资源对2019年至2021年期间美国职业代码(最初由计算机辅助流行病学研究的标准化职业编码方法估算)、技能关键词、职位名称和全文招聘信息之间的关联进行了实证建模。我们对职位名称与职业代码相关联以及招聘信息与技能关键词和职业代码相关联的概率进行建模。我们的模型在Python包中公开可用,该包可以为职位名称分配职业代码,从招聘信息和简历中解析技能并为其分配职业代码,还可以估算招聘信息、简历和职业代码之间的职业相似度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/669b212cf190/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/a49fabacf59f/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/734f7f647f74/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/056286655327/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/62a601fb3ed6/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/0b18dc43dd40/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/669b212cf190/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/a49fabacf59f/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/734f7f647f74/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/056286655327/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/62a601fb3ed6/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/0b18dc43dd40/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/10382938/669b212cf190/gr6.jpg

相似文献

1
Occupational models from 42 million unstructured job postings.来自4200万个非结构化招聘信息的职业模型。
Patterns (N Y). 2023 May 22;4(7):100757. doi: 10.1016/j.patter.2023.100757. eCollection 2023 Jul 14.
2
Computer-based coding of free-text job descriptions to efficiently identify occupations in epidemiological studies.基于计算机的自由文本职位描述编码,以在流行病学研究中高效识别职业。
Occup Environ Med. 2016 Jun;73(6):417-24. doi: 10.1136/oemed-2015-103152. Epub 2016 Apr 21.
3
Computer-Based Coding of Occupation Codes for Epidemiological Analyses.用于流行病学分析的职业代码的计算机编码
Proc IEEE Int Symp Comput Based Med Syst. 2014 May;2014:347-350. doi: 10.1109/CBMS.2014.79.
4
Beware the Grizzlyman: A comparison of job- and industry-based noise exposure estimates using manual coding and the NIOSH NIOCCS machine learning algorithm.小心灰熊人:使用手动编码和 NIOSH NIOCCS 机器学习算法比较基于工作和行业的噪声暴露估计。
J Occup Environ Hyg. 2022 Jul;19(7):437-447. doi: 10.1080/15459624.2022.2076860. Epub 2022 Jun 7.
5
Systematically extracting metal- and solvent-related occupational information from free-text responses to lifetime occupational history questionnaires.从终身职业史问卷的自由文本回答中系统提取与金属和溶剂相关的职业信息。
Ann Occup Hyg. 2014 Jun;58(5):612-24. doi: 10.1093/annhyg/meu012. Epub 2014 Mar 3.
6
Occupational self-coding and automatic recording (OSCAR): a novel web-based tool to collect and code lifetime job histories in large population-based studies.职业自我编码与自动记录(OSCAR):一种用于在大型基于人群的研究中收集和编码终生工作经历的新型网络工具。
Scand J Work Environ Health. 2017 Mar 1;43(2):181-186. doi: 10.5271/sjweh.3613. Epub 2016 Dec 14.
7
Occupation Coding of Job Titles: Iterative Development of an Automated Coding Algorithm for the Canadian National Occupation Classification (ACA-NOC).职位名称的职业编码:加拿大国家职业分类(ACA-NOC)自动编码算法的迭代开发
JMIR Form Res. 2020 Aug 5;4(8):e16422. doi: 10.2196/16422.
8
Impact of Variability in Job Coding on Reliability in Exposure Estimates Obtained via a Job-Exposure Matrix.工作编码变异性对通过工作暴露矩阵获得的暴露估计可靠性的影响。
Ann Work Expo Health. 2022 Jun 6;66(5):551-562. doi: 10.1093/annweh/wxab106.
9
Evaluation of the updated SOCcer v2 algorithm for coding free-text job descriptions in three epidemiologic studies.评估更新后的 SOCcer v2 算法在三项流行病学研究中对自由文本工作描述进行编码的效果。
Ann Work Expo Health. 2023 Jul 6;67(6):772-783. doi: 10.1093/annweh/wxad020.
10
Efficiency of autocoding programs for converting job descriptors into standard occupational classification (SOC) codes.自动编码程序将工作描述转换为标准职业分类(SOC)代码的效率。
Am J Ind Med. 2019 Jan;62(1):59-68. doi: 10.1002/ajim.22928. Epub 2018 Dec 5.

本文引用的文献

1
Introducing the FAIR Principles for research software.提出研究软件的 FAIR 原则。
Sci Data. 2022 Oct 14;9(1):622. doi: 10.1038/s41597-022-01710-x.
2
How many jobs can be done at home?有多少工作可以在家完成?
J Public Econ. 2020 Sep;189:104235. doi: 10.1016/j.jpubeco.2020.104235. Epub 2020 Jul 9.
3
Risk for Severe COVID-19 Illness Among Health Care Workers Who Work Directly with Patients.直接接触患者的医护人员中患重症 COVID-19 疾病的风险。
J Gen Intern Med. 2020 Sep;35(9):2804-2806. doi: 10.1007/s11606-020-05992-y. Epub 2020 Jun 24.
4
Efficiency of autocoding programs for converting job descriptors into standard occupational classification (SOC) codes.自动编码程序将工作描述转换为标准职业分类(SOC)代码的效率。
Am J Ind Med. 2019 Jan;62(1):59-68. doi: 10.1002/ajim.22928. Epub 2018 Dec 5.
5
Occupational self-coding and automatic recording (OSCAR): a novel web-based tool to collect and code lifetime job histories in large population-based studies.职业自我编码与自动记录(OSCAR):一种用于在大型基于人群的研究中收集和编码终生工作经历的新型网络工具。
Scand J Work Environ Health. 2017 Mar 1;43(2):181-186. doi: 10.5271/sjweh.3613. Epub 2016 Dec 14.
6
Computer-based coding of free-text job descriptions to efficiently identify occupations in epidemiological studies.基于计算机的自由文本职位描述编码,以在流行病学研究中高效识别职业。
Occup Environ Med. 2016 Jun;73(6):417-24. doi: 10.1136/oemed-2015-103152. Epub 2016 Apr 21.
7
Industry and Occupation in the Electronic Health Record: An Investigation of the National Institute for Occupational Safety and Health Industry and Occupation Computerized Coding System.电子健康记录中的行业和职业:对国家职业安全与健康研究所行业和职业计算机编码系统的调查。
JMIR Med Inform. 2016 Feb 15;4(1):e5. doi: 10.2196/medinform.4839.