Suppr超能文献

了解新冠疫情对卫生人力的影响:人工智能辅助的开源媒体内容分析

Understanding COVID-19 Impacts on the Health Workforce: AI-Assisted Open-Source Media Content Analysis.

作者信息

Pienkowska Anita, Ravaut Mathieu, Mammadova Maleyka, Ang Chin-Siang, Wang Hanyu, Ong Qi Chwen, Bojic Iva, Qin Vicky Mengqi, Sumsuzzman Dewan Md, Ajuebor Onyema, Boniol Mathieu, Bustamante Juana Paola, Campbell James, Cometto Giorgio, Fitzpatrick Siobhan, Kane Catherine, Joty Shafiq, Car Josip

机构信息

Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore.

College of Computing and Data Science, Nanyang Technological University, Singapore, Singapore.

出版信息

JMIR Form Res. 2024 Jun 13;8:e53574. doi: 10.2196/53574.

Abstract

BACKGROUND

To investigate the impacts of the COVID-19 pandemic on the health workforce, we aimed to develop a framework that synergizes natural language processing (NLP) techniques and human-generated analysis to reduce, organize, classify, and analyze a vast volume of publicly available news articles to complement scientific literature and support strategic policy dialogue, advocacy, and decision-making.

OBJECTIVE

This study aimed to explore the possibility of systematically scanning intelligence from media that are usually not captured or best gathered through structured academic channels and inform on the impacts of the COVID-19 pandemic on the health workforce, contributing factors to the pervasiveness of the impacts, and policy responses, as depicted in publicly available news articles. Our focus was to investigate the impacts of the COVID-19 pandemic and, concurrently, assess the feasibility of gathering health workforce insights from open sources rapidly.

METHODS

We conducted an NLP-assisted media content analysis of open-source news coverage on the COVID-19 pandemic published between January 2020 and June 2022. A data set of 3,299,158 English news articles on the COVID-19 pandemic was extracted from the World Health Organization Epidemic Intelligence through Open Sources (EIOS) system. The data preparation phase included developing rules-based classification, fine-tuning an NLP summarization model, and further data processing. Following relevancy evaluation, a deductive-inductive approach was used for the analysis of the summarizations. This included data extraction, inductive coding, and theme grouping.

RESULTS

After processing and classifying the initial data set comprising 3,299,158 news articles and reports, a data set of 5131 articles with 3,007,693 words was devised. The NLP summarization model allowed for a reduction in the length of each article resulting in 496,209 words that facilitated agile analysis performed by humans. Media content analysis yielded results in 3 sections: areas of COVID-19 impacts and their pervasiveness, contributing factors to COVID-19-related impacts, and responses to the impacts. The results suggest that insufficient remuneration and compensation packages have been key disruptors for the health workforce during the COVID-19 pandemic, leading to industrial actions and mental health burdens. Shortages of personal protective equipment and occupational risks have increased infection and death risks, particularly at the pandemic's onset. Workload and staff shortages became a growing disruption as the pandemic progressed.

CONCLUSIONS

This study demonstrates the capacity of artificial intelligence-assisted media content analysis applied to open-source news articles and reports concerning the health workforce. Adequate remuneration packages and personal protective equipment supplies should be prioritized as preventive measures to reduce the initial impact of future pandemics on the health workforce. Interventions aimed at lessening the emotional toll and workload need to be formulated as a part of reactive measures, enhancing the efficiency and maintainability of health delivery during a pandemic.

摘要

背景

为调查新冠疫情对卫生人力的影响,我们旨在制定一个框架,将自然语言处理(NLP)技术与人工分析相结合,以减少、整理、分类和分析大量公开的新闻文章,补充科学文献,并支持战略政策对话、宣传和决策。

目的

本研究旨在探索从通常无法通过结构化学术渠道获取或难以全面收集的媒体中系统扫描情报的可能性,并了解新冠疫情对卫生人力的影响、影响广泛存在的促成因素以及政策应对措施,这些内容在公开的新闻文章中有描述。我们的重点是调查新冠疫情的影响,同时评估从开源渠道快速获取卫生人力见解的可行性。

方法

我们对2020年1月至2022年6月期间发布的关于新冠疫情的开源新闻报道进行了NLP辅助的媒体内容分析。从世界卫生组织通过开源获取的疫情情报系统(EIOS)中提取了3299158篇关于新冠疫情的英文新闻文章数据集。数据准备阶段包括制定基于规则的分类、微调NLP摘要模型以及进一步的数据处理。在相关性评估之后,采用演绎-归纳法对摘要进行分析。这包括数据提取、归纳编码和主题分组。

结果

在对包含3299158篇新闻文章和报告的初始数据集进行处理和分类后,设计了一个包含5131篇文章、3007693个单词的数据集。NLP摘要模型使每篇文章的长度得以缩短,最终得到496209个单词,便于人工进行灵活分析。媒体内容分析产生了三个部分的结果:新冠疫情影响的领域及其普遍性、与新冠疫情相关影响的促成因素以及对这些影响的应对措施。结果表明,在新冠疫情期间,薪酬和补偿方案不足一直是卫生人力的关键干扰因素,导致了劳工行动和心理健康负担。个人防护设备短缺和职业风险增加了感染和死亡风险,尤其是在疫情初期。随着疫情的发展,工作量和人员短缺成为日益严重的干扰因素。

结论

本研究证明了人工智能辅助的媒体内容分析应用于有关卫生人力的开源新闻文章和报告的能力。应优先考虑提供充足的薪酬方案和个人防护设备供应,作为预防措施,以减少未来疫情对卫生人力的初始影响。作为应对措施的一部分,需要制定旨在减轻情感负担和工作量的干预措施,提高疫情期间卫生服务的效率和可维护性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4d3/11211705/b6b6d16af54a/formative_v8i1e53574_fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验