• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

针对新冠肺炎与卫生新闻信息提取的卫生人力资源:算法开发与验证

Targeting COVID-19 and Human Resources for Health News Information Extraction: Algorithm Development and Validation.

作者信息

Ravaut Mathieu, Zhao Ruochen, Phung Duy, Qin Vicky Mengqi, Milovanovic Dusan, Pienkowska Anita, Bojic Iva, Car Josip, Joty Shafiq

机构信息

Nanyang Technological University, Singapore, Singapore.

Episteme Systems, Geneva, Switzerland.

出版信息

JMIR AI. 2024 Oct 30;3:e55059. doi: 10.2196/55059.

DOI:10.2196/55059
PMID:39475833
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11561429/
Abstract

BACKGROUND

Global pandemics like COVID-19 put a high amount of strain on health care systems and health workers worldwide. These crises generate a vast amount of news information published online across the globe. This extensive corpus of articles has the potential to provide valuable insights into the nature of ongoing events and guide interventions and policies. However, the sheer volume of information is beyond the capacity of human experts to process and analyze effectively.

OBJECTIVE

The aim of this study was to explore how natural language processing (NLP) can be leveraged to build a system that allows for quick analysis of a high volume of news articles. Along with this, the objective was to create a workflow comprising human-computer symbiosis to derive valuable insights to support health workforce strategic policy dialogue, advocacy, and decision-making.

METHODS

We conducted a review of open-source news coverage from January 2020 to June 2022 on COVID-19 and its impacts on the health workforce from the World Health Organization (WHO) Epidemic Intelligence from Open Sources (EIOS) by synergizing NLP models, including classification and extractive summarization, and human-generated analyses. Our DeepCovid system was trained on 2.8 million news articles in English from more than 3000 internet sources across hundreds of jurisdictions.

RESULTS

Rules-based classification with hand-designed rules narrowed the data set to 8508 articles with high relevancy confirmed in the human-led evaluation. DeepCovid's automated information targeting component reached a very strong binary classification performance of 98.98 for the area under the receiver operating characteristic curve (ROC-AUC) and 47.21 for the area under the precision recall curve (PR-AUC). Its information extraction component attained good performance in automatic extractive summarization with a mean Recall-Oriented Understudy for Gisting Evaluation (ROUGE) score of 47.76. DeepCovid's final summaries were used by human experts to write reports on the COVID-19 pandemic.

CONCLUSIONS

It is feasible to synergize high-performing NLP models and human-generated analyses to benefit open-source health workforce intelligence. The DeepCovid approach can contribute to an agile and timely global view, providing complementary information to scientific literature.

摘要

背景

像新冠疫情这样的全球大流行给全球的医疗系统和医护人员带来了巨大压力。这些危机在全球范围内产生了大量在线发布的新闻信息。这一庞大的文章语料库有可能为正在发生的事件的性质提供有价值的见解,并指导干预措施和政策制定。然而,信息的数量之多超出了人类专家有效处理和分析的能力。

目的

本研究的目的是探索如何利用自然语言处理(NLP)来构建一个系统,以便能够快速分析大量新闻文章。与此同时,目标是创建一个包含人机共生的工作流程,以获得有价值的见解,支持卫生人力战略政策对话、宣传和决策。

方法

我们通过整合NLP模型(包括分类和提取式摘要)以及人工分析,对2020年1月至2022年6月来自世界卫生组织(WHO)开源疫情情报(EIOS)的关于新冠疫情及其对卫生人力影响的开源新闻报道进行了综述。我们的DeepCovid系统在来自数百个司法管辖区的3000多个互联网来源的280万篇英文新闻文章上进行了训练。

结果

基于手工设计规则的基于规则的分类将数据集缩小到8508篇文章,在人工主导的评估中确认具有高度相关性。DeepCovid的自动信息定位组件在接收器操作特征曲线(ROC-AUC)下的区域达到了98.98的非常强的二元分类性能,在精确召回曲线(PR-AUC)下的区域达到了47.21。其信息提取组件在自动提取式摘要方面表现良好,平均面向召回的摘要评估(ROUGE)分数为47.76。DeepCovid的最终摘要被人类专家用于撰写关于新冠疫情的报告。

结论

整合高性能NLP模型和人工分析以受益于开源卫生人力情报是可行的。DeepCovid方法可以促成敏捷及时的全球视角,为科学文献提供补充信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c6/11561429/c38e80d9c319/ai_v3i1e55059_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c6/11561429/e2513503e033/ai_v3i1e55059_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c6/11561429/69c15dd246f3/ai_v3i1e55059_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c6/11561429/c38e80d9c319/ai_v3i1e55059_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c6/11561429/e2513503e033/ai_v3i1e55059_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c6/11561429/69c15dd246f3/ai_v3i1e55059_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c6/11561429/c38e80d9c319/ai_v3i1e55059_fig3.jpg

相似文献

1
Targeting COVID-19 and Human Resources for Health News Information Extraction: Algorithm Development and Validation.针对新冠肺炎与卫生新闻信息提取的卫生人力资源:算法开发与验证
JMIR AI. 2024 Oct 30;3:e55059. doi: 10.2196/55059.
2
Understanding COVID-19 Impacts on the Health Workforce: AI-Assisted Open-Source Media Content Analysis.了解新冠疫情对卫生人力的影响:人工智能辅助的开源媒体内容分析
JMIR Form Res. 2024 Jun 13;8:e53574. doi: 10.2196/53574.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Exploring the potential of ChatGPT in medical dialogue summarization: a study on consistency with human preferences.探索 ChatGPT 在医学对话总结中的潜力:一项关于与人类偏好一致性的研究。
BMC Med Inform Decis Mak. 2024 Mar 14;24(1):75. doi: 10.1186/s12911-024-02481-8.
5
Beyond the black stump: rapid reviews of health research issues affecting regional, rural and remote Australia.超越黑木树:影响澳大利亚地区、农村和偏远地区的健康研究问题的快速综述。
Med J Aust. 2020 Dec;213 Suppl 11:S3-S32.e1. doi: 10.5694/mja2.50881.
6
Agenda-Setting for COVID-19: A Study of Large-Scale Economic News Coverage Using Natural Language Processing.新冠疫情的议程设置:一项运用自然语言处理技术对大规模经济新闻报道的研究
Int J Data Sci Anal. 2023;15(3):291-312. doi: 10.1007/s41060-022-00364-7. Epub 2022 Oct 6.
7
CERC: an interactive content extraction, recognition, and construction tool for clinical and biomedical text.CERC:一个用于临床和生物医学文本的交互式内容提取、识别和构建工具。
BMC Med Inform Decis Mak. 2020 Dec 15;20(Suppl 14):306. doi: 10.1186/s12911-020-01330-8.
8
Data Exploration and Classification of News Article Reliability: Deep Learning Study.新闻文章可靠性的数据探索与分类:深度学习研究
JMIR Infodemiology. 2022 Sep 22;2(2):e38839. doi: 10.2196/38839. eCollection 2022 Jul-Dec.
9
Quantifying Online News Media Coverage of the COVID-19 Pandemic: Text Mining Study and Resource.量化新冠疫情期间的在线新闻媒体报道:文本挖掘研究与资源
J Med Internet Res. 2021 Jun 2;23(6):e28253. doi: 10.2196/28253.
10
Modified Bidirectional Encoder Representations From Transformers Extractive Summarization Model for Hospital Information Systems Based on Character-Level Tokens (AlphaBERT): Development and Performance Evaluation.基于字符级令牌的医院信息系统变压器抽取式摘要模型(AlphaBERT)的改进双向编码器表示:开发与性能评估
JMIR Med Inform. 2020 Apr 29;8(4):e17787. doi: 10.2196/17787.

本文引用的文献

1
A review on Natural Language Processing Models for COVID-19 research.关于用于新冠病毒研究的自然语言处理模型的综述。
Healthc Anal (N Y). 2022 Nov;2:100078. doi: 10.1016/j.health.2022.100078. Epub 2022 Jul 19.
2
Advances in Artificial Intelligence for Infectious-Disease Surveillance.用于传染病监测的人工智能进展。
N Engl J Med. 2023 Apr 27;388(17):1597-1607. doi: 10.1056/NEJMra2119215.
3
Surveillance of communicable diseases using social media: A systematic review.利用社交媒体进行传染病监测:系统评价。
PLoS One. 2023 Feb 24;18(2):e0282101. doi: 10.1371/journal.pone.0282101. eCollection 2023.
4
The Role of Natural Language Processing during the COVID-19 Pandemic: Health Applications, Opportunities, and Challenges.自然语言处理在新冠疫情期间的作用:健康应用、机遇与挑战
Healthcare (Basel). 2022 Nov 12;10(11):2270. doi: 10.3390/healthcare10112270.
5
Using Natural Language Processing to Explore Mental Health Insights From UK Tweets During the COVID-19 Pandemic: Infodemiology Study.利用自然语言处理技术探索新冠疫情期间英国推文所反映的心理健康见解:信息流行病学研究
JMIR Infodemiology. 2022 Mar 31;2(1):e32449. doi: 10.2196/32449. eCollection 2022 Jan-Jun.
6
Biases in using social media data for public health surveillance: A scoping review.社交媒体数据在公共卫生监测中的应用偏差:范围综述。
Int J Med Inform. 2022 Aug;164:104804. doi: 10.1016/j.ijmedinf.2022.104804. Epub 2022 May 23.
7
Impact of COVID-19 pandemic on healthcare workers.新冠疫情对医护人员的影响。
Ind Psychiatry J. 2021 Oct;30(Suppl 1):S282-S284. doi: 10.4103/0972-6748.328830. Epub 2021 Oct 22.
8
Analysis of mental and physical disorders associated with COVID-19 in online health forums: a natural language processing study.分析在线健康论坛中与 COVID-19 相关的身心障碍:一项自然语言处理研究。
BMJ Open. 2021 Nov 5;11(11):e056601. doi: 10.1136/bmjopen-2021-056601.
9
Prediction of Mental Health in Medical Workers During COVID-19 Based on Machine Learning.基于机器学习的新冠疫情期间医护人员心理健康预测
Front Public Health. 2021 Sep 7;9:697850. doi: 10.3389/fpubh.2021.697850. eCollection 2021.
10
Impacts of COVID-19 on the Life and Work of Healthcare Workers During the Nationwide Partial Lockdown in Vietnam.越南全国部分地区封锁期间新冠疫情对医护人员生活和工作的影响
Front Psychol. 2021 Aug 19;12:563193. doi: 10.3389/fpsyg.2021.563193. eCollection 2021.