Automatic Identification of Information Quality Metrics in Health News Stories.

Authors

Al-Jefri Majed, Evans Roger, Lee Joon, Ghezzi Pietro

Affiliations

Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.

Data Intelligence for Health Lab, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.

Publication information

Front Public Health. 2020 Dec 18;8:515347. doi: 10.3389/fpubh.2020.515347. eCollection 2020.

DOI:10.3389/fpubh.2020.515347
PMID:33392124
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7775604/
Abstract

Many online and printed media publish health news of questionable trustworthiness and it may be difficult for laypersons to determine the information quality of such articles. The purpose of this work was to propose a methodology for the automatic assessment of the quality of health-related news stories using natural language processing and machine learning. We used a database from the website HealthNewsReview.org that aims to improve the public dialogue about health care. HealthNewsReview.org developed a set of criteria to critically analyze health care interventions' claims. In this work, we attempt to automate the evaluation process by identifying the indicators of those criteria using natural language processing-based machine learning on a corpus of more than 1,300 news stories. We explored features ranging from simple n-grams to more advanced linguistic features and optimized the feature selection for each task. Additionally, we experimented with the use of pre-trained natural language model BERT. For some criteria, such as mention of costs, benefits, harms, and "disease-mongering," the evaluation results were promising with an F measure reaching 81.94%, while for others the results were less satisfactory due to the dataset size, the need of external knowledge, or the subjectivity in the evaluation process. These used criteria are more challenging than those addressed by previous work, and our aim was to investigate how much more difficult the machine learning task was, and how and why it varied between criteria. For some criteria, the obtained results were promising; however, automated evaluation of the other criteria may not yet replace the manual evaluation process where human experts interpret text senses and make use of external knowledge in their assessment.
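As a rough illustration of two ideas the abstract names, n-gram features and the F measure, the following is a minimal sketch rather than the authors' pipeline. The cue list `COST_CUES`, the rule-based `mentions_cost` stand-in for a trained classifier, and the four toy stories are all hypothetical; the F measure is assumed here to be the balanced F1 score.

```python
from collections import Counter

def word_ngrams(text, n_max=2):
    """Count word n-grams of length 1..n_max (lowercased, whitespace-tokenized)."""
    tokens = text.lower().split()
    counts = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts

# Hypothetical cue n-grams for a "mentions costs" criterion (not from the paper).
COST_CUES = {"cost", "costs", "price", "per month"}

def mentions_cost(text):
    """Toy rule standing in for a learned classifier: 1 if any cue n-gram occurs."""
    grams = word_ngrams(text, n_max=2)
    return int(any(cue in grams for cue in COST_CUES))

def f_measure(y_true, y_pred, positive=1):
    """Balanced F measure (F1) for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Invented toy stories with gold labels (1 = story discusses costs).
stories = [
    ("The new drug costs $300 per month and may cause nausea.", 1),
    ("Researchers say the therapy is a breakthrough for patients.", 0),
    ("Insurance may not cover the price of the scan.", 1),
    ("The treatment is expensive for most families.", 1),
]
gold = [label for _, label in stories]
pred = [mentions_cost(text) for text, _ in stories]
score = f_measure(gold, pred)
print(f"F measure: {score:.2f}")
```

The last toy story is missed because "expensive" is not in the cue list, which loosely mirrors the abstract's point that some criteria depend on interpreting text senses rather than matching surface n-grams.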


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acc8/7775604/ba35471494b3/fpubh-08-515347-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acc8/7775604/b46d127ac26a/fpubh-08-515347-g0001.jpg


