• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用系统综述方案对医学摘要进行分类。

Exploiting the systematic review protocol for classification of medical abstracts.

机构信息

School of Information Technology and Engineering, University of Ottawa, 800 King Edward, Ottawa, Ontario, Canada K1N 6N5.

出版信息

Artif Intell Med. 2011 Jan;51(1):17-25. doi: 10.1016/j.artmed.2010.10.005. Epub 2010 Nov 16.

DOI:10.1016/j.artmed.2010.10.005
PMID:21084178
Abstract

OBJECTIVE

To determine whether the automatic classification of documents can be useful in systematic reviews on medical topics, and specifically if the performance of the automatic classification can be enhanced by using the particular protocol of questions employed by the human reviewers to create multiple classifiers.

METHODS AND MATERIALS

The test collection is the data used in large-scale systematic review on the topic of the dissemination strategy of health care services for elderly people. From a group of 47,274 abstracts marked by human reviewers to be included in or excluded from further screening, we randomly selected 20,000 as a training set, with the remaining 27,274 becoming a separate test set. As a machine learning algorithm we used complement naïve Bayes. We tested both a global classification method, where a single classifier is trained on instances of abstracts and their classification (i.e., included or excluded), and a novel per-question classification method that trains multiple classifiers for each abstract, exploiting the specific protocol (questions) of the systematic review. For the per-question method we tested four ways of combining the results of the classifiers trained for the individual questions. As evaluation measures, we calculated precision and recall for several settings of the two methods. It is most important not to exclude any relevant documents (i.e., to attain high recall for the class of interest) but also desirable to exclude most of the non-relevant documents (i.e., to attain high precision on the class of interest) in order to reduce human workload.

RESULTS

For the global method, the highest recall was 67.8% and the highest precision was 37.9%. For the per-question method, the highest recall was 99.2%, and the highest precision was 63%. The human-machine workflow proposed in this paper achieved a recall value of 99.6%, and a precision value of 17.8%.

CONCLUSION

The per-question method that combines classifiers following the specific protocol of the review leads to better results than the global method in terms of recall. Because neither method is efficient enough to classify abstracts reliably by itself, the technology should be applied in a semi-automatic way, with a human expert still involved. When the workflow includes one human expert and the trained automatic classifier, recall improves to an acceptable level, showing that automatic classification techniques can reduce the human workload in the process of building a systematic review.

摘要

目的

确定文献的自动分类是否可用于医学主题的系统评价,特别是通过使用人类审查员创建多个分类器所采用的特定问题协议,是否可以提高自动分类的性能。

方法和材料

测试集是在针对老年人医疗服务传播策略的大型系统评价中使用的数据。从由人类审查员标记为包含或排除进一步筛选的 47274 个摘要中,我们随机选择了 20000 个作为训练集,其余 27274 个成为单独的测试集。作为机器学习算法,我们使用了补充朴素贝叶斯。我们测试了两种分类方法,一种是全局分类方法,即使用单个分类器对摘要及其分类(即包含或排除)进行训练,另一种是新的每问题分类方法,该方法为每个摘要训练多个分类器,利用系统评价的特定协议(问题)。对于每问题方法,我们测试了四种方法来组合为各个问题训练的分类器的结果。作为评估指标,我们计算了两种方法的几种设置的精度和召回率。最重要的是不要排除任何相关文献(即对感兴趣的类达到高召回率),但也希望排除大多数不相关文献(即对感兴趣的类达到高精度),以减少人工工作量。

结果

对于全局方法,最高召回率为 67.8%,最高精度为 37.9%。对于每问题方法,最高召回率为 99.2%,最高精度为 63%。本文提出的人机工作流程实现了 99.6%的召回率和 17.8%的精度。

结论

与全局方法相比,按照审查特定协议结合分类器的每问题方法在召回率方面具有更好的结果。由于没有一种方法本身能够可靠地对摘要进行分类,因此该技术应该以半自动方式应用,仍然需要人类专家的参与。当工作流程包括一名人类专家和训练有素的自动分类器时,召回率会提高到可接受的水平,这表明自动分类技术可以减少系统评价过程中的人工工作量。

相似文献

1
Exploiting the systematic review protocol for classification of medical abstracts.利用系统综述方案对医学摘要进行分类。
Artif Intell Med. 2011 Jan;51(1):17-25. doi: 10.1016/j.artmed.2010.10.005. Epub 2010 Nov 16.
2
An SVM-based high-quality article classifier for systematic reviews.一种用于系统评价的基于支持向量机的高质量文章分类器。
J Biomed Inform. 2014 Feb;47:153-9. doi: 10.1016/j.jbi.2013.10.005. Epub 2013 Oct 29.
3
Building a protein name dictionary from full text: a machine learning term extraction approach.从全文构建蛋白质名称词典:一种机器学习术语提取方法。
BMC Bioinformatics. 2005 Apr 7;6:88. doi: 10.1186/1471-2105-6-88.
4
Medical linguistics: automated indexing into SNOMED.医学语言学:自动索引至医学系统命名法(SNOMED)
Crit Rev Med Inform. 1988;1(4):333-403.
5
The evidence for nursing interventions in the Cochrane Database of Systematic Reviews.Cochrane系统评价数据库中关于护理干预措施的证据。
Nurse Res. 2004;12(2):71-80.
6
Updating reviews: the experience of the Cochrane Neonatal Review Group.更新综述:考科蓝新生儿综述小组的经验
Paediatr Perinat Epidemiol. 2008 Jan;22 Suppl 1:29-32. doi: 10.1111/j.1365-3016.2007.00909.x.
7
[Structured abstracting of scientific medical publications].[医学科学出版物的结构化摘要]
Przegl Lek. 1995;52(3):105-8.
8
Tough mining: the challenges of searching the scientific literature.艰难的挖掘:搜索科学文献的挑战。
PLoS Biol. 2003 Nov;1(2):E48. doi: 10.1371/journal.pbio.0000048. Epub 2003 Nov 17.
9
Markov model recognition and classification of DNA/protein sequences within large text databases.大型文本数据库中DNA/蛋白质序列的马尔可夫模型识别与分类
Bioinformatics. 2005 Nov 1;21(21):4046-53. doi: 10.1093/bioinformatics/bti657. Epub 2005 Sep 13.
10
Mapping the literature of nursing administration.梳理护理管理文献。
J Med Libr Assoc. 2006 Apr;94(2 Suppl):E87-91.

引用本文的文献

1
A comparative study of screening performance between abstrackr and GPT models: Systematic review and contextual analysis.Abstrackr与GPT模型筛查性能的比较研究:系统评价与情境分析。
BMC Med Inform Decis Mak. 2025 Aug 7;25(1):293. doi: 10.1186/s12911-025-03138-w.
2
A question-answering framework for automated abstract screening using large language models.基于大语言模型的自动文摘筛选的问答框架。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1939-1952. doi: 10.1093/jamia/ocae166.
3
Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed.
生物医学文献系统评价自动化:PubMed 索引研究的范围综述。
Syst Rev. 2024 Jul 8;13(1):174. doi: 10.1186/s13643-024-02592-3.
4
Machine Learning Methods for Systematic Reviews:: A Rapid Scoping Review.系统评价的机器学习方法:快速范围综述
Dela J Public Health. 2023 Nov 30;9(4):40-47. doi: 10.32481/djph.2023.11.008. eCollection 2023 Nov.
5
Improving reference prioritisation with PICO recognition.通过 PICO 识别提高文献优先排序。
BMC Med Inform Decis Mak. 2019 Dec 5;19(1):256. doi: 10.1186/s12911-019-0992-8.
6
Novel text analytics approach to identify relevant literature for human health risk assessments: A pilot study with health effects of in utero exposures.一种新颖的文本分析方法,用于识别与人类健康风险评估相关的文献:以宫内暴露的健康影响为研究对象的初步研究
Environ Int. 2020 Jan;134:105228. doi: 10.1016/j.envint.2019.105228. Epub 2019 Nov 8.
7
Using Machine Learning and Natural Language Processing to Review and Classify the Medical Literature on Cancer Susceptibility Genes.使用机器学习和自然语言处理技术对癌症易感基因的医学文献进行综述和分类。
JCO Clin Cancer Inform. 2019 Sep;3:1-9. doi: 10.1200/CCI.19.00042.
8
Validation of a Semiautomated Natural Language Processing-Based Procedure for Meta-Analysis of Cancer Susceptibility Gene Penetrance.基于半自动化自然语言处理的癌症易感性基因外显率荟萃分析程序的验证
JCO Clin Cancer Inform. 2019 Aug;3:1-9. doi: 10.1200/CCI.19.00043.
9
Prioritising references for systematic reviews with RobotAnalyst: A user study.使用 RobotAnalyst 对系统评价进行优先排序:一项用户研究。
Res Synth Methods. 2018 Sep;9(3):470-488. doi: 10.1002/jrsm.1311. Epub 2018 Jul 30.
10
Patient healthcare trajectory. An essential monitoring tool: a systematic review.患者医疗轨迹。一种重要的监测工具:系统综述。
Health Inf Sci Syst. 2017 Apr 12;5(1):1. doi: 10.1007/s13755-017-0020-2. eCollection 2017 Dec.