• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

文本挖掘以鉴定与抗药生物体相关的生物实体。

Text mining for identification of biological entities related to antibiotic resistant organisms.

机构信息

Programa de pós-graduação em Engenharia Elétrica, Universidade Federal do Pará, Belém, Pará, Brazil.

Biological Science Institute, Universidade Federal do Pará, Belém, Pará, Brazil.

出版信息

PeerJ. 2022 May 5;10:e13351. doi: 10.7717/peerj.13351. eCollection 2022.

DOI:10.7717/peerj.13351
PMID:35539017
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9080439/
Abstract

Antimicrobial resistance is a significant public health problem worldwide. In recent years, the scientific community has been intensifying efforts to combat this problem; many experiments have been developed, and many articles are published in this area. However, the growing volume of biological literature increases the difficulty of the biocuration process due to the cost and time required. Modern text mining tools with the adoption of artificial intelligence technology are helpful to assist in the evolution of research. In this article, we propose a text mining model capable of identifying and ranking prioritizing scientific articles in the context of antimicrobial resistance. We retrieved scientific articles from the PubMed database, adopted machine learning techniques to generate the vector representation of the retrieved scientific articles, and identified their similarity with the context. As a result of this process, we obtained a dataset labeled "Relevant" and "Irrelevant" and used this dataset to implement one supervised learning algorithm to classify new records. The model's overall performance reached 90% accuracy and the f-measure (harmonic mean between the metrics) reached 82% accuracy for positive class and 93% for negative class, showing quality in the identification of scientific articles relevant to the context. The dataset, scripts and models are available at https://github.com/engbiopct/TextMiningAMR.

摘要

抗菌药物耐药性是全球范围内一个重大的公共卫生问题。近年来,科学界一直在加紧努力应对这一问题;在这一领域已经开发了许多实验,并发表了许多文章。然而,由于成本和时间的原因,生物文献的数量不断增加,增加了生物注释过程的难度。采用人工智能技术的现代文本挖掘工具有助于协助研究的发展。在本文中,我们提出了一种能够识别和优先排序抗菌药物耐药性背景下的科学文章的文本挖掘模型。我们从 PubMed 数据库中检索科学文章,采用机器学习技术生成检索科学文章的向量表示,并识别它们与上下文的相似性。通过这个过程,我们得到了一个标记为“相关”和“不相关”的数据集,并使用这个数据集来实现一个监督学习算法对新记录进行分类。该模型的整体性能达到了 90%的准确率,正类的 F1 测度(度量之间的调和平均值)达到了 82%,负类达到了 93%,表明在识别与上下文相关的科学文章方面具有良好的性能。数据集、脚本和模型可在 https://github.com/engbiopct/TextMiningAMR 上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376e/9080439/d5a701806f85/peerj-10-13351-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376e/9080439/bc6278c1c329/peerj-10-13351-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376e/9080439/83dfd4a9ddf3/peerj-10-13351-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376e/9080439/100bceffb899/peerj-10-13351-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376e/9080439/038b9b709f4c/peerj-10-13351-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376e/9080439/d5a701806f85/peerj-10-13351-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376e/9080439/bc6278c1c329/peerj-10-13351-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376e/9080439/83dfd4a9ddf3/peerj-10-13351-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376e/9080439/100bceffb899/peerj-10-13351-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376e/9080439/038b9b709f4c/peerj-10-13351-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376e/9080439/d5a701806f85/peerj-10-13351-g005.jpg

相似文献

1
Text mining for identification of biological entities related to antibiotic resistant organisms.文本挖掘以鉴定与抗药生物体相关的生物实体。
PeerJ. 2022 May 5;10:e13351. doi: 10.7717/peerj.13351. eCollection 2022.
2
Development of benchmark datasets for text mining and sentiment analysis to accelerate regulatory literature review.开发用于文本挖掘和情感分析的基准数据集,以加速监管文献综述。
Regul Toxicol Pharmacol. 2023 Jan;137:105287. doi: 10.1016/j.yrtph.2022.105287. Epub 2022 Nov 11.
3
Knowledge based word-concept model estimation and refinement for biomedical text mining.用于生物医学文本挖掘的基于知识的词概念模型估计与优化。
J Biomed Inform. 2015 Feb;53:300-7. doi: 10.1016/j.jbi.2014.11.015. Epub 2014 Dec 12.
4
Text mining-based word representations for biomedical data analysis and protein-protein interaction networks in machine learning tasks.基于文本挖掘的词表示在生物医学数据分析和机器学习任务中的蛋白质-蛋白质相互作用网络。
PLoS One. 2021 Oct 15;16(10):e0258623. doi: 10.1371/journal.pone.0258623. eCollection 2021.
5
Role of biological Data Mining and Machine Learning Techniques in Detecting and Diagnosing the Novel Coronavirus (COVID-19): A Systematic Review.生物数据挖掘和机器学习技术在检测和诊断新型冠状病毒 (COVID-19) 中的作用:系统评价。
J Med Syst. 2020 May 25;44(7):122. doi: 10.1007/s10916-020-01582-x.
6
BioReader: a text mining tool for performing classification of biomedical literature.BioReader:一种文本挖掘工具,用于对生物医学文献进行分类。
BMC Bioinformatics. 2019 Feb 4;19(Suppl 13):57. doi: 10.1186/s12859-019-2607-x.
7
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
8
A review on utilizing machine learning technology in the fields of electronic emergency triage and patient priority systems in telemedicine: Coherent taxonomy, motivations, open research challenges and recommendations for intelligent future work.利用机器学习技术在电子急诊分诊和远程医疗患者优先系统领域的应用综述:连贯的分类法、动机、开放的研究挑战和对智能未来工作的建议。
Comput Methods Programs Biomed. 2021 Sep;209:106357. doi: 10.1016/j.cmpb.2021.106357. Epub 2021 Aug 16.
9
Unsupervised and self-supervised deep learning approaches for biomedical text mining.无监督和自监督深度学习方法在生物医学文本挖掘中的应用。
Brief Bioinform. 2021 Mar 22;22(2):1592-1603. doi: 10.1093/bib/bbab016.
10
A Text Mining Pipeline Using Active and Deep Learning Aimed at Curating Information in Computational Neuroscience.使用主动和深度学习的文本挖掘管道,旨在为计算神经科学中的信息提供支持。
Neuroinformatics. 2019 Jul;17(3):391-406. doi: 10.1007/s12021-018-9404-y.

本文引用的文献

1
CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database.CARD 2020:利用综合抗生素耐药数据库进行抗生素耐药组监测。
Nucleic Acids Res. 2020 Jan 8;48(D1):D517-D525. doi: 10.1093/nar/gkz935.
2
Text mining tools for extracting information about microbial biodiversity in food.用于从食品中提取微生物生物多样性信息的文本挖掘工具。
Food Microbiol. 2019 Aug;81:63-75. doi: 10.1016/j.fm.2018.04.011. Epub 2018 Apr 21.
3
Prediction of the intestinal resistome by a three-dimensional structure-based method.
基于三维结构的方法预测肠道抗药基因。
Nat Microbiol. 2019 Jan;4(1):112-123. doi: 10.1038/s41564-018-0292-6. Epub 2018 Nov 26.
4
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2019 Jan 8;47(D1):D23-D28. doi: 10.1093/nar/gky1069.
5
Abundance and diversity of the faecal resistome in slaughter pigs and broilers in nine European countries.九国屠宰猪和肉鸡粪便中耐药组的丰度和多样性。
Nat Microbiol. 2018 Aug;3(8):898-908. doi: 10.1038/s41564-018-0192-9. Epub 2018 Jul 23.
6
ARGs-OAP v2.0 with an expanded SARG database and Hidden Markov Models for enhancement characterization and quantification of antibiotic resistance genes in environmental metagenomes.ARGs-OAP v2.0 版本,其 SARG 数据库得到了扩展,并采用隐马尔可夫模型来增强环境宏基因组中抗生素抗性基因的特征描述和定量分析。
Bioinformatics. 2018 Jul 1;34(13):2263-2270. doi: 10.1093/bioinformatics/bty053.
7
Beta-lactamase database (BLDB) - structure and function.β-内酰胺酶数据库(BLDB)——结构与功能
J Enzyme Inhib Med Chem. 2017 Dec;32(1):917-919. doi: 10.1080/14756366.2017.1344235.
8
FARME DB: a functional antibiotic resistance element database.FARME DB:一个功能性抗生素抗性元件数据库。
Database (Oxford). 2017 Jan 10;2017. doi: 10.1093/database/baw165. Print 2017.
9
MEGARes: an antimicrobial resistance database for high throughput sequencing.MEGARes:一个用于高通量测序的抗菌药物耐药性数据库。
Nucleic Acids Res. 2017 Jan 4;45(D1):D574-D580. doi: 10.1093/nar/gkw1009. Epub 2016 Nov 28.
10
Mechanisms of drug resistance: daptomycin resistance.耐药机制:达托霉素耐药性
Ann N Y Acad Sci. 2015 Sep;1354:32-53. doi: 10.1111/nyas.12948. Epub 2015 Oct 23.