David Braley Centre for Antibiotic Discovery, McMaster University, 1280 Main Street West, Hamilton, ON L8S 4L8, Canada.
Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, 1280 Main Street West, Hamilton, ON L8S 4L8, Canada.
Database (Oxford). 2023 Apr 20;2023. doi: 10.1093/database/baad023.
Scientific literature is published at a rate that makes manual data extraction highly time-consuming. The Comprehensive Antibiotic Resistance Database (CARD) uses the literature to curate information on antimicrobial resistance genes. To enable time-efficient triage of publications, we have developed CARDShark, a classification algorithm for identifying publications that describe first reports of new resistance genes. Trained on publications contained in CARD, CARDShark downloads, processes and identifies publications recently added to PubMed that should be reviewed by biocurators. With CARDShark, we can reduce the number of articles a biocurator reviews each month from hundreds to a few dozen, drastically improving the speed of curation while ensuring no relevant publications are overlooked. Database URL: http://card.mcmaster.ca.
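The abstract does not say which classifier or features CARDShark uses, so the following is only a minimal sketch of the general idea of literature triage: train a text classifier on already-curated abstracts, then flag new abstracts for review. It assumes a toy multinomial naive Bayes model over word counts as a stand-in; the class name `NaiveBayesTriage`, the labels `curate` and `skip`, and the example abstracts are all hypothetical and are not taken from CARD or CARDShark.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesTriage:
    """Toy multinomial naive Bayes with add-one smoothing (illustrative stand-in)."""

    def fit(self, texts, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)
        # Per-class word frequencies gathered from the training abstracts.
        self.word_counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = set()
        for counts in self.word_counts.values():
            self.vocab.update(counts)
        return self

    def log_score(self, text, label):
        """Log prior plus smoothed log likelihood of the known tokens."""
        total_docs = sum(self.doc_counts.values())
        score = math.log(self.doc_counts[label] / total_docs)
        counts = self.word_counts[label]
        denom = sum(counts.values()) + len(self.vocab)
        for tok in tokenize(text):
            if tok in self.vocab:  # tokens never seen in training are ignored
                score += math.log((counts[tok] + 1) / denom)
        return score

    def predict(self, text):
        return max(self.classes, key=lambda c: self.log_score(text, c))


# Hypothetical training set: abstracts a curator has already labelled.
train_texts = [
    "We report a novel beta-lactamase gene conferring carbapenem resistance",
    "First description of a new plasmid-borne colistin resistance gene",
    "A survey of hospital hand hygiene compliance among nurses",
    "Review of vaccine uptake in rural communities",
]
train_labels = ["curate", "curate", "skip", "skip"]

model = NaiveBayesTriage().fit(train_texts, train_labels)

# Hypothetical new abstracts, standing in for a monthly PubMed download.
new_abstracts = [
    "Identification of a novel resistance gene in Klebsiella",
    "Hand hygiene survey among hospital nurses",
]
flagged = [a for a in new_abstracts if model.predict(a) == "curate"]
print(flagged)
```

In a real triage setting the decision threshold would presumably be tuned toward recall, since the stated goal is that no relevant publication is overlooked even if some irrelevant ones still reach the biocurator.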