Chair for Clinical Bioinformatics, Saarland University, 66123 Saarbrücken, Germany.
Department of Microbial Natural Products, Helmholtz-Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Campus E8 1, 66123 Saarbrücken, Germany.
Nucleic Acids Res. 2022 Jan 7;50(D1):D273-D278. doi: 10.1093/nar/gkab1111.
Plasmids are known to contain genes encoding for virulence factors and antibiotic resistance mechanisms. Their relevance in metagenomic data processing is steadily growing. However, with the increasing popularity and scale of metagenomics experiments, the number of reported plasmids is rapidly growing as well, amassing a considerable number of false positives due to undetected misassembles. Here, our previously published database PLSDB provides a reliable resource for researchers to quickly compare their sequences against selected and annotated previous findings. Within two years, the size of this resource has more than doubled from the initial 13,789 to now 34,513 entries over the course of eight regular data updates. For this update, we aggregated community feedback for major changes to the database featuring new analysis functionality as well as performance, quality, and accessibility improvements. New filtering steps, annotations, and preprocessing of existing records improve the quality of the provided data. Additionally, new features implemented in the web-server ease user interaction and allow for a deeper understanding of custom uploaded sequences, by visualizing similarity information. Lastly, an application programming interface was implemented along with a python library, to allow remote database queries in automated workflows. The latest release of PLSDB is freely accessible under https://www.ccb.uni-saarland.de/plsdb.
质粒已知含有编码毒力因子和抗生素耐药机制的基因。它们在宏基因组数据处理中的相关性正在稳步增长。然而,随着宏基因组实验的普及和规模的不断扩大,报告的质粒数量也在迅速增加,由于未检测到的错误组装,积累了相当数量的假阳性。在这里,我们之前发布的数据库 PLSDB 为研究人员提供了一个可靠的资源,以便他们可以快速将自己的序列与经过选择和注释的先前发现进行比较。在两年内,这个资源的大小从最初的 13789 个增加到了现在的 34513 个,这是在经过八次常规数据更新的过程中实现的。在此次更新中,我们根据社区的反馈,对数据库进行了重大更改,增加了新的分析功能以及性能、质量和可访问性的改进。新的过滤步骤、注释和对现有记录的预处理提高了提供数据的质量。此外,在 web 服务器中实现的新功能简化了用户交互,并允许通过可视化相似性信息,更深入地了解自定义上传的序列。最后,实现了一个应用程序编程接口和一个 Python 库,以允许在自动化工作流程中进行远程数据库查询。最新版本的 PLSDB 可在 https://www.ccb.uni-saarland.de/plsdb 上免费访问。