State Key Laboratory for Infectious Disease Prevention and Control, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing 102206, China.
Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Hangzhou 310003, China.
Database (Oxford). 2018 Jan 1;2018. doi: 10.1093/database/bay055.
Advances in high-throughput sequencing have led to unprecedented growth in the amount of available genome sequencing data, especially for bacterial genomes, which has been accompanied by a challenge for the storage and management of such huge datasets. To facilitate bacterial research and related studies, we have developed the Mypathogen database (MPD), which provides access to users for searching, downloading, storing and sharing bacterial genomics data. The MPD represents the first pathogenic database for microbial genomes and metagenomes, and currently covers pathogenic microbial genomes (6604 genera, 11 071 species, 41 906 strains) and metagenomic data from host, air, water and other sources (28 816 samples). The MPD also functions as a management system for statistical and storage data that can be used by different organizations, thereby facilitating data sharing among different organizations and research groups. A user-friendly local client tool is provided to maintain the steady transmission of big sequencing data. The MPD is a useful tool for analysis and management in genomic research, especially for clinical Centers for Disease Control and epidemiological studies, and is expected to contribute to advancing knowledge on pathogenic bacteria genomes and metagenomes.Database URL: http://data.mypathogen.org.
高通量测序技术的进步使得可用的基因组测序数据呈指数级增长,尤其是细菌基因组的数据量,这给这些庞大数据集的存储和管理带来了挑战。为了促进细菌研究和相关研究,我们开发了 Mypathogen 数据库(MPD),为用户提供了搜索、下载、存储和共享细菌基因组数据的途径。MPD 是第一个针对微生物基因组和宏基因组的致病数据库,目前涵盖了致病微生物基因组(6604 属,11071 种,41906 株)和来自宿主、空气、水等来源的宏基因组数据(28816 个样本)。MPD 还作为一个统计和存储数据的管理系统,可供不同的组织使用,从而促进了不同组织和研究小组之间的数据共享。我们还提供了一个用户友好的本地客户端工具,以维持大数据量测序数据的稳定传输。MPD 是基因组研究中分析和管理的有用工具,特别是对于疾病控制中心和流行病学研究的临床应用,有望为了解致病菌基因组和宏基因组做出贡献。数据库网址:http://data.mypathogen.org。