Warrenfeltz Susanne, Kissinger Jessica C
Center for Tropical and Emerging Global Diseases, University of Georgia, Athens, GA, USA.
Institute of Bioinformatics, University of Georgia, Athens, GA, USA.
Methods Mol Biol. 2020;2052:139-192. doi: 10.1007/978-1-4939-9748-0_10.
Cryptosporidium has historically been a difficult organism to work with, and molecular genomic data for this important pathogen have typically lagged behind those for other prominent protist pathogens. CryptoDB ( http://cryptodb.org/ ) was launched in 2004 following the appearance of draft genome sequences for both C. parvum and C. hominis. CryptoDB merged with the EuPathDB Bioinformatics Resource Center family of databases ( https://eupathdb.org ) and has been maintained and updated regularly since its establishment. These resources are freely available and web-based, and they permit users to analyze their own sequence data in the context of reference genome sequences in our user workspaces. Advances in technology have greatly facilitated Cryptosporidium research in the last several years, enhancing and extending the data and types of data available for this genus. Currently, 13 genome sequences are available for 9 species of Cryptosporidium, along with genome sequences for the distantly related Gregarina niphandrodes and for two free-living alveolate outgroups of the Apicomplexa, Chromera velia and Vitrella brassicaformis. Recent years have seen several new genome sequences for both existing and new Cryptosporidium species, as well as transcriptomic, proteomic, SNP, and isolate population survey data. This chapter introduces the extensive data mining and visualization capabilities of the EuPathDB software platform and describes the data types and tools that are currently available for Cryptosporidium. Key features are demonstrated with Cryptosporidium-relevant examples and explanations.