Wattam Alice R, Brettin Thomas, Davis James J, Gerdes Svetlana, Kenyon Ronald, Machi Dustin, Mao Chunhong, Olson Robert, Overbeek Ross, Pusch Gordon D, Shukla Maulik P, Stevens Rick, Vonstein Veronika, Warren Andrew, Xia Fangfang, Yoo Hyunseung
Biocomplexity Institute, Virginia Tech, 1015 Life Science Circle, Blacksburg, VA, 24061, USA.
Computation Institute, University of Chicago, Chicago, IL, USA.
Methods Mol Biol. 2018;1704:79-101. doi: 10.1007/978-1-4939-7463-4_4.
In the "big data" era, research biologists are faced with analyzing new types of data that usually require some level of computational expertise. A number of programs and pipelines exist, but acquiring the expertise to run them, and then understanding the output, can be a challenge. The Pathosystems Resource Integration Center (PATRIC, www.patricbrc.org) has created an end-to-end analysis platform that allows researchers to take their raw reads, assemble a genome, annotate it, and then use a suite of user-friendly tools to compare it to any public data available in the repository. With close to 113,000 bacterial and more than 1000 archaeal genomes, PATRIC creates a unique research experience with "virtual integration" of private and public data. PATRIC contains many diverse tools and functionalities for exploring both genome-scale and gene expression data, but the main focus of this chapter is on assembly, annotation, and the downstream comparative analysis functionality that is freely available in the resource.