Seppey Mathieu, Manni Mosè, Zdobnov Evgeny M
Department of Genetic Medicine and Development, Swiss Institute of Bioinformatics, University of Geneva Medical School, Geneva, Switzerland.
Methods Mol Biol. 2019;1962:227-245. doi: 10.1007/978-1-4939-9173-0_14.
Genomics drives the current progress in molecular biology, generating unprecedented volumes of data. The scientific value of these sequences depends on the ability to evaluate their completeness using a biologically meaningful approach. Here, we describe the use of the BUSCO tool suite to assess the completeness of genomes, gene sets, and transcriptomes, using their gene content as a complementary method to common technical metrics. The chapter introduces the concept of universal single-copy genes, which underlies the BUSCO methodology, covers the basic requirements to set up the tool, and provides guidelines to properly design the analyses, run the assessments, and interpret and utilize the results.
基因组学推动了当前分子生物学的进展,产生了前所未有的大量数据。这些序列的科学价值取决于使用具有生物学意义的方法评估其完整性的能力。在这里,我们描述了使用BUSCO工具套件来评估基因组、基因集和转录组的完整性,将其基因含量作为常用技术指标的补充方法。本章介绍了通用单拷贝基因的概念,这是BUSCO方法的基础,涵盖了设置该工具的基本要求,并提供了正确设计分析、运行评估以及解释和利用结果的指南。