Dominguez Del Angel Victoria, Hjerde Erik, Sterck Lieven, Capella-Gutierrez Salvadors, Notredame Cederic, Vinnere Pettersson Olga, Amselem Joelle, Bouri Laurent, Bocs Stephanie, Klopp Christophe, Gibrat Jean-Francois, Vlasova Anna, Leskosek Brane L, Soler Lucile, Binzer-Panchal Mahesh, Lantz Henrik
Institut Français de Bioinformatique, UMS3601-CNRS, Université Paris-Saclay, Orsay, 91403, France.
Department of Chemistry, Norstruct, UiT The Arctic University of Norway, Tromsø, 9019, Norway.
F1000Res. 2018 Feb 5;7. doi: 10.12688/f1000research.13598.1. eCollection 2018.
As part of the ELIXIR-EXCELERATE efforts in capacity building, we present here 10 steps to help researchers get started in genome assembly and genome annotation. The guidelines given are broadly applicable, intended to be stable over time, and cover all aspects of a general assembly and annotation project from start to finish. Intrinsic properties of genomes are discussed, as is the importance of using high-quality DNA. Different sequencing technologies and generally applicable workflows for genome assembly are also detailed. We cover structural and functional annotation and encourage readers to also annotate transposable elements, something that is often omitted from annotation workflows. The importance of data management is stressed, and we give advice on where to submit data and how to make your results Findable, Accessible, Interoperable, and Reusable (FAIR).