School of Infection & Immunity, University of Glasgow, UK.
LPHI, CNRS, INSERM, Université de Montpellier, France.
Nucleic Acids Res. 2024 Jul 5;52(W1):W39-W44. doi: 10.1093/nar/gkae378.
As sequencing genomes has become increasingly popular, the need for annotation of the resulting assemblies is growing. Structural and functional annotation is still challenging as it includes finding the correct gene sequences, annotating other elements such as RNA and being able to submit those data to databases to share it with the community. Compared to de novo assembly where contiguous chromosomes are a sign of high quality, it is difficult to visualize and assess the quality of annotation. We developed the Companion web server to allow non-experts to annotate their genome using a reference-based method, enabling them to assess the output before submitting to public databases. In this update paper, we describe how we have included novel methods for gene finding and made the Companion server more efficient for annotation of genomes of up to 1 Gb in size. The reference set was increased to include genomes of interest for human and animal health from the fungi and arthropod kingdoms. We show that Companion outperforms existing comparable tools where closely related references are available.
随着测序基因组变得越来越流行,对生成的组装体进行注释的需求也在不断增长。结构和功能注释仍然具有挑战性,因为它包括找到正确的基因序列、注释 RNA 等其他元素,并能够将这些数据提交到数据库中与社区共享。与从头组装相比,连续染色体是高质量的标志,因此很难可视化和评估注释的质量。我们开发了 Companion 网络服务器,允许非专家使用基于参考的方法对他们的基因组进行注释,使他们能够在提交到公共数据库之前评估输出。在这篇更新论文中,我们描述了如何包含新的基因发现方法,并使 Companion 服务器在注释大小高达 10 亿碱基对的基因组时更加高效。参考集增加了真菌和节肢动物王国中与人类和动物健康相关的基因组。我们表明,在有密切相关参考的情况下,Companion 的性能优于现有的可比工具。