Sigova Elizaveta A, Pushkova Elena N, Rozhmina Tatiana A, Kudryavtseva Ludmila P, Zhuchenko Alexander A, Novakovskiy Roman O, Zhernova Daiana A, Povkhova Liubov V, Turba Anastasia A, Borkhert Elena V, Melnikova Nataliya V, Dmitriev Alexey A, Dvorianinova Ekaterina M
Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia.
Moscow Institute of Physics and Technology, Moscow 141701, Russia.
J Fungi (Basel). 2023 Feb 26;9(3):301. doi: 10.3390/jof9030301.
Flax ( L.) is attacked by numerous devastating fungal pathogens, including , , and (). The effective control of flax diseases follows the paradigm of extensive molecular research on pathogenicity. However, such studies require quality genome sequences of the studied organisms. This article reports on the approaches to assembling a high-quality fungal genome from the Oxford Nanopore Technologies data. We sequenced the genomes of , , and () and received different volumes of sequencing data: 1.7 Gb, 3.9 Gb, and 11.1 Gb, respectively. To obtain the optimal genome sequences, we studied the effect of input data quality and genome coverage on assembly statistics and tested the performance of different assembling and polishing software. For , the most contiguous and complete assembly was obtained by the Flye assembler and the Homopolish polisher. The genome coverage had more effect than data quality on assembly statistics, likely due to the relatively low amount of sequencing data obtained for . The final assembly was 53.4 Mb long and 96.4% complete (according to the glomerellales_odb10 BUSCO dataset), consisted of 42 contigs, and had an N50 of 4.4 Mb. For and (), the best assemblies were produced by Canu-Medaka and Canu-Homopolish, respectively. The final assembly of had a length of 29.5 Mb, 99.4% completeness (dothideomycetes_odb10), an N50 of 2.4 Mb and consisted of 32 contigs. () assembly was 44.1 Mb long, 97.8% complete (hypocreales_odb10), consisted of 54 contigs, and had an N50 of 4.4 Mb. The obtained results can serve as a guideline for assembling a de novo genome of a fungus. In addition, our data can be used in genomic studies of fungal pathogens or plant-pathogen interactions and assist in the management of flax diseases.
亚麻(L.)受到多种毁灭性真菌病原体的侵袭,包括[具体病原体1]、[具体病原体2]和[具体病原体3]([病原体学名])。亚麻病害的有效防治遵循对致病性进行广泛分子研究的模式。然而,此类研究需要所研究生物体的高质量基因组序列。本文报道了从牛津纳米孔技术数据组装高质量真菌基因组的方法。我们对[具体病原体1]、[具体病原体2]和[具体病原体3]([病原体学名])的基因组进行了测序,分别获得了不同量的测序数据:1.7 Gb、3.9 Gb和11.1 Gb。为了获得最佳基因组序列,我们研究了输入数据质量和基因组覆盖度对组装统计的影响,并测试了不同组装和抛光软件的性能。对于[具体病原体1],通过Flye组装器和Homopolish抛光器获得了最连续和完整的组装。基因组覆盖度对组装统计的影响比数据质量更大,这可能是由于为[具体病原体1]获得的测序数据量相对较少。最终组装的基因组长度为53.4 Mb,完整性为96.4%(根据glomerellales_odb10 BUSCO数据集),由42个重叠群组成,N50为4.4 Mb。对于[具体病原体2]和[具体病原体3]([病原体学名]),最佳组装分别由Canu-Medaka和Canu-Homopolish产生。[具体病原体2]的最终组装基因组长度为29.5 Mb,完整性为99.4%(dothideomycetes_odb10),N50为2.4 Mb,由32个重叠群组成。[具体病原体3]([病原体学名])的组装基因组长度为44.1 Mb,完整性为97.8%(hypocreales_odb10),由54个重叠群组成,N50为4.4 Mb。获得的结果可作为真菌从头基因组组装的指导。此外,我们的数据可用于真菌病原体或植物-病原体相互作用的基因组研究,并有助于亚麻病害的管理。