Hamlin Jennafer A P, Dias Guilherme B, Bergman Casey M, Bensasson Douda
Department of Plant Biology and Institute of Bioinformatics, University of Georgia, Athens, GA, and.
Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA, United States of America.
G3 (Bethesda). 2019 Nov 5;9(11):3547-3554. doi: 10.1534/g3.119.400486.
Although normally a harmless commensal, , it is also one of the most common causes of bloodstream infections in the U.S. has long been considered an obligate commensal, however, recent studies suggest it can live outside animal hosts. Here, we have generated PacBio sequences and phased genome assemblies for three strains from oak trees (NCYC 4144, NCYC 4145, and NCYC 4146). PacBio datasets are high depth (over 400 fold coverage) and more than half of the sequencing data are contained in reads longer than 15 kb. Primary assemblies showed high contiguity with several chromosomes for each strain recovered as single contigs, and greater than half of the alternative haplotype sequence was assembled in haplotigs at least 174 kb long. Using these assemblies we were able to identify structural polymorphisms, including a polymorphic inversion over 100 kb in length. These results show that phased diploid assemblies for can enable the study of genomic variation within and among strains of an important fungal pathogen.
虽然通常是无害的共生菌,但它也是美国血流感染最常见的病因之一。长期以来,它一直被认为是专性共生菌,然而,最近的研究表明它可以在动物宿主之外生存。在这里,我们为来自橡树的三株菌株(NCYC 4144、NCYC 4145和NCYC 4146)生成了PacBio序列和分阶段的基因组组装。PacBio数据集具有高深度(超过400倍覆盖),超过一半的测序数据包含在长度超过15 kb的读段中。初步组装显示每个菌株的几个染色体具有高连续性,作为单个重叠群回收,并且超过一半的替代单倍型序列组装在长度至少为174 kb的单倍型重叠群中。利用这些组装,我们能够识别结构多态性,包括长度超过100 kb的多态性倒位。这些结果表明,分阶段的二倍体组装能够用于研究一种重要真菌病原体菌株内部和之间的基因组变异。