Rao Soumya, Sharda Saphy, Oddi Vineesha, Nandineni Madhusudan R
Laboratory of Genomics and Profiling Applications, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India.
Graduate Studies, Manipal Academy of Higher Education, Manipal, India.
Front Microbiol. 2018 Oct 4;9:2367. doi: 10.3389/fmicb.2018.02367. eCollection 2018.
The ascomycete fungus is a major phytopathogen with a broad host range which causes anthracnose disease of chilli. The genome sequencing of this fungus led to the discovery of functional categories of genes that may play important roles in fungal pathogenicity. However, the presence of gaps in draft assembly prevented the accurate prediction of repetitive elements, which are the key players to determine the genome architecture and drive evolution and host adaptation. We re-sequenced its genome using single-molecule real-time (SMRT) sequencing technology to obtain a refined assembly with lesser and smaller gaps and ambiguities. This enabled us to study its genome architecture by characterising the repetitive sequences like transposable elements (TEs) and simple sequence repeats (SSRs), which constituted 4.9 and 0.38% of the assembled genome, respectively. The comparative analysis among different species revealed the extensive repeat rich regions, dominated by Gypsy superfamily of long terminal repeats (LTRs), and the differential composition of SSRs in their genomes. Our study revealed a recent burst of LTR amplification in , , and . TEs in were significantly associated with secretome, effectors and genes in secondary metabolism clusters. Some of the TE families in showed cytosine to thymine transitions indicative of repeat-induced point mutation (RIP). and showed strong signatures of RIP across their genomes and "two-speed" genomes with extensive AT-rich and gene-sparse regions. Comparative genomic analyses of species provided an insight into the species-specific SSR profiles. The SSRs in the coding and non-coding regions of the genome revealed the composition of trinucleotide repeat motifs in exons with potential to alter the translated protein structure through amino acid repeats. This is the first genome-wide study of TEs and SSRs in and their comparative analysis with six other species, which would serve as a useful resource for future research to get insights into the potential role of TEs in genome expansion and evolution of fungi and for development of SSR-based molecular markers for population genomic studies.
子囊菌是一种主要的植物病原体,寄主范围广泛,可引发辣椒炭疽病。该真菌的基因组测序发现了可能在真菌致病性中发挥重要作用的基因功能类别。然而,草图组装中存在的缺口妨碍了对重复元件的准确预测,而重复元件是决定基因组结构、推动进化和宿主适应性的关键因素。我们使用单分子实时(SMRT)测序技术对其基因组进行重新测序,以获得间隙和模糊性更小的精细组装。这使我们能够通过表征转座元件(TEs)和简单序列重复(SSRs)等重复序列来研究其基因组结构,这些重复序列分别占组装基因组的4.9%和0.38%。不同物种间的比较分析揭示了广泛的富含重复序列的区域,以长末端重复序列(LTRs)的吉普赛超家族为主,以及它们基因组中SSRs的差异组成。我们的研究揭示了近期在[具体物种1]、[具体物种2]和[具体物种3]中LTR扩增的爆发。[具体物种]中的TEs与分泌组、效应子和次生代谢簇中的基因显著相关。[具体物种]中的一些TE家族显示出胞嘧啶到胸腺嘧啶的转变,表明存在重复诱导点突变(RIP)。[具体物种1]和[具体物种2]在其基因组中显示出强烈的RIP特征以及具有广泛富含AT和基因稀疏区域的“双速”基因组。对[具体物种]的比较基因组分析提供了对物种特异性SSR图谱的深入了解。基因组编码区和非编码区的SSRs揭示了外显子中三核苷酸重复基序的组成,这些基序有可能通过氨基酸重复改变翻译后的蛋白质结构。这是首次对[具体物种]中的TEs和SSRs进行全基因组研究,并与其他六个[具体物种]进行比较分析,这将为未来研究深入了解TEs在[具体物种]真菌基因组扩展和进化中的潜在作用以及开发用于群体基因组研究的基于SSR的分子标记提供有用资源。