Hane James K, Oliver Richard P
Australian Centre for Necrotrophic Fungal Pathogens, Faculty of Health Sciences, Murdoch University, Murdoch, Australia.
BMC Bioinformatics. 2008 Nov 12;9:478. doi: 10.1186/1471-2105-9-478.
Repeat-induced point mutation (RIP) is a fungal-specific genome defence mechanism that alters the sequences of repetitive DNA, thereby inactivating coding genes. Repeated DNA sequences align between mating and meiosis and both sequences undergo C:G to T:A transitions. In most fungi these transitions preferentially affect CpA di-nucleotides thus altering the frequency of certain di-nucleotides in the affected sequences. The majority of previously published in silico analyses were limited to the comparison of ratios of pre- and post-RIP di-nucleotides in putatively RIP-affected sequences - so-called RIP indices. The analysis of RIP is significantly more informative when comparing sequence alignments of repeated sequences. There is, however, a dearth of bioinformatics tools available to the fungal research community for alignment-based RIP analysis of repeat families.
We present RIPCAL http://www.sourceforge.net/projects/ripcal, a software tool for the automated analysis of RIP in fungal genomic DNA repeats, which performs both RIP index and alignment-based analyses. We demonstrate the ability of RIPCAL to detect RIP within known RIP-affected sequences of Neurospora crassa and other fungi. We also predict and delineate the presence of RIP in the genome of Stagonospora nodorum - a Dothideomycete pathogen of wheat. We show that RIP has affected different members of the S. nodorum rDNA tandem repeat to different extents depending on their genomic contexts.
The RIPCAL alignment-based method has considerable advantages over RIP indices for the analysis of whole genomes. We demonstrate its application to the recently published genome assembly of S. nodorum.
重复诱导点突变(RIP)是一种真菌特有的基因组防御机制,它会改变重复DNA的序列,从而使编码基因失活。重复的DNA序列在交配和减数分裂过程中会进行比对,并且两个序列都会发生C:G到T:A的转变。在大多数真菌中,这些转变优先影响CpA二核苷酸,从而改变受影响序列中某些二核苷酸的频率。此前大多数已发表的计算机分析仅限于比较假定受RIP影响的序列中RIP前后二核苷酸的比例——即所谓的RIP指数。在比较重复序列的序列比对时,对RIP的分析会更具信息量。然而,真菌研究群体可用的生物信息学工具匮乏,难以对重复家族进行基于比对的RIP分析。
基于比对的RIPCAL方法在全基因组分析方面比RIP指数具有相当大的优势。我们展示了它在最近发表的小麦壳针孢基因组组装中的应用。