Forest Products, Mississippi State University, Mississippi State, Mississippi, USA.
Appl Environ Microbiol. 2012 Apr;78(7):2272-81. doi: 10.1128/AEM.06745-11. Epub 2012 Jan 13.
The feasibility of short-read sequencing for genomic analysis was demonstrated for Fibroporia radiculosa, a copper-tolerant fungus that causes brown rot decay of wood. The effect of read quality on genomic assembly was assessed by filtering Illumina GAIIx reads from a single run of a paired-end library (75-nucleotide read length and 300-bp fragment size) at three different stringency levels and then assembling each data set with Velvet. A simple approach was devised to determine which filter stringency was "best." Venn diagrams identified the regions containing reads that were used in an assembly but were of a low-enough quality to be removed by a filter. By plotting base quality histograms of reads in this region, we judged whether a filter was too stringent or not stringent enough. Our best assembly had a genome size of 33.6 Mb, an N50 of 65.8 kb for a k-mer of 51, and a maximum contig length of 347 kb. Using GeneMark, 9,262 genes were predicted. TargetP and SignalP analyses showed that among the 1,213 genes with secreted products, 986 had motifs for signal peptides and 227 had motifs for signal anchors. Blast2GO analysis provided functional annotation for 5,407 genes. We identified 29 genes with putative roles in copper tolerance and 73 genes for lignocellulose degradation. A search for homologs of these 102 genes showed that F. radiculosa exhibited more similarity to Postia placenta than Serpula lacrymans. Notable differences were found, however, and their involvements in copper tolerance and wood decay are discussed.
短读测序技术在基因组分析中的可行性已在铜耐受真菌纤孔菌(Fibroporia radiculosa)中得到验证,该真菌可导致木材的褐腐腐朽。通过在三个不同严格程度下过滤 Illumina GAIIx 来自单个配对末端文库运行(75 个核苷酸读长和 300 个碱基片段大小)的读取,并使用 Velvet 组装每个数据集,评估了读取质量对基因组组装的影响。设计了一种简单的方法来确定哪种过滤严格程度“最佳”。Venn 图确定了包含在组装中但质量太低而被过滤器去除的读取的区域。通过绘制该区域中读取的碱基质量直方图,我们判断过滤器是否过于严格或不够严格。我们的最佳组装基因组大小为 33.6Mb,k-mer 为 51 的 N50 为 65.8kb,最大连续体长度为 347kb。使用 GeneMark,预测了 9,262 个基因。TargetP 和 SignalP 分析表明,在具有分泌产物的 1,213 个基因中,986 个基因具有信号肽的基序,227 个基因具有信号锚的基序。Blast2GO 分析为 5,407 个基因提供了功能注释。我们鉴定了 29 个具有铜耐受作用的假定基因和 73 个木质纤维素降解基因。对这些 102 个基因的同源物进行搜索表明,纤孔菌与胎盘 Postia 比皱孔菌 Serpula lacrymans 更相似。然而,发现了明显的差异,并讨论了它们在铜耐受和木材腐朽中的作用。