Department of Neurobiology and Anatomy, University of Utah Molecular Medicine Program, University of Utah, Salt Lake City, Utah 84112, USA.
Genome Res. 2013 Apr;23(4):687-97. doi: 10.1101/gr.146936.112. Epub 2013 Jan 8.
Forward genetic screens in model organisms are vital for identifying novel genes essential for developmental or disease processes. One drawback of these screens is the labor-intensive and sometimes inconclusive process of mapping the causative mutation. To leverage high-throughput techniques to improve this mapping process, we have developed a Mutation Mapping Analysis Pipeline for Pooled RNA-seq (MMAPPR) that works without parental strain information or requiring a preexisting SNP map of the organism, and adapts to differential recombination frequencies across the genome. MMAPPR accommodates the considerable amount of noise in RNA-seq data sets, calculates allelic frequency by Euclidean distance followed by Loess regression analysis, identifies the region where the mutation lies, and generates a list of putative coding region mutations in the linked genomic segment. MMAPPR can exploit RNA-seq data sets from isolated tissues or whole organisms that are used for gene expression and transcriptome analysis in novel mutants. We tested MMAPPR on two known mutant lines in zebrafish, nkx2.5 and tbx1, and used it to map two novel ENU-induced cardiovascular mutants, with mutations found in the ctr9 and cds2 genes. MMAPPR can be directly applied to other model organisms, such as Drosophila and Caenorhabditis elegans, that are amenable to both forward genetic screens and pooled RNA-seq experiments. Thus, MMAPPR is a rapid, cost-efficient, and highly automated pipeline, available to perform mutant mapping in any organism with a well-assembled genome.
正向遗传学筛选在模式生物中对于鉴定对发育或疾病过程至关重要的新基因至关重要。这些筛选的一个缺点是映射致病突变的劳动密集型和有时不确定的过程。为了利用高通量技术来改进该映射过程,我们开发了一种用于 Pooled RNA-seq 的突变映射分析管道(MMAPPR),它不需要亲本菌株信息或需要生物体的预先存在的 SNP 图谱,并且适应整个基因组的差异重组频率。MMAPPR 适应 RNA-seq 数据集的大量噪声,通过欧几里得距离计算等位基因频率,然后进行 Loess 回归分析,确定突变所在的区域,并生成链接基因组片段中假定的编码区突变列表。MMAPPR 可以利用用于新型突变体基因表达和转录组分析的分离组织或整个生物体的 RNA-seq 数据集。我们在斑马鱼中的两个已知突变体线 nkx2.5 和 tbx1 上测试了 MMAPPR,并使用它来映射两个新的 ENU 诱导的心血管突变体,其突变位于 ctr9 和 cds2 基因中。MMAPPR 可以直接应用于其他模式生物,如果蝇和秀丽隐杆线虫,它们既适用于正向遗传学筛选,也适用于 pooled RNA-seq 实验。因此,MMAPPR 是一种快速、具有成本效益的高度自动化管道,可用于任何具有良好组装基因组的生物体中的突变体映射。