Derdak Sophia, Sabrautzki Sibylle, de Angelis Martin Hrabě, Gut Marta, Gut Ivo G, Beltran Sergi
Centro Nacional de Análisis Genómico, Parc Científic de Barcelona - Torre I, Baldiri Reixac, 4, 08028, Barcelona, Spain.
Helmholtz Zentrum München, German Research Center for Environmental Health (GmbH), Institute of Experimental Genetics and German Mouse Clinic, Ingolstädter Landstr.1, 85764, Neuherberg, Germany.
BMC Genomics. 2015 May 6;16(1):351. doi: 10.1186/s12864-015-1548-7.
Exome sequencing has become a popular method to evaluate undirected mutagenesis experiments in mice. However, the most suitable mouse strain for the biological model may be relatively distant from the standard mouse reference genome. For pinpointing causative variants, a matching reference with gene annotations is essential, but not always readily available.
We present an approach that allows to use murine Ensembl annotations on alternative mouse strain assemblies. We resolved ENU-induced mutation screening for 8 phenotypic mutant lines generated on C3HeB/FeJ background aligning the sequences against the closely related, but not annotated reference of C3H/HeJ. Variants occurring in all strains were filtered out as specific for the C3HeB/FeJ strain but unrelated to mutagenesis. Variants occurring exclusively in all individuals of one mutant line and matching the inheritance model were selected as mutagenesis-related. These variants were annotated with gene and exon names lifted over from the standard murine reference mm9 to C3H/HeJ using megablast. For each mutant line, we could restrict the results to exonic variants in between 1 and 23 genes.
The presented method of exonic annotation lift-over proved to be a valuable tool in the search for mutagenesis-derived coding genomic variants and the assessment of genotype-phenotype relationships.
外显子组测序已成为评估小鼠无定向诱变实验的常用方法。然而,适用于生物学模型的小鼠品系可能与标准小鼠参考基因组相对较远。为了精确确定致病变体,带有基因注释的匹配参考至关重要,但并非总是容易获得。
我们提出了一种方法,允许在替代小鼠品系组装上使用小鼠Ensembl注释。我们解析了在C3HeB/FeJ背景上产生的8个表型突变系的ENU诱导突变筛选,将序列与密切相关但未注释的C3H/HeJ参考进行比对。在所有品系中出现的变体被过滤掉,因为它们是C3HeB/FeJ品系特有的,但与诱变无关。仅在一个突变系的所有个体中出现且符合遗传模型的变体被选为与诱变相关。这些变体使用megablast从标准小鼠参考mm9提升到C3H/HeJ,并标注了基因和外显子名称。对于每个突变系,我们可以将结果限制在1至23个基因之间的外显子变体。
所提出的外显子注释提升方法被证明是寻找诱变衍生的编码基因组变体以及评估基因型-表型关系的有价值工具。