Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland.
Department of Informatics, Technical University of Munich, Garching, Germany.
Hum Mutat. 2019 Sep;40(9):1215-1224. doi: 10.1002/humu.23869. Epub 2019 Aug 19.
Precision medicine and sequence-based clinical diagnostics seek to predict disease risk or to identify causative variants from sequencing data. The Critical Assessment of Genome Interpretation (CAGI) is a community experiment consisting of genotype-phenotype prediction challenges; participants build models, undergo assessment, and share key findings. In the past, few CAGI challenges have addressed the impact of sequence variants on splicing. In CAGI5, two challenges (Vex-seq and MaPSY) involved prediction of the effect of variants, primarily single-nucleotide changes, on splicing. Although there are significant differences between these two challenges, both involved prediction of results from high-throughput exon inclusion assays. Here, we discuss the methods used to predict the impact of these variants on splicing, their performance, strengths, and weaknesses, and prospects for predicting the impact of sequence variation on splicing and disease phenotypes.
精准医学和基于序列的临床诊断旨在预测疾病风险,或从测序数据中识别致病变异。基因组解读的关键评估(Critical Assessment of Genome Interpretation,CAGI)是一个由基因型-表型预测挑战组成的社区实验;参与者构建模型,进行评估,并分享关键发现。过去,CAGI 挑战赛很少涉及序列变异对剪接的影响。在 CAGI5 中,有两个挑战(Vex-seq 和 MaPSY)涉及预测变异,主要是单核苷酸变化,对剪接的影响。尽管这两个挑战之间存在显著差异,但它们都涉及预测高通量外显子包含测定的结果。在这里,我们讨论了用于预测这些变异对剪接影响的方法、它们的性能、优势和劣势,以及预测序列变异对剪接和疾病表型影响的前景。