Department of Biological Sciences, University of Maryland, Baltimore, Maryland.
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas.
Hum Mutat. 2019 Sep;40(9):1612-1622. doi: 10.1002/humu.23849. Epub 2019 Aug 17.
The availability of disease-specific genomic data is critical for developing new computational methods that predict the pathogenicity of human variants and advance the field of precision medicine. However, the lack of gold standards to properly train and benchmark such methods is one of the greatest challenges in the field. In response to this challenge, the scientific community is invited to participate in the Critical Assessment for Genome Interpretation (CAGI), where unpublished disease variants are available for classification by in silico methods. As part of the CAGI-5 challenge, we evaluated the performance of 18 submissions and three additional methods in predicting the pathogenicity of single nucleotide variants (SNVs) in checkpoint kinase 2 (CHEK2) for cases of breast cancer in Hispanic females. As part of the assessment, the efficacy of the analysis method and the setup of the challenge were also considered. The results indicated that though the challenge could benefit from additional participant data, the combined generalized linear model analysis and odds of pathogenicity analysis provided a framework to evaluate the methods submitted for SNV pathogenicity identification and for comparison to other available methods. The outcome of this challenge and the approaches used can help guide further advancements in identifying SNV-disease relationships.
疾病特异性基因组数据的可用性对于开发新的计算方法至关重要,这些方法可以预测人类变异的致病性,并推进精准医学领域的发展。然而,缺乏适当的训练和基准测试方法的黄金标准是该领域面临的最大挑战之一。为了应对这一挑战,科学界受邀参加基因组解读关键评估(Critical Assessment for Genome Interpretation,CAGI),其中未公开的疾病变体可通过计算方法进行分类。作为 CAGI-5 挑战赛的一部分,我们评估了 18 项提交内容和另外 3 种方法在预测乳腺癌中 Hispanic 女性的 CHK2 单核苷酸变异(Single Nucleotide Variant,SNV)致病性方面的性能。作为评估的一部分,还考虑了分析方法的功效和挑战的设置。结果表明,尽管该挑战可能受益于更多参与者的数据,但综合广义线性模型分析和致病性可能性分析为评估提交的 SNV 致病性识别方法以及与其他可用方法进行比较提供了框架。本挑战赛的结果和所采用的方法可以帮助指导进一步确定 SNV-疾病关系。