Katsonis Panagiotis, Lichtarge Olivier
Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA.
Department of Biochemistry & Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA.
Nat Commun. 2025 Jan 2;16(1):159. doi: 10.1038/s41467-024-55066-4.
Computational methods for estimating missense variant impact suffer from inconsistent performance across genes, which poses a major challenge for their reliable use in clinical practice. While ensemble scores leverage multiple prediction methods to enhance consistency, the overrepresentation of certain genes in the training data can bias their outcomes. To address this critical limitation, we propose a gene-specific ensemble framework trained on reference computational annotations rather than on clinical or experimental data. Accordingly, we generate Meta-EA ensemble scores that achieve comparable performance to the top individual predicting method for each gene set. Incorporating the effects of splicing and the allele frequency of human polymorphisms further enhances the performance of Meta-EA, achieving an area under the receiver operating characteristic curve of 0.97 for both gene-balanced and imbalanced clinical assessments. In conclusion, this work leverages the wealth of existing variant impact prediction approaches to generate improved estimations for clinical interpretation.
用于估计错义变体影响的计算方法在不同基因间的性能存在不一致性,这对其在临床实践中的可靠应用构成了重大挑战。虽然综合评分利用多种预测方法来提高一致性,但训练数据中某些基因的过度代表性可能会使其结果产生偏差。为解决这一关键限制,我们提出了一种基于参考计算注释而非临床或实验数据进行训练的基因特异性综合框架。据此,我们生成了Meta-EA综合评分,其性能与每个基因集的顶级单一预测方法相当。纳入剪接效应和人类多态性的等位基因频率进一步提高了Meta-EA的性能,在基因平衡和不平衡的临床评估中,受试者工作特征曲线下面积均达到0.97。总之,这项工作利用了现有的丰富变体影响预测方法,以生成用于临床解释的改进估计。