Kumaran Manojkumar, Devarajan Bharanidharan
Department of Bioinformatics, Aravind Medical Research Foundation, Madurai, Tamil Nadu, India; School of Chemical and Biotechnology, SASTRA (Deemed to be a university), Thanjavur, Tamil Nadu, India.
Department of Bioinformatics, Aravind Medical Research Foundation, Madurai, Tamil Nadu, India.
Genet Med. 2023 Jul;25(7):100862. doi: 10.1016/j.gim.2023.100862. Epub 2023 Apr 21.
Disease-specific pathogenic variant prediction tools that differentiate pathogenic variants from benign have been improved through disease specificity recently. However, they have not been evaluated on disease-specific pathogenic variants compared with other diseases, which would help to prioritize disease-specific variants from several genes or novel genes. Thus, we hypothesize that features of pathogenic variants alone would provide a better model.
We developed an eye disease-specific variant prioritization tool (eyeVarP), which applied the random forest algorithm to the data set of pathogenic variants of eye diseases and other diseases. We also developed the VarP tool and generalized pipeline to filter missense and insertion-deletion variants and predict their pathogenicity from exome or genome sequencing data, thus we provide a complete computational procedure.
eyeVarP outperformed pan disease-specific tools in identifying eye disease-specific pathogenic variants under the top 10. VarP outperformed 12 pathogenicity prediction tools with an accuracy of 95% in correctly identifying the pathogenicity of missense and insertion-deletion variants. The complete pipeline would help to develop disease-specific tools for other genetic disorders.
eyeVarP performs better in identifying eye disease-specific pathogenic variants using pathogenic variant features and gene features. Implementing such complete computational procedure would significantly improve the clinical variant interpretation for specific diseases.
区分致病性变异与良性变异的疾病特异性致病变异预测工具最近通过疾病特异性得到了改进。然而,与其他疾病相比,它们尚未针对疾病特异性致病变异进行评估,而这将有助于从多个基因或新基因中对疾病特异性变异进行优先级排序。因此,我们假设仅致病性变异的特征就能提供更好的模型。
我们开发了一种眼部疾病特异性变异优先级排序工具(eyeVarP),该工具将随机森林算法应用于眼部疾病和其他疾病的致病变异数据集。我们还开发了VarP工具和通用流程,以筛选错义变异和插入缺失变异,并从外显子组或基因组测序数据预测其致病性,从而提供了一个完整的计算程序。
在识别排名前10的眼部疾病特异性致病变异方面,eyeVarP优于泛疾病特异性工具。VarP在正确识别错义变异和插入缺失变异的致病性方面表现优于12种致病性预测工具,准确率达95%。完整的流程将有助于为其他遗传疾病开发疾病特异性工具。
eyeVarP在利用致病变异特征和基因特征识别眼部疾病特异性致病变异方面表现更佳。实施这样完整的计算程序将显著改善特定疾病的临床变异解读。