Pillai Joshua, Liu Sophia, Sung Kijung, Shi Linda, Wu Chengbiao
School of Biological Sciences, University of California, 9301 S Scholars Dr, La Jolla, San Diego, CA, 92093, USA.
Department of Neurosciences, University of California San Diego, Medical Teaching Facility, 9500 Gilman Drive, La Jolla, CA, 92093-0624, USA.
Biochem Biophys Rep. 2025 May 16;42:102049. doi: 10.1016/j.bbrep.2025.102049. eCollection 2025 Jun.
Alzheimer's disease (AD) is a progressive neurodegenerative disease characterized by progressive cognitive decline. Over 200 pathogenic mutations in , (), and (), have been implicated in AD. Yet, many rare and common variants have not been completely classified as protective or benign, risk-modifiers, or pathogenic, which is important for research on the disease mechanisms and discovery of treatment methods. The majority of these variants are missense mutations, and there is an active need for computational approaches to accurately predict their molecular consequences. AlphaMissense (AM) is a novel technology that uses population frequency data along with structural and sequential contexts from AlphaFold to predict the pathogenicity of missense mutations. Herein, we sought to evaluate the capabilities of AM on 114 variants of unknown significance (VUS), including 56 missense variants of , 25 of , and 33 of by benchmarking its prediction against their respective Aβ isoform levels , respectively. We found that the AM scores correlated moderately well with the critical Aβ42/Aβ40 biomarker and Aβ40 levels in the transmembrane proteins compared to weaker correlations in traditional approaches, including Combined Annotation Dependent Depletion (CADD) v1.7, evolutionary model of variant effect (EVE), and Evolutionary Scale Modeling-1b (ESM-1B). Yet, there were non-significant correlations identified with Aβ42 levels in all models. Furthermore, we found that AM does not rely completely on structural contexts from AlphaFold2, as it accurately predicted the effects of known variants on residues with a low predicted local distance difference test (pLDDT) score. Additionally, based on the receiver operating characteristic-area under the curve analysis (ROC-AUC), we found that AM retained a high performance on 263 validated variants of these amyloidogenic genes, and performed the greatest compared to other models for the 114 VUS. We believe this is the first study to provide comprehensive characterization and validation of AM in comparison to the widely utilized pathogenicity scoring models for VUS involved in proteins implicated in AD.
阿尔茨海默病(AD)是一种以进行性认知衰退为特征的进行性神经退行性疾病。淀粉样前体蛋白(APP)、早老素1(PS1)和早老素2(PS2)中的200多种致病突变与AD有关。然而,许多罕见和常见变异尚未被完全归类为保护性或良性、风险修饰因子或致病性变异,这对疾病机制研究和治疗方法的发现很重要。这些变异中的大多数是错义突变,因此迫切需要计算方法来准确预测其分子后果。AlphaMissense(AM)是一种新技术,它利用群体频率数据以及来自AlphaFold的结构和序列背景来预测错义突变的致病性。在此,我们试图通过将AM的预测结果与各自的Aβ同工型水平进行基准比较,来评估AM对114个意义未明变异(VUS)的能力,其中包括56个APP错义变异、25个PS1错义变异和33个PS2错义变异。我们发现,与传统方法(包括联合注释依赖缺失(CADD)v1.7、变异效应进化模型(EVE)和进化尺度建模-1b(ESM-1B))中较弱的相关性相比,AM评分与关键的Aβ42/Aβ40生物标志物以及跨膜蛋白中的Aβ40水平具有中等程度的良好相关性。然而,在所有模型中,与Aβ42水平的相关性均无统计学意义。此外,我们发现AM并不完全依赖于AlphaFold2的结构背景,因为它能准确预测已知变异对预测局部距离差异测试(pLDDT)分数较低的残基的影响。此外,基于曲线下面积分析的受试者工作特征(ROC-AUC),我们发现AM在这些淀粉样蛋白生成基因的263个验证变异上保持了高性能,并且在114个VUS中与其他模型相比表现最佳。我们相信,这是第一项与广泛使用的涉及AD相关蛋白的VUS致病性评分模型相比,对AM进行全面表征和验证的研究。