Shovlin Claire L, Aldred Micheala A
National Heart and Lung Institute, Imperial College London, London, UK.
Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, USA.
Am J Hum Genet. 2025 Jan 2;112(1):3-10. doi: 10.1016/j.ajhg.2024.12.002.
Each human genome has approximately 5 million DNA variants. Even for complete loss-of-function variants causing inherited, monogenic diseases, current understanding based on gene-specific molecular function does not adequately predict variability observed between people with identical mutations or fluctuating disease trajectories. We present a parallel paradigm for loss-of-function variants based on broader consequences to the cell when aberrant polypeptide chains of amino acids are translated from mutant RNA to generate mutated proteins. Missense variants that modify primary amino acid sequence, and nonsense/frameshift variants that generate premature termination codons (PTCs), are placed in context alongside emergent themes of chaperone binding, protein quality control capacity, and cellular adaptation to stress. Relatively stable proteostasis burdens are contrasted with rapid changes after induction of gene expression, or stress responses that suppress nonsense mediated decay (NMD) leading to higher PTC transcript levels where mutant proteins can augment cellular stress. For known disease-causal mutations, an adjunctive variant categorization system enhances clinical predictive power and precision therapeutic opportunities. Additionally, with typically more than 100 nonsense and frameshift variants, and ∼10,000 missense variants per human DNA, the paradigm focuses attention on all protein-coding DNA variants, and their potential contributions to multimorbid states beyond classically designated inherited diseases. Experimental testing in clinically relevant systems is encouraged to augment current atlases of protein expression at single-cell resolution, and high-throughput experimental data and deep-learning models that predict which amino acid substitutions generate enhanced degradative burdens. Incorporating additional dimensions such as pan-proteome competition for chaperones, and age-related loss of proteostasis capacity, should further accelerate health impacts.
每个人类基因组大约有500万个DNA变异。即使对于导致遗传性单基因疾病的完全功能丧失变异,目前基于基因特异性分子功能的理解也不足以预测具有相同突变的个体之间观察到的变异性或波动的疾病轨迹。我们基于从突变RNA翻译异常氨基酸多肽链以产生突变蛋白时对细胞产生的更广泛影响,提出了一种针对功能丧失变异的平行范式。修饰一级氨基酸序列的错义变异,以及产生过早终止密码子(PTC)的无义/移码变异,与伴侣蛋白结合、蛋白质质量控制能力和细胞对压力的适应性等新出现的主题一起被置于背景中。相对稳定的蛋白质稳态负担与基因表达诱导后的快速变化形成对比,或者与抑制无义介导的衰变(NMD)导致更高PTC转录水平的应激反应形成对比,在这种情况下突变蛋白会加剧细胞应激。对于已知的致病突变,一种辅助性变异分类系统增强了临床预测能力和精准治疗机会。此外,每个人类DNA通常有超过100个无义变异和移码变异,以及约10000个错义变异,该范式将注意力集中在所有蛋白质编码DNA变异及其对除经典指定的遗传性疾病之外的多种疾病状态的潜在贡献上。鼓励在临床相关系统中进行实验测试,以增强当前单细胞分辨率下的蛋白质表达图谱,以及预测哪些氨基酸替代会产生增强的降解负担的高通量实验数据和深度学习模型。纳入诸如伴侣蛋白的全蛋白质组竞争以及与年龄相关的蛋白质稳态能力丧失等额外维度,应该会进一步加速对健康的影响。