Division of Computational Biology School of Life Sciences University of Dundee, Dow Street Dundee, DD1 5EH, Scotland, UK.
European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
Commun Biol. 2024 Apr 11;7(1):447. doi: 10.1038/s42003-024-06117-5.
Protein evolution is constrained by structure and function, creating patterns in residue conservation that are routinely exploited to predict structure and other features. Similar constraints should affect variation across individuals, but it is only with the growth of human population sequencing that this has been tested at scale. Now, human population constraint has established applications in pathogenicity prediction, but it has not yet been explored for structural inference. Here, we map 2.4 million population variants to 5885 protein families and quantify residue-level constraint with a new Missense Enrichment Score (MES). Analysis of 61,214 structures from the PDB spanning 3661 families shows that missense depleted sites are enriched in buried residues or those involved in small-molecule or protein binding. MES is complementary to evolutionary conservation and a combined analysis allows a new classification of residues according to a conservation plane. This approach finds functional residues that are evolutionarily diverse, which can be related to specificity, as well as family-wide conserved sites that are critical for folding or function. We also find a possible contrast between lethal and non-lethal pathogenic sites, and a surprising clinical variant hot spot at a subset of missense enriched positions.
蛋白质进化受到结构和功能的限制,在残基保守性中产生了模式,这些模式通常被用来预测结构和其他特征。类似的限制也应该影响个体之间的变异,但只有随着人类群体测序的增长,这种情况才在大规模上得到了检验。现在,人群约束已经在致病性预测中得到了应用,但尚未在结构推断中得到探索。在这里,我们将 240 万个群体变异映射到 5885 个蛋白质家族,并使用新的错义富集评分(MES)量化残基水平的约束。对来自 PDB 的 61214 个结构进行分析,涵盖 3661 个家族,结果表明,错义缺失的位点富集在埋入残基或参与小分子或蛋白质结合的残基中。MES 与进化保守性互补,综合分析可以根据保守平面对残基进行新的分类。这种方法可以找到进化上多样化的功能残基,这些残基与特异性有关,同时也可以找到对折叠或功能至关重要的全家族保守位点。我们还发现了致死和非致死致病性位点之间的一个可能的对比,以及在错义富集位置的一个子集上的一个令人惊讶的临床变异热点。