• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于罕见变异关联研究的统一元回归模型。

Unified meta regression models for rare variant association studies.

作者信息

Lauer Larissa, Rivas Manuel A

机构信息

Department of Statistics, Stanford, CA, USA, 94305.

Department of Biomedical Data Science, Stanford University, Stanford, CA, USA 94305.

出版信息

bioRxiv. 2025 Jan 24:2025.01.23.634522. doi: 10.1101/2025.01.23.634522.

DOI:10.1101/2025.01.23.634522
PMID:39896616
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11785203/
Abstract

Rare variant association studies (RVAS) of complex traits have emerged as a powerful approach to advance drug discovery and diagnostics. Missense pathogenicity predictions from AlphaMissense based on structural context and protein language models improve the differentiation between benign and deleterious variants. Constraint metrics, on the other hand, allow researchers to pinpoint genomic regions under selective pressure that may not directly impact protein structure, but are more likely to contain functionally important mutations. Loss-of-function (LoF) variants, which result in the complete or partial loss of protein function, are particularly informative, as it is more straightforward to assess their downstream functional consequences. In this study, we present a unified meta regression model approach that incorporates the probability of pathogenicity, probability of constraint, and indicator whether a variant is a predicted loss-of-function or missense variant as features to model the observed effect size and uncertainty of effect size obtained from single-variant genetic analysis. We applied the unified meta regression model to 1,144 continuous phenotypes from UK Biobank using single variant summary statistics obtained from Genebass. We replicated our findings using the AllofUS cohort. For each gene discovery, we make available a characterization of whether constrained sites are associated with the phenotype, whether pathogenic sites determined by structural based predictions are associated with phenotype, and whether broader loss-of-function or missense variant annotation better explains the summary statistics observed. Our results are publicly available at Global Biobank Engine (https://biobankengine.shinyapps.io/phenome-wide-unified-model/).

摘要

复杂性状的罕见变异关联研究(RVAS)已成为推动药物发现和诊断的有力方法。基于结构背景和蛋白质语言模型的AlphaMissense对错义致病性的预测,改善了良性和有害变异之间的区分。另一方面,约束指标使研究人员能够确定处于选择压力下的基因组区域,这些区域可能不会直接影响蛋白质结构,但更有可能包含功能重要的突变。功能丧失(LoF)变异会导致蛋白质功能完全或部分丧失,因其下游功能后果更易于评估,所以特别具有信息量。在本研究中,我们提出了一种统一的元回归模型方法,该方法将致病性概率、约束概率以及一个变异是预测的功能丧失变异还是错义变异的指标作为特征,来对从单变异基因分析中获得的观察效应大小和效应大小的不确定性进行建模。我们使用从Genebass获得的单变异汇总统计数据,将统一的元回归模型应用于英国生物银行的1144个连续性状。我们使用AllofUS队列重复了我们的发现。对于每个基因发现,我们提供了关于受约束位点是否与该性状相关、基于结构预测确定的致病位点是否与性状相关,以及更广泛的功能丧失或错义变异注释是否能更好地解释观察到的汇总统计数据的特征描述。我们的结果可在全球生物银行引擎(https://biobankengine.shinyapps.io/phenome-wide-unified-model/)上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11785203/2725abde7d4d/nihpp-2025.01.23.634522v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11785203/a54669e4a438/nihpp-2025.01.23.634522v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11785203/731392d707cb/nihpp-2025.01.23.634522v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11785203/460211899558/nihpp-2025.01.23.634522v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11785203/2725abde7d4d/nihpp-2025.01.23.634522v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11785203/a54669e4a438/nihpp-2025.01.23.634522v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11785203/731392d707cb/nihpp-2025.01.23.634522v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11785203/460211899558/nihpp-2025.01.23.634522v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11785203/2725abde7d4d/nihpp-2025.01.23.634522v1-f0004.jpg

相似文献

1
Unified meta regression models for rare variant association studies.用于罕见变异关联研究的统一元回归模型。
bioRxiv. 2025 Jan 24:2025.01.23.634522. doi: 10.1101/2025.01.23.634522.
2
The performance of AlphaMissense to identify genes influencing disease.AlphaMissense 识别影响疾病的基因的性能。
HGG Adv. 2024 Oct 10;5(4):100344. doi: 10.1016/j.xhgg.2024.100344. Epub 2024 Aug 22.
3
Performance of a Protein Language Model for Variant Annotation in Cardiac Disease.蛋白质语言模型在心脏疾病变异注释中的性能。
J Am Heart Assoc. 2024 Oct 15;13(20):e036921. doi: 10.1161/JAHA.124.036921. Epub 2024 Oct 11.
4
Genetic constraint at single amino acid resolution in protein domains improves missense variant prioritisation and gene discovery.在蛋白质结构域中单氨基酸分辨率的遗传限制可提高错义变异体的优先级排序和基因发现。
Genome Med. 2024 Jul 11;16(1):88. doi: 10.1186/s13073-024-01358-9.
5
Genetic variant classification by predicted protein structure: A case study on IRF6.基于预测蛋白质结构的基因变异分类:以IRF6为例的案例研究
Comput Struct Biotechnol J. 2024 Feb 3;23:892-904. doi: 10.1016/j.csbj.2024.01.019. eCollection 2024 Dec.
6
Genome-wide prediction of pathogenic gain- and loss-of-function variants from ensemble learning of a diverse feature set.从多样化特征集的集成学习中预测全基因组致病性功能获得和功能丧失变异。
Genome Med. 2023 Nov 30;15(1):103. doi: 10.1186/s13073-023-01261-9.
7
Evaluating the use of paralogous protein domains to increase data availability for missense variant classification.评估利用旁系同源蛋白结构域增加错义变异分类数据可用性。
Genome Med. 2023 Dec 12;15(1):110. doi: 10.1186/s13073-023-01264-6.
8
An allelic-series rare-variant association test for candidate-gene discovery.候选基因发现的等位基因系列罕见变异关联测试。
Am J Hum Genet. 2023 Aug 3;110(8):1330-1342. doi: 10.1016/j.ajhg.2023.07.001. Epub 2023 Jul 25.
9
Cohort-driven variant burden analysis and pathogenicity identification in monogenic autoinflammatory disorders.队列驱动的单基因自身炎症性疾病变异负担分析和致病性鉴定。
J Allergy Clin Immunol. 2023 Aug;152(2):517-527. doi: 10.1016/j.jaci.2023.03.028. Epub 2023 Apr 7.
10
Genome-wide association study meta-analysis identifies three novel loci for circulating anti-Müllerian hormone levels in women.全基因组关联研究荟萃分析确定了女性循环抗苗勒管激素水平的三个新基因座。
Hum Reprod. 2022 May 3;37(5):1069-1082. doi: 10.1093/humrep/deac028.

本文引用的文献

1
Efficient storage and regression computation for population-scale genome sequencing studies.针对群体规模基因组测序研究的高效存储与回归计算
Bioinformatics. 2025 Mar 4;41(3). doi: 10.1093/bioinformatics/btaf067.
2
Massively parallel characterization of transcriptional regulatory elements.转录调控元件的大规模并行表征
Nature. 2025 Mar;639(8054):411-420. doi: 10.1038/s41586-024-08430-9. Epub 2025 Jan 15.
3
Genetic constraint at single amino acid resolution in protein domains improves missense variant prioritisation and gene discovery.
在蛋白质结构域中单氨基酸分辨率的遗传限制可提高错义变异体的优先级排序和基因发现。
Genome Med. 2024 Jul 11;16(1):88. doi: 10.1186/s13073-024-01358-9.
4
Investigating the role of common cis-regulatory variants in modifying penetrance of putatively damaging, inherited variants in severe neurodevelopmental disorders.研究常见顺式调控变异在修饰具有潜在破坏性的遗传性变异在严重神经发育障碍中的外显率中的作用。
Sci Rep. 2024 Apr 15;14(1):8708. doi: 10.1038/s41598-024-58894-y.
5
Accurate proteome-wide missense variant effect prediction with AlphaMissense.使用 AlphaMissense 进行精确的全蛋白质错义变异效应预测。
Science. 2023 Sep 22;381(6664):eadg7492. doi: 10.1126/science.adg7492.
6
The landscape of tolerated genetic variation in humans and primates.人类和灵长类动物中可耐受遗传变异的景观。
Science. 2023 Jun 2;380(6648):eabn8153. doi: 10.1126/science.abn8197.
7
Rare coding variants in ten genes confer substantial risk for schizophrenia.十个基因中的罕见编码变异赋予精神分裂症的显著风险。
Nature. 2022 Apr;604(7906):509-516. doi: 10.1038/s41586-022-04556-w. Epub 2022 Apr 8.
8
Reduced reproductive success is associated with selective constraint on human genes.生殖成功率降低与人类基因的选择限制有关。
Nature. 2022 Mar;603(7903):858-863. doi: 10.1038/s41586-022-04549-9. Epub 2022 Mar 23.
9
Genetics of 35 blood and urine biomarkers in the UK Biobank.英国生物库中 35 项血液和尿液生物标志物的遗传学研究
Nat Genet. 2021 Feb;53(2):185-194. doi: 10.1038/s41588-020-00757-z. Epub 2021 Jan 18.
10
The mutational constraint spectrum quantified from variation in 141,456 humans.从 141456 名人类个体的变异中量化的突变约束谱。
Nature. 2020 May;581(7809):434-443. doi: 10.1038/s41586-020-2308-7. Epub 2020 May 27.