• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于生物库规模的分位数回归进行全基因组生物标志物发现。

Genome-wide discovery for biomarkers using quantile regression at biobank scale.

机构信息

Department of Biostatistics, Columbia University, New York, NY, USA.

Division of Nephrology, Department of Medicine, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA.

出版信息

Nat Commun. 2024 Jul 31;15(1):6460. doi: 10.1038/s41467-024-50726-x.

DOI:10.1038/s41467-024-50726-x
PMID:39085219
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11291931/
Abstract

Genome-wide association studies (GWAS) for biomarkers important for clinical phenotypes can lead to clinically relevant discoveries. Conventional GWAS for quantitative traits are based on simplified regression models modeling the conditional mean of a phenotype as a linear function of genotype. We draw attention here to an alternative, lesser known approach, namely quantile regression that naturally extends linear regression to the analysis of the entire conditional distribution of a phenotype of interest. Quantile regression can be applied efficiently at biobank scale, while having some unique advantages such as (1) identifying variants with heterogeneous effects across quantiles of the phenotype distribution; (2) accommodating a wide range of phenotype distributions including non-normal distributions, with invariance of results to trait transformations; and (3) providing more detailed information about genotype-phenotype associations even for those associations identified by conventional GWAS. We show in simulations that quantile regression is powerful across both homogeneous and various heterogeneous models. Applications to 39 quantitative traits in the UK Biobank demonstrate that quantile regression can be a helpful complement to linear regression in GWAS and can identify variants with larger effects on high-risk subgroups of individuals but with lower or no contribution overall.

摘要

全基因组关联研究(GWAS)对于重要的临床表型生物标志物可以带来有临床意义的发现。传统的用于定量性状的 GWAS 基于简化的回归模型,将表型的条件均值建模为基因型的线性函数。我们在这里提请注意另一种不太为人知的方法,即分位数回归,它自然地将线性回归扩展到对感兴趣的表型的整个条件分布的分析。分位数回归可以在生物库规模上高效应用,同时具有一些独特的优势,例如(1)在表型分布的分位数上识别具有异质效应的变体;(2)适应包括非正态分布在内的广泛的表型分布,结果对特征转换不变;(3)即使对于那些通过传统 GWAS 识别的关联,也能提供关于基因型-表型关联的更详细信息。我们在模拟中表明,分位数回归在同质和各种异质模型中都非常强大。在英国生物库中的 39 个定量性状中的应用表明,分位数回归可以成为 GWAS 中线性回归的有益补充,并且可以识别对个体高风险亚组有更大影响的变体,但总体贡献较低或没有。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/e60aad650f67/41467_2024_50726_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/ad0046b17fb9/41467_2024_50726_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/a0737d7dec78/41467_2024_50726_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/b456567144e9/41467_2024_50726_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/7f9f006208d1/41467_2024_50726_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/a97d6d50d450/41467_2024_50726_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/e60aad650f67/41467_2024_50726_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/ad0046b17fb9/41467_2024_50726_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/a0737d7dec78/41467_2024_50726_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/b456567144e9/41467_2024_50726_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/7f9f006208d1/41467_2024_50726_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/a97d6d50d450/41467_2024_50726_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a10/11291931/e60aad650f67/41467_2024_50726_Fig6_HTML.jpg

相似文献

1
Genome-wide discovery for biomarkers using quantile regression at biobank scale.基于生物库规模的分位数回归进行全基因组生物标志物发现。
Nat Commun. 2024 Jul 31;15(1):6460. doi: 10.1038/s41467-024-50726-x.
2
Quantile Regression for biomarkers in the UK Biobank.英国生物银行中生物标志物的分位数回归
bioRxiv. 2023 Jun 7:2023.06.05.543699. doi: 10.1101/2023.06.05.543699.
3
Reproducibility in the UK biobank of genome-wide significant signals discovered in earlier genome-wide association studies.英国生物银行重现了先前全基因组关联研究中发现的全基因组显著信号。
Sci Rep. 2021 Sep 20;11(1):18625. doi: 10.1038/s41598-021-97896-y.
4
Analyses of biomarker traits in diverse UK biobank participants identify associations missed by European-centric analysis strategies.在英国生物银行的不同参与者中分析生物标志物特征,可识别出欧洲中心分析策略错过的关联。
J Hum Genet. 2022 Feb;67(2):87-93. doi: 10.1038/s10038-021-00968-0. Epub 2021 Aug 11.
5
Ordered multinomial regression for genetic association analysis of ordinal phenotypes at Biobank scale.基于生物库规模的有序表型遗传关联分析的有序多项回归。
Genet Epidemiol. 2020 Apr;44(3):248-260. doi: 10.1002/gepi.22276. Epub 2019 Dec 26.
6
An atlas of genetic associations in UK Biobank.英国生物银行中的遗传关联图谱
Nat Genet. 2018 Nov;50(11):1593-1599. doi: 10.1038/s41588-018-0248-z. Epub 2018 Oct 22.
7
Synthetic surrogates improve power for genome-wide association studies of partially missing phenotypes in population biobanks.合成替代物可提高在人群生物库中对部分缺失表型进行全基因组关联研究的功效。
Nat Genet. 2024 Jul;56(7):1527-1536. doi: 10.1038/s41588-024-01793-9. Epub 2024 Jun 13.
8
Discovering non-additive heritability using additive GWAS summary statistics.利用加性 GWAS 汇总统计数据发现非加性遗传力。
Elife. 2024 Jun 24;13:e90459. doi: 10.7554/eLife.90459.
9
QRank: a novel quantile regression tool for eQTL discovery.QRank:一种用于 eQTL 发现的新型分位数回归工具。
Bioinformatics. 2017 Jul 15;33(14):2123-2130. doi: 10.1093/bioinformatics/btx119.
10
A Fast and Accurate Method for Genome-wide Scale Phenome-wide G × E Analysis and Its Application to UK Biobank.一种用于全基因组规模表型全基因组 G × E 分析的快速准确方法及其在 UK Biobank 中的应用。
Am J Hum Genet. 2019 Dec 5;105(6):1182-1192. doi: 10.1016/j.ajhg.2019.10.008. Epub 2019 Nov 14.

引用本文的文献

1
Regenie.QRS: computationally efficient whole-genome quantile regression at biobank scale.Regenie.QRS:生物样本库规模下计算效率高的全基因组分位数回归
bioRxiv. 2025 May 7:2025.05.02.651730. doi: 10.1101/2025.05.02.651730.
2
Quantile-specific confounding: correction for subtle population stratification via quantile regression.分位数特定混杂:通过分位数回归校正细微的人群分层。
bioRxiv. 2025 Mar 19:2025.03.18.638253. doi: 10.1101/2025.03.18.638253.
3
fastGxE: Powering genome-wide detection of genotype-environment interactions in biobank studies.

本文引用的文献

1
Population size in QTL detection using quantile regression in genome-wide association studies.基于分位数回归的全基因组关联研究中 QTL 检测的群体大小。
Sci Rep. 2023 Jun 13;13(1):9585. doi: 10.1038/s41598-023-36730-z.
2
Analysis of genetic dominance in the UK Biobank.英国生物银行的遗传显性分析。
Science. 2023 Mar 31;379(6639):1341-1348. doi: 10.1126/science.abn8455. Epub 2023 Mar 30.
3
Genomics and phenomics of body mass index reveals a complex disease network.体质指数的基因组学和表型组学揭示了一个复杂的疾病网络。
fastGxE:助力生物样本库研究中全基因组范围内基因-环境相互作用的检测。
Res Sq. 2025 Mar 20:rs.3.rs-5952773. doi: 10.21203/rs.3.rs-5952773/v1.
4
The integration of quantile regression with 3VmrMLM identifies more QTNs and QTN-by-environment interactions using SNP- and haplotype-based markers.分位数回归与3VmrMLM的整合使用基于单核苷酸多态性(SNP)和单倍型的标记物鉴定出更多的数量性状核苷酸(QTN)以及QTN与环境的互作。
Plant Commun. 2025 Mar 10;6(3):101196. doi: 10.1016/j.xplc.2024.101196. Epub 2024 Nov 23.
Nat Commun. 2022 Dec 29;13(1):7973. doi: 10.1038/s41467-022-35553-2.
4
A quantile integral linear model to quantify genetic effects on phenotypic variability.一种分位数积分线性模型,用于量化遗传效应对表型变异性的影响。
Proc Natl Acad Sci U S A. 2022 Sep 27;119(39):e2212959119. doi: 10.1073/pnas.2212959119. Epub 2022 Sep 19.
5
Large-scale genome-wide association study of coronary artery disease in genetically diverse populations.大规模全基因组关联研究在遗传多样化人群中的冠状动脉疾病。
Nat Med. 2022 Aug;28(8):1679-1692. doi: 10.1038/s41591-022-01891-3. Epub 2022 Aug 1.
6
Variance-quantitative trait loci enable systematic discovery of gene-environment interactions for cardiometabolic serum biomarkers.方差-数量性状基因座可用于系统地发现与心血管代谢血清生物标志物相关的基因-环境相互作用。
Nat Commun. 2022 Jul 9;13(1):3993. doi: 10.1038/s41467-022-31625-5.
7
Extensive pleiotropism and allelic heterogeneity mediate metabolic effects of and .广泛的多效性和等位基因异质性介导 和 的代谢作用。
Science. 2021 Jun 4;372(6546):1085-1091. doi: 10.1126/science.abf1008.
8
Long noncoding RNA LYPLAL1-AS1 regulates adipogenic differentiation of human mesenchymal stem cells by targeting desmoplakin and inhibiting the Wnt/β-catenin pathway.长链非编码RNA LYPLAL1-AS1通过靶向桥粒斑蛋白并抑制Wnt/β-连环蛋白通路来调节人间充质干细胞的成脂分化。
Cell Death Discov. 2021 May 15;7(1):105. doi: 10.1038/s41420-021-00500-5.
9
Functional dynamic genetic effects on gene regulation are specific to particular cell types and environmental conditions.功能动态遗传对基因调控的影响是特定于特定细胞类型和环境条件的。
Elife. 2021 May 14;10:e67077. doi: 10.7554/eLife.67077.
10
Genetics of 35 blood and urine biomarkers in the UK Biobank.英国生物库中 35 项血液和尿液生物标志物的遗传学研究
Nat Genet. 2021 Feb;53(2):185-194. doi: 10.1038/s41588-020-00757-z. Epub 2021 Jan 18.