• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepNull 模型可以对非线性协变量效应进行建模,以提高表型预测和关联能力。

DeepNull models non-linear covariate effects to improve phenotypic prediction and association power.

机构信息

Google Health, Palo Alto, CA, USA.

Google Health, Cambridge, MA, USA.

出版信息

Nat Commun. 2022 Jan 11;13(1):241. doi: 10.1038/s41467-021-27930-0.

DOI:10.1038/s41467-021-27930-0
PMID:35017556
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8752755/
Abstract

Genome-wide association studies (GWASs) examine the association between genotype and phenotype while adjusting for a set of covariates. Although the covariates may have non-linear or interactive effects, due to the challenge of specifying the model, GWAS often neglect such terms. Here we introduce DeepNull, a method that identifies and adjusts for non-linear and interactive covariate effects using a deep neural network. In analyses of simulated and real data, we demonstrate that DeepNull maintains tight control of the type I error while increasing statistical power by up to 20% in the presence of non-linear and interactive effects. Moreover, in the absence of such effects, DeepNull incurs no loss of power. When applied to 10 phenotypes from the UK Biobank (n = 370K), DeepNull discovered more hits (+6%) and loci (+7%), on average, than conventional association analyses, many of which are biologically plausible or have previously been reported. Finally, DeepNull improves upon linear modeling for phenotypic prediction (+23% on average).

摘要

全基因组关联研究 (GWAS) 考察了基因型与表型之间的关联,同时调整了一组协变量。尽管协变量可能具有非线性或交互作用,但由于指定模型的挑战,GWAS 通常忽略了这些术语。在这里,我们介绍了 DeepNull,这是一种使用深度神经网络识别和调整非线性和交互协变量效应的方法。在对模拟和真实数据的分析中,我们证明了 DeepNull 在存在非线性和交互作用的情况下保持了严格的 I 型错误控制,同时将统计功效提高了多达 20%。此外,在没有这些影响的情况下,DeepNull 不会损失功效。当应用于来自英国生物银行的 10 种表型(n = 370K)时,DeepNull 发现了更多的命中(平均增加 6%)和位点(平均增加 7%),其中许多是生物学上合理的或以前已经报道过的。最后,DeepNull 提高了表型预测的线性建模效果(平均提高 23%)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9483/8752755/e6517748025d/41467_2021_27930_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9483/8752755/2f92115f29e3/41467_2021_27930_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9483/8752755/a07b504b385f/41467_2021_27930_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9483/8752755/6d073902cddb/41467_2021_27930_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9483/8752755/c88f45f8a62d/41467_2021_27930_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9483/8752755/e6517748025d/41467_2021_27930_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9483/8752755/2f92115f29e3/41467_2021_27930_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9483/8752755/a07b504b385f/41467_2021_27930_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9483/8752755/6d073902cddb/41467_2021_27930_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9483/8752755/c88f45f8a62d/41467_2021_27930_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9483/8752755/e6517748025d/41467_2021_27930_Fig5_HTML.jpg

相似文献

1
DeepNull models non-linear covariate effects to improve phenotypic prediction and association power.DeepNull 模型可以对非线性协变量效应进行建模,以提高表型预测和关联能力。
Nat Commun. 2022 Jan 11;13(1):241. doi: 10.1038/s41467-021-27930-0.
2
A novel association test for multiple secondary phenotypes from a case-control GWAS.一种针对病例对照全基因组关联研究中多个次要表型的新型关联测试。
Genet Epidemiol. 2017 Jul;41(5):413-426. doi: 10.1002/gepi.22045. Epub 2017 Apr 10.
3
A powerful subset-based method identifies gene set associations and improves interpretation in UK Biobank.一种强大的基于子集的方法可识别基因集关联,并改善 UK Biobank 中的解释。
Am J Hum Genet. 2021 Apr 1;108(4):669-681. doi: 10.1016/j.ajhg.2021.02.016. Epub 2021 Mar 16.
4
Genome-wide discovery for biomarkers using quantile regression at biobank scale.基于生物库规模的分位数回归进行全基因组生物标志物发现。
Nat Commun. 2024 Jul 31;15(1):6460. doi: 10.1038/s41467-024-50726-x.
5
Adjustment for covariates using summary statistics of genome-wide association studies.使用全基因组关联研究的汇总统计量对协变量进行调整。
Genet Epidemiol. 2018 Dec;42(8):812-825. doi: 10.1002/gepi.22148. Epub 2018 Sep 20.
6
Interpretable artificial neural networks incorporating Bayesian alphabet models for genome-wide prediction and association studies.基于贝叶斯字母模型的可解释人工神经网络在全基因组预测和关联研究中的应用。
G3 (Bethesda). 2021 Sep 27;11(10). doi: 10.1093/g3journal/jkab228.
7
Methodological Considerations in Estimation of Phenotype Heritability Using Genome-Wide SNP Data, Illustrated by an Analysis of the Heritability of Height in a Large Sample of African Ancestry Adults.利用全基因组SNP数据估计表型遗传力的方法学考量,以对大量非洲裔成年人身高遗传力的分析为例
PLoS One. 2015 Jun 30;10(6):e0131106. doi: 10.1371/journal.pone.0131106. eCollection 2015.
8
Estimating the effective sample size in association studies of quantitative traits.估计数量性状关联研究中的有效样本量。
G3 (Bethesda). 2021 Jun 17;11(6). doi: 10.1093/g3journal/jkab057.
9
Efficient mixed model approach for large-scale genome-wide association studies of ordinal categorical phenotypes.高效混合模型方法在大规模全基因组关联研究中对有序分类表型的应用。
Am J Hum Genet. 2021 May 6;108(5):825-839. doi: 10.1016/j.ajhg.2021.03.019. Epub 2021 Apr 8.
10
GbyE: an integrated tool for genome widely association study and genome selection based on genetic by environmental interaction.GbyE:一种基于遗传与环境互作的全基因组关联研究和基因组选择的综合工具。
BMC Genomics. 2024 Apr 19;25(1):386. doi: 10.1186/s12864-024-10310-5.

引用本文的文献

1
Learning genotype-phenotype associations from gaps in multi-species sequence alignments.从多物种序列比对的缺口处学习基因型-表型关联。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf022.
2
The Significant Effects of Threshold Selection for Advancing Nitrogen Use Efficiency in Whole Genome of Bread Wheat.阈值选择对提高面包小麦全基因组氮素利用效率的显著影响
Plant Direct. 2025 Jan 21;9(1):e70036. doi: 10.1002/pld3.70036. eCollection 2025 Jan.
3
Deep learning captures the effect of epistasis in multifactorial diseases.

本文引用的文献

1
Large-scale machine-learning-based phenotyping significantly improves genomic discovery for optic nerve head morphology.基于大规模机器学习的表型分析显著提高了视神经头形态的基因组发现。
Am J Hum Genet. 2021 Jul 1;108(7):1217-1230. doi: 10.1016/j.ajhg.2021.05.004. Epub 2021 Jun 1.
2
The GTEx Consortium atlas of genetic regulatory effects across human tissues.GTEx 联盟人类组织遗传调控效应图谱
Science. 2020 Sep 11;369(6509):1318-1330. doi: 10.1126/science.aaz1776.
3
Operating characteristics of the rank-based inverse normal transformation for quantitative trait analysis in genome-wide association studies.
深度学习捕捉多因素疾病中上位性的影响。
Front Med (Lausanne). 2025 Jan 7;11:1479717. doi: 10.3389/fmed.2024.1479717. eCollection 2024.
4
A review of deep learning models for the prediction of chromatin interactions with DNA and epigenomic profiles.用于预测染色质与DNA相互作用及表观基因组图谱的深度学习模型综述。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae651.
5
Machine learning models for predicting blood pressure phenotypes by combining multiple polygenic risk scores.基于多种多基因风险评分预测血压表型的机器学习模型。
Sci Rep. 2024 May 30;14(1):12436. doi: 10.1038/s41598-024-62945-9.
6
Development and application of a deep learning-based comprehensive early diagnostic model for chronic obstructive pulmonary disease.基于深度学习的慢性阻塞性肺疾病综合早期诊断模型的建立与应用。
Respir Res. 2024 Apr 18;25(1):167. doi: 10.1186/s12931-024-02793-3.
7
Machine learning models for blood pressure phenotypes combining multiple polygenic risk scores.结合多个多基因风险评分的血压表型机器学习模型。
medRxiv. 2023 Dec 14:2023.12.13.23299909. doi: 10.1101/2023.12.13.23299909.
8
Statistical inference with large-scale trait imputation.基于大规模性状插补的统计推断。
Stat Med. 2024 Feb 20;43(4):625-641. doi: 10.1002/sim.9975. Epub 2023 Dec 1.
9
Artificial intelligence for dementia genetics and omics.人工智能在痴呆症遗传学和组学中的应用。
Alzheimers Dement. 2023 Dec;19(12):5905-5921. doi: 10.1002/alz.13427. Epub 2023 Aug 22.
10
Multi-PGS enhances polygenic prediction by combining 937 polygenic scores.多基因评分聚合(Multi-PGS)通过整合 937 个多基因评分来增强多基因预测。
Nat Commun. 2023 Aug 5;14(1):4702. doi: 10.1038/s41467-023-40330-w.
基于等级的逆正态变换在全基因组关联研究中数量性状分析的操作特征。
Biometrics. 2020 Dec;76(4):1262-1272. doi: 10.1111/biom.13214. Epub 2020 Jan 13.
4
Clinical use of current polygenic risk scores may exacerbate health disparities.现行多基因风险评分的临床应用可能会加剧健康差异。
Nat Genet. 2019 Apr;51(4):584-591. doi: 10.1038/s41588-019-0379-x. Epub 2019 Mar 29.
5
New genetic signals for lung function highlight pathways and chronic obstructive pulmonary disease associations across multiple ancestries.新的肺功能遗传信号突出了多种血统中的途径和慢性阻塞性肺疾病的关联。
Nat Genet. 2019 Mar;51(3):481-493. doi: 10.1038/s41588-018-0321-7. Epub 2019 Feb 25.
6
Leveraging Polygenic Functional Enrichment to Improve GWAS Power.利用多基因功能富集提高 GWAS 效力。
Am J Hum Genet. 2019 Jan 3;104(1):65-75. doi: 10.1016/j.ajhg.2018.11.008. Epub 2018 Dec 27.
7
The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019.NHGRI-EBI GWAS Catalog 于 2019 年发布的已发表全基因组关联研究、靶向基因芯片和汇总统计数据
Nucleic Acids Res. 2019 Jan 8;47(D1):D1005-D1012. doi: 10.1093/nar/gky1120.
8
The UK Biobank resource with deep phenotyping and genomic data.英国生物银行资源库,具有深度表型和基因组数据。
Nature. 2018 Oct;562(7726):203-209. doi: 10.1038/s41586-018-0579-z. Epub 2018 Oct 10.
9
Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations.全基因组多基因疾病风险评分可识别出与单基因突变风险相当的个体。
Nat Genet. 2018 Sep;50(9):1219-1224. doi: 10.1038/s41588-018-0183-z. Epub 2018 Aug 13.
10
Mixed-model association for biobank-scale datasets.基于生物库规模数据集的混合模型关联分析。
Nat Genet. 2018 Jul;50(7):906-908. doi: 10.1038/s41588-018-0144-6.