• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于置换统计量的全基因组测序数据中假定因果基因座的识别。

Identification of putative causal loci in whole-genome sequencing data via knockoff statistics.

机构信息

Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA.

Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, USA.

出版信息

Nat Commun. 2021 May 25;12(1):3152. doi: 10.1038/s41467-021-22889-4.

DOI:10.1038/s41467-021-22889-4
PMID:34035245
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8149672/
Abstract

The analysis of whole-genome sequencing studies is challenging due to the large number of rare variants in noncoding regions and the lack of natural units for testing. We propose a statistical method to detect and localize rare and common risk variants in whole-genome sequencing studies based on a recently developed knockoff framework. It can (1) prioritize causal variants over associations due to linkage disequilibrium thereby improving interpretability; (2) help distinguish the signal due to rare variants from shadow effects of significant common variants nearby; (3) integrate multiple knockoffs for improved power, stability, and reproducibility; and (4) flexibly incorporate state-of-the-art and future association tests to achieve the benefits proposed here. In applications to whole-genome sequencing data from the Alzheimer's Disease Sequencing Project (ADSP) and COPDGene samples from NHLBI Trans-Omics for Precision Medicine (TOPMed) Program we show that our method compared with conventional association tests can lead to substantially more discoveries.

摘要

由于非编码区域中存在大量罕见变异,以及缺乏用于检测的自然单位,全基因组测序研究的分析具有挑战性。我们提出了一种统计方法,基于最近开发的 knockoff 框架,用于检测和定位全基因组测序研究中的罕见和常见风险变异。它可以(1)优先考虑由于连锁不平衡而导致的因果变异,从而提高可解释性;(2)有助于区分罕见变异引起的信号和附近显著常见变异的阴影效应;(3)整合多个 knockoffs 以提高功效、稳定性和可重复性;以及(4)灵活地结合最先进和未来的关联测试,以实现这里提出的好处。在应用于阿尔茨海默病测序项目(ADSP)的全基因组测序数据和 NHLBI Trans-Omics for Precision Medicine(TOPMed)计划的 COPDGene 样本时,我们表明,与传统关联测试相比,我们的方法可以导致更多的发现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/c596603b5cfc/41467_2021_22889_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/5f71f2be2d00/41467_2021_22889_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/56c133ffba61/41467_2021_22889_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/3d7df6e404bb/41467_2021_22889_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/adb8e81ee60b/41467_2021_22889_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/4569b9dd9ff5/41467_2021_22889_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/a29ff2e5fcdb/41467_2021_22889_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/eaf60340993b/41467_2021_22889_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/9a19f3d13498/41467_2021_22889_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/c596603b5cfc/41467_2021_22889_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/5f71f2be2d00/41467_2021_22889_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/56c133ffba61/41467_2021_22889_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/3d7df6e404bb/41467_2021_22889_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/adb8e81ee60b/41467_2021_22889_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/4569b9dd9ff5/41467_2021_22889_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/a29ff2e5fcdb/41467_2021_22889_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/eaf60340993b/41467_2021_22889_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/9a19f3d13498/41467_2021_22889_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb3/8149672/c596603b5cfc/41467_2021_22889_Fig9_HTML.jpg

相似文献

1
Identification of putative causal loci in whole-genome sequencing data via knockoff statistics.基于置换统计量的全基因组测序数据中假定因果基因座的识别。
Nat Commun. 2021 May 25;12(1):3152. doi: 10.1038/s41467-021-22889-4.
2
Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations.超过 10 万 NHLBI 转化医学精准医学(TOPMed)联盟全基因组序列的使用提高了混合非裔和西班牙裔/拉丁裔人群中罕见变异关联的推断质量和检测能力。
PLoS Genet. 2019 Dec 23;15(12):e1008500. doi: 10.1371/journal.pgen.1008500. eCollection 2019 Dec.
3
Whole-Genome Sequencing Association Analyses of Stroke and Its Subtypes in Ancestrally Diverse Populations From Trans-Omics for Precision Medicine Project.基于精准医学项目的跨组学研究,对来自不同祖先人群的中风及其亚型进行全基因组测序关联分析。
Stroke. 2022 Mar;53(3):875-885. doi: 10.1161/STROKEAHA.120.031792. Epub 2021 Nov 3.
4
A genome-wide scan statistic framework for whole-genome sequence data analysis.全基因组序列数据分析的全基因组扫描统计框架。
Nat Commun. 2019 Jul 9;10(1):3018. doi: 10.1038/s41467-019-11023-0.
5
Genome-wide analysis of common and rare variants via multiple knockoffs at biobank scale, with an application to Alzheimer disease genetics.基于生物库规模的多重置换进行全基因组常见和罕见变异的分析,及其在阿尔茨海默病遗传学中的应用。
Am J Hum Genet. 2021 Dec 2;108(12):2336-2353. doi: 10.1016/j.ajhg.2021.10.009. Epub 2021 Nov 11.
6
Dynamic Scan Procedure for Detecting Rare-Variant Association Regions in Whole-Genome Sequencing Studies.全基因组测序研究中稀有变异关联区域的动态扫描程序。
Am J Hum Genet. 2019 May 2;104(5):802-814. doi: 10.1016/j.ajhg.2019.03.002. Epub 2019 Apr 12.
7
KnockoffHybrid: A knockoff framework for hybrid analysis of trio and population designs in genome-wide association studies.仿冒混合分析框架:全基因组关联研究中 trio 设计和群体设计的混合分析框架。
Am J Hum Genet. 2024 Jul 11;111(7):1448-1461. doi: 10.1016/j.ajhg.2024.05.003. Epub 2024 May 30.
8
GhostKnockoff inference empowers identification of putative causal variants in genome-wide association studies.幽灵复刻推断使全基因组关联研究中假定因果变异的识别成为可能。
Nat Commun. 2022 Nov 23;13(1):7209. doi: 10.1038/s41467-022-34932-z.
9
Multi-resolution localization of causal variants across the genome.全基因组因果变异的多分辨率定位。
Nat Commun. 2020 Feb 27;11(1):1093. doi: 10.1038/s41467-020-14791-2.
10
Powerful gene-based testing by integrating long-range chromatin interactions and knockoff genotypes.基于基因的强大测试,整合长程染色质相互作用和 knockoff 基因型。
Proc Natl Acad Sci U S A. 2021 Nov 23;118(47). doi: 10.1073/pnas.2105191118.

引用本文的文献

1
Reflection Knockoffs via Householder Reflection: Applications in Proteomics and Genetic Fine Mapping.通过豪斯霍尔德反射实现的反射仿冒品:在蛋白质组学和基因精细定位中的应用
bioRxiv. 2025 May 29:2025.01.16.633369. doi: 10.1101/2025.01.16.633369.
2
It's a wrap: deriving distinct discoveries with FDR control after a GWAS pipeline.大功告成:在全基因组关联研究流程之后通过错误发现率控制得出不同的发现。
bioRxiv. 2025 Jul 9:2025.06.05.658138. doi: 10.1101/2025.06.05.658138.
3
Robust inference with GhostKnockoffs in genome-wide association studies.全基因组关联研究中使用GhostKnockoffs进行稳健推断。

本文引用的文献

1
A simple new approach to variable selection in regression, with application to genetic fine mapping.一种用于回归中变量选择的简单新方法及其在基因精细定位中的应用。
J R Stat Soc Series B Stat Methodol. 2020 Dec;82(5):1273-1300. doi: 10.1111/rssb.12388. Epub 2020 Jul 10.
2
False discovery rate control in genome-wide association studies with population structure.基于群体结构的全基因组关联研究中的假发现率控制。
Proc Natl Acad Sci U S A. 2021 Oct 5;118(40). doi: 10.1073/pnas.2105841118.
3
Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.
Res Sq. 2025 May 5:rs.3.rs-6396196. doi: 10.21203/rs.3.rs-6396196/v1.
4
Local genetic correlation via knockoffs reduces confounding due to cross-trait assortative mating.通过仿制品进行的局部遗传相关性分析可减少因跨性状选型交配导致的混杂因素。
Am J Hum Genet. 2024 Dec 5;111(12):2839-2848. doi: 10.1016/j.ajhg.2024.10.012. Epub 2024 Nov 14.
5
Second-order group knockoffs with applications to genome-wide association studies.二阶群组置换检验及其在全基因组关联研究中的应用。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae580.
6
Summary statistics knockoffs inference with family-wise error rate control.基于 FWER 控制的摘要统计量置换检验推断。
Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae082.
7
KnockoffHybrid: A knockoff framework for hybrid analysis of trio and population designs in genome-wide association studies.仿冒混合分析框架:全基因组关联研究中 trio 设计和群体设计的混合分析框架。
Am J Hum Genet. 2024 Jul 11;111(7):1448-1461. doi: 10.1016/j.ajhg.2024.05.003. Epub 2024 May 30.
8
Enhancing credit scoring accuracy with a comprehensive evaluation of alternative data.利用替代数据进行综合评估,提高信用评分准确性。
PLoS One. 2024 May 21;19(5):e0303566. doi: 10.1371/journal.pone.0303566. eCollection 2024.
9
Key variants via the Alzheimer's Disease Sequencing Project whole genome sequence data.关键变异通过阿尔茨海默病测序计划全基因组序列数据。
Alzheimers Dement. 2024 May;20(5):3290-3304. doi: 10.1002/alz.13705. Epub 2024 Mar 21.
10
Beyond guilty by association at scale: searching for causal variants on the basis of genome-wide summary statistics.超越大规模关联推断有罪:基于全基因组汇总统计寻找因果变异体。
bioRxiv. 2025 Feb 26:2024.02.28.582621. doi: 10.1101/2024.02.28.582621.
美国国立卫生研究院生物医学高级研究与发展局(NHLBI)TOPMed 项目中对 53831 个不同基因组进行测序。
Nature. 2021 Feb;590(7845):290-299. doi: 10.1038/s41586-021-03205-y. Epub 2021 Feb 10.
4
Cauchy combination test: a powerful test with analytic -value calculation under arbitrary dependency structures.柯西组合检验:一种在任意相依结构下具有解析值计算功能的强大检验。
J Am Stat Assoc. 2020;115(529):393-402. doi: 10.1080/01621459.2018.1554485. Epub 2019 Apr 25.
5
Scalable generalized linear mixed model for region-based association tests in large biobanks and cohorts.基于区域的大型生物库和队列关联检验的可扩展广义线性混合模型。
Nat Genet. 2020 Jun;52(6):634-639. doi: 10.1038/s41588-020-0621-6. Epub 2020 May 18.
6
Multi-resolution localization of causal variants across the genome.全基因组因果变异的多分辨率定位。
Nat Commun. 2020 Feb 27;11(1):1093. doi: 10.1038/s41467-020-14791-2.
7
UK Biobank Whole-Exome Sequence Binary Phenome Analysis with Robust Region-Based Rare-Variant Test.英国生物银行全外显子组序列双表型分析与稳健基于区域的罕见变异测试。
Am J Hum Genet. 2020 Jan 2;106(1):3-12. doi: 10.1016/j.ajhg.2019.11.012. Epub 2019 Dec 19.
8
Ensembl 2020.Ensembl 2020.
Nucleic Acids Res. 2020 Jan 8;48(D1):D682-D688. doi: 10.1093/nar/gkz966.
9
Circulating glucuronic acid predicts healthspan and longevity in humans and mice.循环葡萄糖醛酸可预测人类和小鼠的健康寿命和寿命。
Aging (Albany NY). 2019 Sep 26;11(18):7694-7706. doi: 10.18632/aging.102281.
10
Ultrarare variants drive substantial cis heritability of human gene expression.超罕见变异驱动人类基因表达的大量顺式遗传。
Nat Genet. 2019 Sep;51(9):1349-1355. doi: 10.1038/s41588-019-0487-7. Epub 2019 Sep 2.