• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过整合外部对照提高罕见变异检测的效能。

Improving power for rare-variant tests by integrating external controls.

作者信息

Lee Seunggeun, Kim Sehee, Fuchsberger Christian

机构信息

Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA.

Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA.

出版信息

Genet Epidemiol. 2017 Nov;41(7):610-619. doi: 10.1002/gepi.22057. Epub 2017 Jun 28.

DOI:10.1002/gepi.22057
PMID:28657150
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6082405/
Abstract

Due to the drop in sequencing cost, the number of sequenced genomes is increasing rapidly. To improve power of rare-variant tests, these sequenced samples could be used as external control samples in addition to control samples from the study itself. However, when using external controls, possible batch effects due to the use of different sequencing platforms or genotype calling pipelines can dramatically increase type I error rates. To address this, we propose novel summary statistics based single and gene- or region-based rare-variant tests that allow the integration of external controls while controlling for type I error. Our approach is based on the insight that batch effects on a given variant can be assessed by comparing odds ratio estimates using internal controls only vs. using combined control samples of internal and external controls. From simulation experiments and the analysis of data from age-related macular degeneration and type 2 diabetes studies, we demonstrate that our method can substantially improve power while controlling for type I error rate.

摘要

由于测序成本的下降,已测序基因组的数量正在迅速增加。为了提高罕见变异检测的效能,除了来自研究本身的对照样本外,这些已测序样本还可作为外部对照样本。然而,使用外部对照时,由于使用不同的测序平台或基因分型流程可能产生的批次效应会显著增加I型错误率。为了解决这个问题,我们提出了基于新颖汇总统计量的单基因或基于区域的罕见变异检测方法,该方法允许在控制I型错误的同时整合外部对照。我们的方法基于这样一种见解,即通过比较仅使用内部对照与使用内部和外部对照的组合对照样本的比值比估计值,可以评估给定变异上的批次效应。通过模拟实验以及对年龄相关性黄斑变性和2型糖尿病研究数据的分析,我们证明我们的方法在控制I型错误率的同时可以显著提高效能。

相似文献

1
Improving power for rare-variant tests by integrating external controls.通过整合外部对照提高罕见变异检测的效能。
Genet Epidemiol. 2017 Nov;41(7):610-619. doi: 10.1002/gepi.22057. Epub 2017 Jun 28.
2
ProxECAT: Proxy External Controls Association Test. A new case-control gene region association test using allele frequencies from public controls.ProxECAT:代理外部控制协会测试。一种新的病例对照基因区域关联测试,使用公共对照的等位基因频率。
PLoS Genet. 2018 Oct 16;14(10):e1007591. doi: 10.1371/journal.pgen.1007591. eCollection 2018 Oct.
3
Integrating external controls in case-control studies improves power for rare-variant tests.在病例对照研究中纳入外部对照可提高罕见变异检验的效能。
Genet Epidemiol. 2022 Apr;46(3-4):145-158. doi: 10.1002/gepi.22444. Epub 2022 Feb 16.
4
PhredEM: a phred-score-informed genotype-calling approach for next-generation sequencing studies.PhredEM:一种用于下一代测序研究的基于Phred分数的基因型分型方法。
Genet Epidemiol. 2017 Jul;41(5):375-387. doi: 10.1002/gepi.22048. Epub 2017 May 31.
5
Novel score test to increase power in association test by integrating external controls.新型评分检验通过整合外部对照来提高关联检验的功效。
Genet Epidemiol. 2021 Apr;45(3):293-304. doi: 10.1002/gepi.22370. Epub 2020 Nov 8.
6
Testing Rare-Variant Association without Calling Genotypes Allows for Systematic Differences in Sequencing between Cases and Controls.在不进行基因型分型的情况下测试罕见变异关联会导致病例组和对照组在测序方面存在系统性差异。
PLoS Genet. 2016 May 6;12(5):e1006040. doi: 10.1371/journal.pgen.1006040. eCollection 2016 May.
7
Empirical estimation of sequencing error rates using smoothing splines.使用平滑样条对测序错误率进行经验估计。
BMC Bioinformatics. 2016 Apr 22;17:177. doi: 10.1186/s12859-016-1052-3.
8
Comparison and evaluation of two exome capture kits and sequencing platforms for variant calling.两种外显子捕获试剂盒和测序平台用于变异检测的比较与评估
BMC Genomics. 2015 Aug 5;16(1):581. doi: 10.1186/s12864-015-1796-6.
9
Evaluating Variant Calling Tools for Non-Matched Next-Generation Sequencing Data.评估用于非配对下一代测序数据的变异调用工具。
Sci Rep. 2017 Feb 24;7:43169. doi: 10.1038/srep43169.
10
Using genotype array data to compare multi- and single-sample variant calls and improve variant call sets from deep coverage whole-genome sequencing data.利用基因型阵列数据比较多样本和单样本变异检测结果,并改进来自深度覆盖全基因组测序数据的变异检测集。
Bioinformatics. 2017 Apr 15;33(8):1147-1153. doi: 10.1093/bioinformatics/btw786.

引用本文的文献

1
Replication of a GWAS signal near with AML using a disease-only cohort and external population-based controls.使用仅患有疾病的队列和基于外部人群的对照,在与急性髓系白血病相关的区域复制全基因组关联研究信号。
Blood Neoplasia. 2025 May 19;2(3):100118. doi: 10.1016/j.bneo.2025.100118. eCollection 2025 Aug.
2
CCAFE: Estimating Case and Control Allele Frequencies from GWAS Summary Statistics.CCAFE:从全基因组关联研究汇总统计数据中估计病例和对照等位基因频率
bioRxiv. 2024 Oct 29:2024.10.24.619530. doi: 10.1101/2024.10.24.619530.
3
Integrating External Controls by Regression Calibration for Genome-Wide Association Study.基于回归校准的全基因组关联研究的外部控制整合。
Genes (Basel). 2024 Jan 3;15(1):67. doi: 10.3390/genes15010067.
4
Public platform with 39,472 exome control samples enables association studies without genotype sharing.拥有 39472 个外显子对照样本的公共平台可实现无需基因型共享的关联研究。
Nat Genet. 2024 Feb;56(2):327-335. doi: 10.1038/s41588-023-01637-y. Epub 2024 Jan 10.
5
Case-only exome variation analysis of severe alcohol dependence using a multivariate hierarchical gene clustering approach.采用多变量层次基因聚类方法对严重酒精依赖进行病例外显子变异分析。
PLoS One. 2023 Apr 25;18(4):e0283985. doi: 10.1371/journal.pone.0283985. eCollection 2023.
6
Inflated expectations: Rare-variant association analysis using public controls. inflated expectations:使用公共对照进行罕见变异关联分析。
PLoS One. 2023 Jan 25;18(1):e0280951. doi: 10.1371/journal.pone.0280951. eCollection 2023.
7
Recent advances and challenges of rare variant association analysis in the biobank sequencing era.生物样本库测序时代罕见变异关联分析的最新进展与挑战
Front Genet. 2022 Oct 6;13:1014947. doi: 10.3389/fgene.2022.1014947. eCollection 2022.
8
Opportunities and challenges for the use of common controls in sequencing studies.测序研究中使用常见对照的机遇和挑战。
Nat Rev Genet. 2022 Nov;23(11):665-679. doi: 10.1038/s41576-022-00487-4. Epub 2022 May 17.
9
Integrating external controls in case-control studies improves power for rare-variant tests.在病例对照研究中纳入外部对照可提高罕见变异检验的效能。
Genet Epidemiol. 2022 Apr;46(3-4):145-158. doi: 10.1002/gepi.22444. Epub 2022 Feb 16.
10
A data harmonization pipeline to leverage external controls and boost power in GWAS.利用外部对照和提高 GWAS 效能的数据协调管道。
Hum Mol Genet. 2022 Feb 3;31(3):481-489. doi: 10.1093/hmg/ddab261.

本文引用的文献

1
Analysis of protein-coding genetic variation in 60,706 humans.对60706名人类的蛋白质编码基因变异进行分析。
Nature. 2016 Aug 18;536(7616):285-91. doi: 10.1038/nature19057.
2
The genetic architecture of type 2 diabetes.2型糖尿病的遗传结构
Nature. 2016 Aug 4;536(7614):41-47. doi: 10.1038/nature18642. Epub 2016 Jul 11.
3
Testing Rare-Variant Association without Calling Genotypes Allows for Systematic Differences in Sequencing between Cases and Controls.在不进行基因型分型的情况下测试罕见变异关联会导致病例组和对照组在测序方面存在系统性差异。
PLoS Genet. 2016 May 6;12(5):e1006040. doi: 10.1371/journal.pgen.1006040. eCollection 2016 May.
4
A Method to Exploit the Structure of Genetic Ancestry Space to Enhance Case-Control Studies.一种利用遗传祖先空间结构来加强病例对照研究的方法。
Am J Hum Genet. 2016 May 5;98(5):857-868. doi: 10.1016/j.ajhg.2016.02.025. Epub 2016 Apr 14.
5
Rare-variant association analysis: study designs and statistical tests.罕见变异关联分析:研究设计与统计检验。
Am J Hum Genet. 2014 Jul 3;95(1):5-23. doi: 10.1016/j.ajhg.2014.06.009.
6
Association analysis using next-generation sequence data from publicly available control groups: the robust variance score statistic.利用公共可用对照组的下一代测序数据进行关联分析:稳健方差得分统计。
Bioinformatics. 2014 Aug 1;30(15):2179-88. doi: 10.1093/bioinformatics/btu196. Epub 2014 Apr 14.
7
Ancestry estimation and control of population stratification for sequence-based association studies.基于序列的关联研究中的祖源估计和群体分层控制。
Nat Genet. 2014 Apr;46(4):409-15. doi: 10.1038/ng.2924. Epub 2014 Mar 16.
8
Identification of low-frequency and rare sequence variants associated with elevated or reduced risk of type 2 diabetes.鉴定与 2 型糖尿病风险升高或降低相关的低频和罕见序列变异。
Nat Genet. 2014 Mar;46(3):294-8. doi: 10.1038/ng.2882. Epub 2014 Jan 26.
9
Searching for missing heritability: designing rare variant association studies.寻找缺失的遗传力:设计罕见变异关联研究。
Proc Natl Acad Sci U S A. 2014 Jan 28;111(4):E455-64. doi: 10.1073/pnas.1322563111. Epub 2014 Jan 17.
10
Rare coding variants in the phospholipase D3 gene confer risk for Alzheimer's disease.磷脂酶 D3 基因中的罕见编码变异与阿尔茨海默病的风险相关。
Nature. 2014 Jan 23;505(7484):550-554. doi: 10.1038/nature12825. Epub 2013 Dec 11.