• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

探讨基于分层假发现率的区域关联测试在稀有遗传变异中的潜在益处。

Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation.

机构信息

Lady Davis Institute for Medical Research, Jewish General Hospital Montreal, QC, Canada ; Department of Epidemiology, Biostatistics and Occupational Health, McGill University Montreal, QC, Canada.

Department of Epidemiology, Biostatistics and Occupational Health, McGill University Montreal, QC, Canada.

出版信息

Front Genet. 2014 Jan 29;5:11. doi: 10.3389/fgene.2014.00011. eCollection 2014.

DOI:10.3389/fgene.2014.00011
PMID:24523729
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3905218/
Abstract

When analyzing the data that arises from exome or whole-genome sequencing studies, window-based tests, (i.e., tests that jointly analyze all genetic data in a small genomic region), are very popular. However, power is known to be quite low for finding associations with phenotypes using these tests, and therefore a variety of analytic strategies may be employed to potentially improve power. Using sequencing data of all of chromosome 3 from an interim release of data on 2432 individuals from the UK10K project, we simulated phenotypes associated with rare genetic variation, and used the results to explore the window-based test power. We asked two specific questions: firstly, whether there could be substantial benefits associated with incorporating information from external annotation on the genetic variants, and secondly whether the false discovery rate (FDRs) would be a useful metric for assessing significance. Although, as expected, there are benefits to using additional information (such as annotation) when it is associated with causality, we confirmed the general pattern of low sensitivity and power for window-based tests. For our chosen example, even when power is high to detect some of the associations, many of the regions containing causal variants are not detectable, despite using lax significance thresholds and optimal analytic methods. Furthermore, our estimated FDR values tended to be much smaller than the true FDRs. Long-range correlations between variants-due to linkage disequilibrium-likely explain some of this bias. A more sophisticated approach to using the annotation information may improve power, however, many causal variants of realistic effect sizes may simply be undetectable, at least with this sample size. Perhaps annotation information could assist in distinguishing windows containing causal variants from windows that are merely correlated with causal variants.

摘要

在分析外显子组或全基因组测序研究产生的数据时,基于窗口的测试(即联合分析小基因组区域内所有遗传数据的测试)非常流行。然而,使用这些测试发现与表型相关联的关联的功效已知是相当低的,因此可以采用各种分析策略来提高潜在的功效。使用来自 UK10K 项目的 2432 个人的中间数据释放的所有染色体 3 的测序数据,我们模拟了与罕见遗传变异相关的表型,并使用结果来探索基于窗口的测试功效。我们提出了两个具体问题:首先,是否可以从遗传变异的外部注释中获得大量相关信息;其次,错误发现率(FDR)是否可以作为评估显着性的有用指标。尽管,如预期的那样,当附加信息(如注释)与因果关系相关时,会有好处,但我们确认了基于窗口的测试的敏感性和功效普遍较低的模式。对于我们选择的示例,即使在检测到一些关联的功效很高的情况下,许多包含因果变异的区域仍然无法检测到,尽管使用了宽松的显着性阈值和最佳分析方法。此外,我们估计的 FDR 值往往远小于真实 FDR 值。由于连锁不平衡导致的变异之间的长程相关性可能解释了部分偏差。使用注释信息的更复杂方法可能会提高功效,但是,许多现实效应大小的因果变异可能根本无法检测到,至少在这种样本量下是如此。也许注释信息可以帮助区分包含因果变异的窗口和仅与因果变异相关的窗口。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56a9/3905218/9b50011a45c9/fgene-05-00011-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56a9/3905218/d77425b769c4/fgene-05-00011-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56a9/3905218/86a39dd8e073/fgene-05-00011-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56a9/3905218/fc0582e183ae/fgene-05-00011-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56a9/3905218/9b50011a45c9/fgene-05-00011-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56a9/3905218/d77425b769c4/fgene-05-00011-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56a9/3905218/86a39dd8e073/fgene-05-00011-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56a9/3905218/fc0582e183ae/fgene-05-00011-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56a9/3905218/9b50011a45c9/fgene-05-00011-g0004.jpg

相似文献

1
Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation.探讨基于分层假发现率的区域关联测试在稀有遗传变异中的潜在益处。
Front Genet. 2014 Jan 29;5:11. doi: 10.3389/fgene.2014.00011. eCollection 2014.
2
GENOME-WIDE ASSOCIATION MAPPING AND RARE ALLELES: FROM POPULATION GENOMICS TO PERSONALIZED MEDICINE - Session Introduction.全基因组关联图谱与罕见等位基因:从群体基因组学到个性化医学——会议介绍
Pac Symp Biocomput. 2011:74-5. doi: 10.1142/9789814335058_0008.
3
Estimating genome-wide significance for whole-genome sequencing studies.估算全基因组测序研究的全基因组显著性。
Genet Epidemiol. 2014 May;38(4):281-90. doi: 10.1002/gepi.21797. Epub 2014 Feb 14.
4
Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle.基于注释对序列变异进行加权可提高奶牛全基因组关联研究的效力。
Genet Sel Evol. 2019 May 10;51(1):20. doi: 10.1186/s12711-019-0463-9.
5
The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.基于基因的罕见变异方法在检测疾病相关变异以及检验关于复杂疾病的假设方面的能力。
PLoS Genet. 2015 Apr 23;11(4):e1005165. doi: 10.1371/journal.pgen.1005165. eCollection 2015 Apr.
6
Two-stage extreme phenotype sequencing design for discovering and testing common and rare genetic variants: efficiency and power.用于发现和测试常见及罕见遗传变异的两阶段极端表型测序设计:效率与效能
Hum Hered. 2012;73(3):139-47. doi: 10.1159/000337300. Epub 2012 Jun 7.
7
Localization of association signal from risk and protective variants in sequencing studies.测序研究中风险和保护性变异的关联信号定位
Front Genet. 2012 Sep 6;3:173. doi: 10.3389/fgene.2012.00173. eCollection 2012.
8
BioBin: a bioinformatics tool for automating the binning of rare variants using publicly available biological knowledge.BioBin:一个生物信息学工具,用于利用公开可用的生物知识自动对稀有变体进行分类。
BMC Med Genomics. 2013;6 Suppl 2(Suppl 2):S6. doi: 10.1186/1755-8794-6-S2-S6. Epub 2013 May 7.
9
Pathway analysis approaches for rare and common variants: insights from Genetic Analysis Workshop 18.罕见和常见变异的通路分析方法:来自遗传分析研讨会18的见解
Genet Epidemiol. 2014 Sep;38 Suppl 1(0 1):S86-91. doi: 10.1002/gepi.21831.
10
Evaluating the Calibration and Power of Three Gene-Based Association Tests of Rare Variants for the X Chromosome.评估X染色体上三种基于基因的罕见变异关联测试的校准度和效能。
Genet Epidemiol. 2015 Nov;39(7):499-508. doi: 10.1002/gepi.21935. Epub 2015 Oct 10.

引用本文的文献

1
Weighted functional linear regression models for gene-based association analysis.用于基于基因的关联分析的加权功能线性回归模型。
PLoS One. 2018 Jan 8;13(1):e0190486. doi: 10.1371/journal.pone.0190486. eCollection 2018.
2
Comparison of single-marker and multi-marker tests in rare variant association studies of quantitative traits.数量性状罕见变异关联研究中单标记与多标记检验的比较。
PLoS One. 2017 May 31;12(5):e0178504. doi: 10.1371/journal.pone.0178504. eCollection 2017.
3
Discovery of Cancer Driver Long Noncoding RNAs across 1112 Tumour Genomes: New Candidates and Distinguishing Features.

本文引用的文献

1
Estimating genome-wide significance for whole-genome sequencing studies.估算全基因组测序研究的全基因组显著性。
Genet Epidemiol. 2014 May;38(4):281-90. doi: 10.1002/gepi.21797. Epub 2014 Feb 14.
2
A sequence of methodological changes due to sequencing.测序引发的一系列方法学变革。
Curr Opin Allergy Clin Immunol. 2013 Oct;13(5):470-7. doi: 10.1097/ACI.0b013e3283648f68.
3
Empirical power of very rare variants for common traits and disease: results from sanger sequencing 1998 individuals.非常罕见变异对常见性状和疾病的经验效力:来自对 1998 个人进行桑格测序的结果。
在 1112 个肿瘤基因组中发现癌症驱动长非编码 RNA:新的候选物和鉴别特征。
Sci Rep. 2017 Jan 27;7:41544. doi: 10.1038/srep41544.
4
Assessing the effects of multiple markers in genetic association studies.评估基因关联研究中多个标记物的作用。
Front Genet. 2015 Feb 24;6:66. doi: 10.3389/fgene.2015.00066. eCollection 2015.
Eur J Hum Genet. 2013 Sep;21(9):1027-30. doi: 10.1038/ejhg.2012.284. Epub 2013 Jan 16.
4
An integrated map of genetic variation from 1,092 human genomes.1092 个人类基因组遗传变异的综合图谱。
Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.
5
The effect of correlation in false discovery rate estimation.相关性在错误发现率估计中的作用。
Biometrika. 2011 Mar;98(1):199-214. doi: 10.1093/biomet/asq075.
6
Localization of association signal from risk and protective variants in sequencing studies.测序研究中风险和保护性变异的关联信号定位
Front Genet. 2012 Sep 6;3:173. doi: 10.3389/fgene.2012.00173. eCollection 2012.
7
ENCODE: The human encyclopaedia.ENCODE:人类百科全书。
Nature. 2012 Sep 6;489(7414):46-8. doi: 10.1038/489046a.
8
Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies.最优统一方法用于罕见变异关联测试及其在小样本病例对照全外显子测序研究中的应用。
Am J Hum Genet. 2012 Aug 10;91(2):224-37. doi: 10.1016/j.ajhg.2012.06.007. Epub 2012 Aug 2.
9
Detecting rare variant associations by identity-by-descent mapping in case-control studies.利用病例对照研究中的亲缘关系映射检测罕见变异关联。
Genetics. 2012 Apr;190(4):1521-31. doi: 10.1534/genetics.111.136937. Epub 2012 Jan 20.
10
A combined functional annotation score for non-synonymous variants.一种针对非同义变异的综合功能注释评分
Hum Hered. 2012;73(1):47-51. doi: 10.1159/000334984. Epub 2012 Jan 18.