• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种快速且抗噪的方法,用于检测复杂疾病的深度测序数据中的罕见变异关联。

A fast and noise-resilient approach to detect rare-variant associations with deep sequencing data for complex disorders.

机构信息

Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, New York 10032, USA.

出版信息

Genet Epidemiol. 2012 Nov;36(7):675-85. doi: 10.1002/gepi.21662. Epub 2012 Aug 3.

DOI:10.1002/gepi.21662
PMID:22865616
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6240912/
Abstract

Next generation sequencing technology has enabled the paradigm shift in genetic association studies from the common disease/common variant to common disease/rare-variant hypothesis. Analyzing individual rare variants is known to be underpowered; therefore association methods have been developed that aggregate variants across a genetic region, which for exome sequencing is usually a gene. The foreseeable widespread use of whole genome sequencing poses new challenges in statistical analysis. It calls for new rare-variant association methods that are statistically powerful, robust against high levels of noise due to inclusion of noncausal variants, and yet computationally efficient. We propose a simple and powerful statistic that combines the disease-associated P-values of individual variants using a weight that is the inverse of the expected standard deviation of the allele frequencies under the null. This approach, dubbed as Sigma-P method, is extremely robust to the inclusion of a high proportion of noncausal variants and is also powerful when both detrimental and protective variants are present within a genetic region. The performance of the Sigma-P method was tested using simulated data based on realistic population demographic and disease models and its power was compared to several previously published methods. The results demonstrate that this method generally outperforms other rare-variant association methods over a wide range of models. Additionally, sequence data on the ANGPTL family of genes from the Dallas Heart Study were tested for associations with nine metabolic traits and both known and novel putative associations were uncovered using the Sigma-P method.

摘要

下一代测序技术使遗传关联研究从常见疾病/常见变异到常见疾病/罕见变异假说发生了范式转变。分析个体罕见变异的功效通常较低;因此,已经开发了一些关联方法,可以在遗传区域内聚集变异,对于外显子组测序来说,通常是一个基因。可预见的全基因组测序的广泛应用给统计分析带来了新的挑战。这需要新的罕见变异关联方法,这些方法在统计学上具有强大的功效,能够抵抗由于包含非因果变异而导致的高水平噪声,并且计算效率高。我们提出了一种简单而强大的统计方法,该方法使用一种权重来组合个体变异的疾病相关 P 值,权重为在零假设下等位基因频率的预期标准偏差的倒数。这种方法称为 Sigma-P 方法,即使包含大量非因果变异,也具有极强的稳健性,并且在遗传区域内存在有害和保护变异时也具有强大的功效。使用基于现实人口统计学和疾病模型的模拟数据测试了 Sigma-P 方法的性能,并将其功效与几种先前发表的方法进行了比较。结果表明,该方法在广泛的模型范围内通常优于其他罕见变异关联方法。此外,使用 Sigma-P 方法对来自达拉斯心脏研究的 ANGPTL 基因家族的序列数据进行了与九个代谢特征的关联测试,发现了已知和新的假定关联。

相似文献

1
A fast and noise-resilient approach to detect rare-variant associations with deep sequencing data for complex disorders.一种快速且抗噪的方法,用于检测复杂疾病的深度测序数据中的罕见变异关联。
Genet Epidemiol. 2012 Nov;36(7):675-85. doi: 10.1002/gepi.21662. Epub 2012 Aug 3.
2
A novel adaptive method for the analysis of next-generation sequencing data to detect complex trait associations with rare variants due to gene main effects and interactions.一种用于分析下一代测序数据的新自适应方法,用于检测由于基因主效应和相互作用而导致的复杂性状关联的罕见变异体。
PLoS Genet. 2010 Oct 14;6(10):e1001156. doi: 10.1371/journal.pgen.1001156.
3
A power set-based statistical selection procedure to locate susceptible rare variants associated with complex traits with sequencing data.一种基于幂集的统计选择程序,用于利用测序数据定位与复杂性状相关的易感罕见变异。
Bioinformatics. 2014 Aug 15;30(16):2317-23. doi: 10.1093/bioinformatics/btu207. Epub 2014 Apr 22.
4
A weighted U-statistic for genetic association analyses of sequencing data.一种用于测序数据遗传关联分析的加权U统计量。
Genet Epidemiol. 2014 Dec;38(8):699-708. doi: 10.1002/gepi.21864. Epub 2014 Oct 20.
5
Detecting rare variant effects using extreme phenotype sampling in sequencing association studies.利用测序关联研究中的极端表型抽样检测罕见变异效应。
Genet Epidemiol. 2013 Feb;37(2):142-51. doi: 10.1002/gepi.21699. Epub 2012 Nov 26.
6
Association studies for next-generation sequencing.下一代测序的关联研究。
Genome Res. 2011 Jul;21(7):1099-108. doi: 10.1101/gr.115998.110. Epub 2011 Apr 26.
7
A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data.一种基于基因组信息含量的新型统计方法,用于针对下一代测序数据的全基因组关联分析。
J Comput Biol. 2012 Jun;19(6):731-44. doi: 10.1089/cmb.2012.0035. Epub 2012 May 31.
8
A generalized genetic random field method for the genetic association analysis of sequencing data.一种用于测序数据遗传关联分析的广义遗传随机场方法。
Genet Epidemiol. 2014 Apr;38(3):242-53. doi: 10.1002/gepi.21790. Epub 2014 Jan 30.
9
Statistical selection strategy for risk and protective rare variants associated with complex traits.与复杂性状相关的风险和保护性罕见变异的统计选择策略。
J Comput Biol. 2015 Nov;22(11):1034-43. doi: 10.1089/cmb.2015.0091. Epub 2015 Oct 15.
10
Smoothed functional principal component analysis for testing association of the entire allelic spectrum of genetic variation.平滑功能主成分分析检验全等位基因谱遗传变异的关联。
Eur J Hum Genet. 2013 Feb;21(2):217-24. doi: 10.1038/ejhg.2012.141. Epub 2012 Jul 11.

引用本文的文献

1
Excalibur: A new ensemble method based on an optimal combination of aggregation tests for rare-variant association testing for sequencing data.Excalibur:一种新的基于聚合检验最优组合的测序数据罕见变异关联检验的集成方法。
PLoS Comput Biol. 2023 Sep 14;19(9):e1011488. doi: 10.1371/journal.pcbi.1011488. eCollection 2023 Sep.
2
Association detection between ordinal trait and rare variants based on adaptive combination of P values.基于 P 值自适应组合的有序性状与稀有变异关联检测。
J Hum Genet. 2018 Jan;63(1):37-45. doi: 10.1038/s10038-017-0354-2. Epub 2017 Nov 7.
3
DoEstRare: A statistical test to identify local enrichments in rare genomic variants associated with disease.DoEstRare:一种用于识别与疾病相关的罕见基因组变异中局部富集情况的统计检验。
PLoS One. 2017 Jul 24;12(7):e0179364. doi: 10.1371/journal.pone.0179364. eCollection 2017.
4
Multiple Group Testing Procedures for Analysis of High-Dimensional Genomic Data.用于高维基因组数据分析的多重组检验程序。
Genomics Inform. 2016 Dec;14(4):187-195. doi: 10.5808/GI.2016.14.4.187. Epub 2016 Dec 30.
5
Conditioning adaptive combination of P-values method to analyze case-parent trios with or without population controls.用于分析有或没有群体对照的病例-父母三联体的P值条件适应性组合方法
Sci Rep. 2016 Jun 24;6:28389. doi: 10.1038/srep28389.
6
Beyond Rare-Variant Association Testing: Pinpointing Rare Causal Variants in Case-Control Sequencing Study.超越罕见变异关联测试:在病例对照测序研究中精准定位罕见致病变异
Sci Rep. 2016 Feb 23;6:21824. doi: 10.1038/srep21824.
7
Detecting association of rare and common variants by adaptive combination of P-values.通过P值的自适应组合检测罕见和常见变异的关联。
Genet Res (Camb). 2015 Oct 6;97:e20. doi: 10.1017/S0016672315000208.
8
Group association test using a hidden Markov model.使用隐马尔可夫模型的群组关联测试。
Biostatistics. 2016 Apr;17(2):221-34. doi: 10.1093/biostatistics/kxv035. Epub 2015 Sep 28.
9
Adaptive combination of P-values for family-based association testing with sequence data.用于基于家系的序列数据关联测试的P值自适应组合
PLoS One. 2014 Dec 26;9(12):e115971. doi: 10.1371/journal.pone.0115971. eCollection 2014.
10
Reproducible simulations of realistic samples for next-generation sequencing studies using Variant Simulation Tools.使用变异模拟工具对下一代测序研究的真实样本进行可重复模拟。
Genet Epidemiol. 2015 Jan;39(1):45-52. doi: 10.1002/gepi.21867. Epub 2014 Nov 13.

本文引用的文献

1
Stratification-score matching improves correction for confounding by population stratification in case-control association studies.分层评分匹配可改善病例对照关联研究中因群体分层导致的混杂校正。
Genet Epidemiol. 2012 Apr;36(3):195-205. doi: 10.1002/gepi.21611.
2
Distribution of allele frequencies and effect sizes and their interrelationships for common genetic susceptibility variants.常见遗传易感性变异的等位基因频率和效应大小的分布及其相互关系。
Proc Natl Acad Sci U S A. 2011 Nov 1;108(44):18026-31. doi: 10.1073/pnas.1114759108. Epub 2011 Oct 14.
3
A general framework for detecting disease associations with rare variants in sequencing studies.一种用于在测序研究中检测罕见变异与疾病关联的通用框架。
Am J Hum Genet. 2011 Sep 9;89(3):354-67. doi: 10.1016/j.ajhg.2011.07.015. Epub 2011 Sep 1.
4
Rare-variant association testing for sequencing data with the sequence kernel association test.基于序列核关联检验的测序数据罕见变异关联分析
Am J Hum Genet. 2011 Jul 15;89(1):82-93. doi: 10.1016/j.ajhg.2011.05.029. Epub 2011 Jul 7.
5
Bias due to selection of rare variants using frequency in controls.因使用对照中的频率来选择罕见变异而导致的偏差。
Nat Genet. 2011 May;43(5):392-3; author reply 394-5. doi: 10.1038/ng.816.
6
Defining rare variants by their frequencies in controls may increase type I error.根据罕见变异在对照中的频率来定义它们可能会增加I型错误。
Nat Genet. 2011 May;43(5):391-2; author reply 394-5. doi: 10.1038/ng.818.
7
Testing for an unusual distribution of rare variants.检测罕见变异的异常分布。
PLoS Genet. 2011 Mar;7(3):e1001322. doi: 10.1371/journal.pgen.1001322. Epub 2011 Mar 3.
8
A new testing strategy to identify rare variants with either risk or protective effect on disease.一种新的检测策略,用于识别对疾病具有风险或保护作用的罕见变异。
PLoS Genet. 2011 Feb 3;7(2):e1001289. doi: 10.1371/journal.pgen.1001289.
9
Simulating sequences of the human genome with rare variants.模拟含有罕见变异的人类基因组序列。
Hum Hered. 2010;70(4):287-91. doi: 10.1159/000323316. Epub 2011 Jan 6.
10
A novel adaptive method for the analysis of next-generation sequencing data to detect complex trait associations with rare variants due to gene main effects and interactions.一种用于分析下一代测序数据的新自适应方法,用于检测由于基因主效应和相互作用而导致的复杂性状关联的罕见变异体。
PLoS Genet. 2010 Oct 14;6(10):e1001156. doi: 10.1371/journal.pgen.1001156.