• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过单核苷酸多态性集的高效精确方差分量测试提高基因定位能力和效率

Boosting Gene Mapping Power and Efficiency with Efficient Exact Variance Component Tests of Single Nucleotide Polymorphism Sets.

作者信息

Zhou Jin J, Hu Tao, Qiao Dandi, Cho Michael H, Zhou Hua

机构信息

Division of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of Arizona, Tucson, Arizona 85724

Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina 27695.

出版信息

Genetics. 2016 Nov;204(3):921-931. doi: 10.1534/genetics.116.190454. Epub 2016 Sep 19.

DOI:10.1534/genetics.116.190454
PMID:27646141
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5105869/
Abstract

Single nucleotide polymorphism (SNP) set tests have been a powerful method in analyzing next-generation sequencing (NGS) data. The popular sequence kernel association test (SKAT) method tests a set of variants as random effects in the linear mixed model setting. Its P-value is calculated based on asymptotic theory that requires a large sample size. Therefore, it is known that SKAT is conservative and can lose power at small or moderate sample sizes. Given the current cost of sequencing technology, scales of NGS are still limited. In this report, we derive and implement computationally efficient, exact (nonasymptotic) score (eScore), likelihood ratio (eLRT), and restricted likelihood ratio (eRLRT) tests, ExactVCTest, that can achieve high power even when sample sizes are small. We perform simulation studies under various genetic scenarios. Our ExactVCTest (i.e., eScore, eLRT, eRLRT) exhibits well-controlled type I error. Under the alternative model, eScore P-values are universally smaller than those from SKAT. eLRT and eRLRT demonstrate significantly higher power than eScore, SKAT, and SKAT optimal (SKAT-o) across all scenarios and various samples sizes. We applied these tests to an exome sequencing study. Our findings replicate previous results and shed light on rare variant effects within genes. The software package is implemented in the open source, high-performance technical computing language Julia, and is freely available at https://github.com/Tao-Hu/VarianceComponentTest.jl Analysis of each trait in the exome sequencing data set with 399 individuals and 16,619 genes takes around 1 min on a desktop computer.

摘要

单核苷酸多态性(SNP)集检验一直是分析下一代测序(NGS)数据的有力方法。流行的序列核关联检验(SKAT)方法在线性混合模型设置中把一组变异作为随机效应进行检验。其P值是基于需要大样本量的渐近理论计算得出的。因此,已知SKAT是保守的,在小样本或中等样本量时会失去检验效能。鉴于当前测序技术的成本,NGS的规模仍然有限。在本报告中,我们推导并实现了计算高效的精确(非渐近)得分(eScore)、似然比(eLRT)和受限似然比(eRLRT)检验,即ExactVCTest,即使在样本量较小时也能实现高检验效能。我们在各种遗传场景下进行了模拟研究。我们的ExactVCTest(即eScore、eLRT、eRLRT)表现出良好控制的I型错误。在备择模型下,eScore的P值普遍小于SKAT的P值。在所有场景和各种样本量下,eLRT和eRLRT的检验效能显著高于eScore、SKAT和SKAT最优检验(SKAT-o)。我们将这些检验应用于一项外显子组测序研究。我们的发现重复了先前的结果,并揭示了基因内罕见变异的效应。该软件包是用开源的高性能技术计算语言Julia实现的,可在https://github.com/Tao-Hu/VarianceComponentTest.jl上免费获取。在一台台式计算机上,对包含399个个体和16,619个基因的外显子组测序数据集中的每个性状进行分析大约需要1分钟。

相似文献

1
Boosting Gene Mapping Power and Efficiency with Efficient Exact Variance Component Tests of Single Nucleotide Polymorphism Sets.通过单核苷酸多态性集的高效精确方差分量测试提高基因定位能力和效率
Genetics. 2016 Nov;204(3):921-931. doi: 10.1534/genetics.116.190454. Epub 2016 Sep 19.
2
Likelihood ratio tests in rare variant detection for continuous phenotypes.连续型表型罕见变异检测中的似然比检验。
Ann Hum Genet. 2014 Sep;78(5):320-32. doi: 10.1111/ahg.12071.
3
Generalized functional linear models for gene-based case-control association studies.用于基于基因的病例对照关联研究的广义功能线性模型。
Genet Epidemiol. 2014 Nov;38(7):622-637. doi: 10.1002/gepi.21840. Epub 2014 Sep 9.
4
RL-SKAT: An Exact and Efficient Score Test for Heritability and Set Tests.RL-SKAT:一种用于遗传力和集合检验的精确且高效的评分检验。
Genetics. 2017 Dec;207(4):1275-1283. doi: 10.1534/genetics.117.300395. Epub 2017 Oct 12.
5
Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies.最优统一方法用于罕见变异关联测试及其在小样本病例对照全外显子测序研究中的应用。
Am J Hum Genet. 2012 Aug 10;91(2):224-37. doi: 10.1016/j.ajhg.2012.06.007. Epub 2012 Aug 2.
6
On Robust Association Testing for Quantitative Traits and Rare Variants.关于数量性状和罕见变异的稳健关联测试
G3 (Bethesda). 2016 Dec 7;6(12):3941-3950. doi: 10.1534/g3.116.035485.
7
AP-SKAT: highly-efficient genome-wide rare variant association test.AP-SKAT:高效的全基因组罕见变异关联测试。
BMC Genomics. 2016 Sep 21;17(1):745. doi: 10.1186/s12864-016-3094-3.
8
A Comparison Study of Fixed and Mixed Effect Models for Gene Level Association Studies of Complex Traits.复杂性状基因水平关联研究中固定效应模型与混合效应模型的比较研究
Genet Epidemiol. 2016 Dec;40(8):702-721. doi: 10.1002/gepi.21984. Epub 2016 Jul 4.
9
On Efficient and Accurate Calculation of Significance P-Values for Sequence Kernel Association Testing of Variant Set.关于变异集序列核关联测试中显著性P值的高效准确计算
Ann Hum Genet. 2016 Mar;80(2):123-35. doi: 10.1111/ahg.12144. Epub 2016 Jan 12.
10
Multi-SKAT: General framework to test for rare-variant association with multiple phenotypes.多基因集联合分析检验(Multi-SKAT):用于检测罕见变异与多种表型关联的通用框架。
Genet Epidemiol. 2019 Feb;43(1):4-23. doi: 10.1002/gepi.22156. Epub 2018 Oct 8.

引用本文的文献

1
Identifying interpretable gene-biomarker associations with functionally informed kernel-based tests in 190,000 exomes.在 190,000 个外显子组中,利用基于功能信息核的测试,识别可解释的基因 - 生物标志物关联。
Nat Commun. 2022 Sep 10;13(1):5332. doi: 10.1038/s41467-022-32864-2.
2
Discovery of rare variants implicated in schizophrenia using next-generation sequencing.利用下一代测序技术发现与精神分裂症相关的罕见变异
J Transl Genet Genom. 2019;3(1):1-20. doi: 10.20517/jtgg.2018.26. Epub 2019 Jan 20.
3
Heritability of interpack aggression in a wild pedigreed population of North American grey wolves.野生北美灰狼 pedigreed 种群中包装间攻击的遗传性。
Mol Ecol. 2020 May;29(10):1764-1775. doi: 10.1111/mec.15349. Epub 2020 Jan 22.
4
MM Algorithms For Variance Components Models.方差分量模型的MM算法
J Comput Graph Stat. 2019;28(2):350-361. doi: 10.1080/10618600.2018.1529601. Epub 2019 Mar 9.
5
OPENMENDEL: a cooperative programming project for statistical genetics.OPENMENDEL:一个用于统计遗传学的合作编程项目。
Hum Genet. 2020 Jan;139(1):61-71. doi: 10.1007/s00439-019-02001-z. Epub 2019 Mar 26.
6
Efficient Variant Set Mixed Model Association Tests for Continuous and Binary Traits in Large-Scale Whole-Genome Sequencing Studies.高效的变体集混合模型关联测试在全基因组测序研究中用于连续和二项性状。
Am J Hum Genet. 2019 Feb 7;104(2):260-274. doi: 10.1016/j.ajhg.2018.12.012. Epub 2019 Jan 10.
7
Exact variance component tests for longitudinal microbiome studies.纵向微生物组研究的精确方差成分检验。
Genet Epidemiol. 2019 Apr;43(3):250-262. doi: 10.1002/gepi.22185. Epub 2019 Jan 8.
8
Cis-SNPs Set Testing and PrediXcan Analysis for Gene Expression Data using Linear Mixed Models.顺式单核苷酸多态性检测和使用线性混合模型的基因表达数据的 PrediXcan 分析。
Sci Rep. 2017 Nov 10;7(1):15237. doi: 10.1038/s41598-017-15055-8.

本文引用的文献

1
Exome Sequencing Analysis in Severe, Early-Onset Chronic Obstructive Pulmonary Disease.严重早发型慢性阻塞性肺疾病的外显子组测序分析
Am J Respir Crit Care Med. 2016 Jun 15;193(12):1353-63. doi: 10.1164/rccm.201506-1223OC.
2
Permutation-based variance component test in generalized linear mixed model with application to multilocus genetic association study.广义线性混合模型中基于排列的方差分量检验及其在多位点基因关联研究中的应用
BMC Med Res Methodol. 2015 Apr 22;15:37. doi: 10.1186/s12874-015-0030-1.
3
Lessons and implications from association studies and post-GWAS analyses of cervical cancer.宫颈癌关联研究和 GWAS 分析的经验教训及启示
Trends Genet. 2015 Jan;31(1):41-54. doi: 10.1016/j.tig.2014.10.005. Epub 2014 Nov 18.
4
Likelihood ratio tests in rare variant detection for continuous phenotypes.连续型表型罕见变异检测中的似然比检验。
Ann Hum Genet. 2014 Sep;78(5):320-32. doi: 10.1111/ahg.12071.
5
Rare-variant association analysis: study designs and statistical tests.罕见变异关联分析:研究设计与统计检验。
Am J Hum Genet. 2014 Jul 3;95(1):5-23. doi: 10.1016/j.ajhg.2014.06.009.
6
Linear score tests for variance components in linear mixed models and applications to genetic association studies.线性混合模型中方差分量的线性得分检验及其在基因关联研究中的应用。
Biometrics. 2013 Dec;69(4):883-92. doi: 10.1111/biom.12095. Epub 2013 Nov 4.
7
Sequence kernel association test for quantitative traits in family samples.基于家系样本的数量性状序列核关联检验。
Genet Epidemiol. 2013 Feb;37(2):196-204. doi: 10.1002/gepi.21703. Epub 2012 Dec 26.
8
Testing multiple variance components in linear mixed-effects models.在线性混合效应模型中检验多个方差分量。
Biostatistics. 2013 Jan;14(1):144-59. doi: 10.1093/biostatistics/kxs028. Epub 2012 Aug 28.
9
Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies.最优统一方法用于罕见变异关联测试及其在小样本病例对照全外显子测序研究中的应用。
Am J Hum Genet. 2012 Aug 10;91(2):224-37. doi: 10.1016/j.ajhg.2012.06.007. Epub 2012 Aug 2.
10
Large-scale genome-wide association analysis of bipolar disorder identifies a new susceptibility locus near ODZ4.大规模全基因组关联分析发现双相情感障碍的新易感基因位点 ODZ4 附近。
Nat Genet. 2011 Sep 18;43(10):977-83. doi: 10.1038/ng.943.