• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大规模测序研究中单个基因座水平的罕见变异关联分析

Rare Variants Association Analysis in Large-Scale Sequencing Studies at the Single Locus Level.

作者信息

Jeng Xinge Jessie, Daye Zhongyin John, Lu Wenbin, Tzeng Jung-Ying

机构信息

Department of Statistics, North Carolina State University, Raleigh, North Carolina, United States of America.

Epidemiology and Biostatistics, University of Arizona, Tucson, Arizona, United States of America.

出版信息

PLoS Comput Biol. 2016 Jun 29;12(6):e1004993. doi: 10.1371/journal.pcbi.1004993. eCollection 2016 Jun.

DOI:10.1371/journal.pcbi.1004993
PMID:27355347
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4927097/
Abstract

Genetic association analyses of rare variants in next-generation sequencing (NGS) studies are fundamentally challenging due to the presence of a very large number of candidate variants at extremely low minor allele frequencies. Recent developments often focus on pooling multiple variants to provide association analysis at the gene instead of the locus level. Nonetheless, pinpointing individual variants is a critical goal for genomic researches as such information can facilitate the precise delineation of molecular mechanisms and functions of genetic factors on diseases. Due to the extreme rarity of mutations and high-dimensionality, significances of causal variants cannot easily stand out from those of noncausal ones. Consequently, standard false-positive control procedures, such as the Bonferroni and false discovery rate (FDR), are often impractical to apply, as a majority of the causal variants can only be identified along with a few but unknown number of noncausal variants. To provide informative analysis of individual variants in large-scale sequencing studies, we propose the Adaptive False-Negative Control (AFNC) procedure that can include a large proportion of causal variants with high confidence by introducing a novel statistical inquiry to determine those variants that can be confidently dispatched as noncausal. The AFNC provides a general framework that can accommodate for a variety of models and significance tests. The procedure is computationally efficient and can adapt to the underlying proportion of causal variants and quality of significance rankings. Extensive simulation studies across a plethora of scenarios demonstrate that the AFNC is advantageous for identifying individual rare variants, whereas the Bonferroni and FDR are exceedingly over-conservative for rare variants association studies. In the analyses of the CoLaus dataset, AFNC has identified individual variants most responsible for gene-level significances. Moreover, single-variant results using the AFNC have been successfully applied to infer related genes with annotation information.

摘要

在下一代测序(NGS)研究中,由于存在大量频率极低的候选变异,对罕见变异进行基因关联分析极具挑战性。近期的研究进展通常聚焦于合并多个变异,以便在基因而非位点水平上进行关联分析。尽管如此,精确识别单个变异仍是基因组研究的关键目标,因为此类信息有助于精准描述遗传因素在疾病中的分子机制和功能。由于突变极其罕见且维度高,因果变异的显著性不易从非因果变异中凸显出来。因此,标准的假阳性控制程序,如邦费罗尼校正和错误发现率(FDR),往往不切实际,因为大多数因果变异只能与少数但数量未知的非因果变异一同被识别。为了在大规模测序研究中对单个变异进行有意义的分析,我们提出了自适应假阴性控制(AFNC)程序,该程序通过引入一种新颖的统计探究来确定那些可被确定为非因果的变异,从而能够以高置信度纳入大部分因果变异。AFNC提供了一个通用框架,可适用于各种模型和显著性检验。该程序计算效率高,能够适应因果变异的潜在比例和显著性排名的质量。在众多场景下进行的广泛模拟研究表明,AFNC在识别单个罕见变异方面具有优势,而邦费罗尼校正和FDR在罕见变异关联研究中则过于保守。在对CoLaus数据集的分析中,AFNC识别出了对基因水平显著性最具责任的单个变异。此外,使用AFNC的单变异结果已成功应用于利用注释信息推断相关基因。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c4/4927097/d6c4351a5b3f/pcbi.1004993.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c4/4927097/80e08dbcb4a1/pcbi.1004993.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c4/4927097/73b8913b2085/pcbi.1004993.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c4/4927097/d6c4351a5b3f/pcbi.1004993.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c4/4927097/80e08dbcb4a1/pcbi.1004993.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c4/4927097/73b8913b2085/pcbi.1004993.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c4/4927097/d6c4351a5b3f/pcbi.1004993.g003.jpg

相似文献

1
Rare Variants Association Analysis in Large-Scale Sequencing Studies at the Single Locus Level.大规模测序研究中单个基因座水平的罕见变异关联分析
PLoS Comput Biol. 2016 Jun 29;12(6):e1004993. doi: 10.1371/journal.pcbi.1004993. eCollection 2016 Jun.
2
DoEstRare: A statistical test to identify local enrichments in rare genomic variants associated with disease.DoEstRare:一种用于识别与疾病相关的罕见基因组变异中局部富集情况的统计检验。
PLoS One. 2017 Jul 24;12(7):e0179364. doi: 10.1371/journal.pone.0179364. eCollection 2017.
3
Reconsidering association testing methods using single-variant test statistics as alternatives to pooling tests for sequence data with rare variants.重新考虑使用单变量检验统计量作为合并检验的替代方法,用于具有罕见变异的序列数据的关联检验方法。
PLoS One. 2012;7(2):e30238. doi: 10.1371/journal.pone.0030238. Epub 2012 Feb 17.
4
A new statistical framework for genetic pleiotropic analysis of high dimensional phenotype data.一种用于高维表型数据遗传多效性分析的新统计框架。
BMC Genomics. 2016 Nov 7;17(1):881. doi: 10.1186/s12864-016-3169-1.
5
A fast and noise-resilient approach to detect rare-variant associations with deep sequencing data for complex disorders.一种快速且抗噪的方法,用于检测复杂疾病的深度测序数据中的罕见变异关联。
Genet Epidemiol. 2012 Nov;36(7):675-85. doi: 10.1002/gepi.21662. Epub 2012 Aug 3.
6
Next-generation DNA sequencing to identify novel genetic risk factors for cerebral vein thrombosis.下一代 DNA 测序技术鉴定脑静脉血栓形成的新遗传风险因素。
Thromb Res. 2018 Sep;169:76-81. doi: 10.1016/j.thromres.2018.06.011. Epub 2018 Jun 15.
7
Beyond Rare-Variant Association Testing: Pinpointing Rare Causal Variants in Case-Control Sequencing Study.超越罕见变异关联测试:在病例对照测序研究中精准定位罕见致病变异
Sci Rep. 2016 Feb 23;6:21824. doi: 10.1038/srep21824.
8
Statistical selection strategy for risk and protective rare variants associated with complex traits.与复杂性状相关的风险和保护性罕见变异的统计选择策略。
J Comput Biol. 2015 Nov;22(11):1034-43. doi: 10.1089/cmb.2015.0091. Epub 2015 Oct 15.
9
Significance of functional disease-causal/susceptible variants identified by whole-genome analyses for the understanding of human diseases.全基因组分析鉴定的功能疾病因果/易感变异在理解人类疾病中的意义。
Proc Jpn Acad Ser B Phys Biol Sci. 2017;93(9):657-676. doi: 10.2183/pjab.93.042.
10
Detecting association of rare and common variants by adaptive combination of P-values.通过P值的自适应组合检测罕见和常见变异的关联。
Genet Res (Camb). 2015 Oct 6;97:e20. doi: 10.1017/S0016672315000208.

引用本文的文献

1
GWAS for identification of genomic regions and candidate genes in vegetable crops.GWAS 用于鉴定蔬菜作物中的基因组区域和候选基因。
Funct Integr Genomics. 2024 Oct 29;24(6):203. doi: 10.1007/s10142-024-01477-x.
2
Transfer learning with false negative control improves polygenic risk prediction.基于假阴性对照的迁移学习可提高多基因风险预测能力。
PLoS Genet. 2023 Nov 27;19(11):e1010597. doi: 10.1371/journal.pgen.1010597. eCollection 2023 Nov.
3
Effective SNP ranking improves the performance of eQTL mapping.有效的 SNP 排名可提高 eQTL 映射的性能。

本文引用的文献

1
SCN9A Variants May be Implicated in Neuropathic Pain Associated With Diabetic Peripheral Neuropathy and Pain Severity.SCN9A基因变异可能与糖尿病性周围神经病变相关的神经性疼痛及疼痛严重程度有关。
Clin J Pain. 2015 Nov;31(11):976-82. doi: 10.1097/AJP.0000000000000205.
2
Identification of rare causal variants in sequence-based studies: methods and applications to VPS13B, a gene involved in Cohen syndrome and autism.基于序列研究中罕见因果变异的鉴定:方法及其在VPS13B基因中的应用,VPS13B基因与科恩综合征和自闭症相关
PLoS Genet. 2014 Dec 11;10(12):e1004729. doi: 10.1371/journal.pgen.1004729. eCollection 2014 Dec.
3
Prenatal exposure to cigarette smoke interacts with OPRM1 to modulate dietary preference for fat.
Genet Epidemiol. 2020 Sep;44(6):611-619. doi: 10.1002/gepi.22293. Epub 2020 Mar 26.
4
Identifying individual risk rare variants using protein structure guided local tests (POINT).利用蛋白结构引导的局部检验(POINT)鉴定个体风险罕见变异。
PLoS Comput Biol. 2019 Feb 19;15(2):e1006722. doi: 10.1371/journal.pcbi.1006722. eCollection 2019 Feb.
5
Whole-Exome Sequencing to Identify Novel Biological Pathways Associated With Infertility After Pelvic Inflammatory Disease.全外显子测序以鉴定与盆腔炎后不孕相关的新生物途径。
Sex Transm Dis. 2017 Jan;44(1):35-41. doi: 10.1097/OLQ.0000000000000533.
孕期接触香烟烟雾会与阿片受体μ1(OPRM1)相互作用,从而调节对脂肪的饮食偏好。
J Psychiatry Neurosci. 2015 Jan;40(1):38-45. doi: 10.1503/jpn.130263.
4
The endocannabinoid-CB2 receptor axis protects the ischemic heart at the early stage of cardiomyopathy.内源性大麻素-CB2 受体轴在心肌病早期阶段保护缺血性心脏。
Basic Res Cardiol. 2014 Jul;109(4):425. doi: 10.1007/s00395-014-0425-x. Epub 2014 Jul 1.
5
Exposing synonymous mutations.揭示同义突变。
Trends Genet. 2014 Jul;30(7):308-21. doi: 10.1016/j.tig.2014.04.006. Epub 2014 Jun 19.
6
TNNI3K mutation in familial syndrome of conduction system disease, atrial tachyarrhythmia and dilated cardiomyopathy.家族性传导系统疾病、房性快速心律失常和扩张型心肌病综合征中的TNNI3K突变
Hum Mol Genet. 2014 Nov 1;23(21):5793-804. doi: 10.1093/hmg/ddu297. Epub 2014 Jun 11.
7
Adaptive synonymous mutations in an experimentally evolved Pseudomonas fluorescens population.在一个经实验进化的荧光假单胞菌群体中适应性同义突变。
Nat Commun. 2014 Jun 10;5:4076. doi: 10.1038/ncomms5076.
8
The role of TNF-alpha/NF-kappa B pathway on the up-regulation of voltage-gated sodium channel Nav1.7 in DRG neurons of rats with diabetic neuropathy.肿瘤坏死因子-α/核因子-κB信号通路在糖尿病性神经病变大鼠背根神经节神经元中对电压门控钠通道Nav1.7上调的作用
Neurochem Int. 2014 Sep;75:112-9. doi: 10.1016/j.neuint.2014.05.012. Epub 2014 Jun 2.
9
A powerful and adaptive association test for rare variants.一种针对罕见变异的强大且自适应的关联测试。
Genetics. 2014 Aug;197(4):1081-95. doi: 10.1534/genetics.114.165035. Epub 2014 May 15.
10
Estimating False Discovery Proportion Under Arbitrary Covariance Dependence.任意协方差依赖下错误发现比例的估计
J Am Stat Assoc. 2012;107(499):1019-1035. doi: 10.1080/01621459.2012.720478.