将非编码注释纳入罕见变异分析。

Incorporating Non-Coding Annotations into Rare Variant Analysis.

作者信息

Richardson Tom G, Campbell Colin, Timpson Nicholas J, Gaunt Tom R

机构信息

MRC Integrative Epidemiology Unit, School of Social and Community Medicine, University of Bristol, Bristol, United Kingdom.

Intelligent Systems Laboratory, University of Bristol, Bristol, United Kingdom.

出版信息

PLoS One. 2016 Apr 29;11(4):e0154181. doi: 10.1371/journal.pone.0154181. eCollection 2016.

DOI:10.1371/journal.pone.0154181

PMID:27128317

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4851421/

Abstract

BACKGROUND

The success of collapsing methods which investigate the combined effect of rare variants on complex traits has so far been limited. The manner in which variants within a gene are selected prior to analysis has a crucial impact on this success, which has resulted in analyses conventionally filtering variants according to their consequence. This study investigates whether an alternative approach to filtering, using annotations from recently developed bioinformatics tools, can aid these types of analyses in comparison to conventional approaches.

METHODS & RESULTS: We conducted a candidate gene analysis using the UK10K sequence and lipids data, filtering according to functional annotations using the resource CADD (Combined Annotation-Dependent Depletion) and contrasting results with 'nonsynonymous' and 'loss of function' consequence analyses. Using CADD allowed the inclusion of potentially deleterious intronic variants, which was not possible when filtering by consequence. Overall, different filtering approaches provided similar evidence of association, although filtering according to CADD identified evidence of association between ANGPTL4 and High Density Lipoproteins (P = 0.02, N = 3,210) which was not observed in the other analyses. We also undertook genome-wide analyses to determine how filtering in this manner compared to conventional approaches for gene regions. Results suggested that filtering by annotations according to CADD, as well as other tools known as FATHMM-MKL and DANN, identified association signals not detected when filtering by variant consequence and vice versa.

CONCLUSION

Incorporating variant annotations from non-coding bioinformatics tools should prove to be a valuable asset for rare variant analyses in the future. Filtering by variant consequence is only possible in coding regions of the genome, whereas utilising non-coding bioinformatics annotations provides an opportunity to discover unknown causal variants in non-coding regions as well. This should allow studies to uncover a greater number of causal variants for complex traits and help elucidate their functional role in disease.

摘要

背景

迄今为止，研究罕见变异对复杂性状综合影响的合并方法的成功率有限。在分析之前选择基因内变异的方式对这种成功率有至关重要的影响，这导致分析通常根据变异的结果进行筛选。本研究调查了一种使用最近开发的生物信息学工具注释进行筛选的替代方法，与传统方法相比，是否有助于这类分析。

方法与结果

我们使用UK10K序列和脂质数据进行了候选基因分析，使用资源CADD（综合注释依赖损耗）根据功能注释进行筛选，并将结果与“非同义”和“功能丧失”结果分析进行对比。使用CADD允许纳入潜在有害的内含子变异，而按结果筛选时则不可能。总体而言，不同的筛选方法提供了相似的关联证据，尽管根据CADD筛选发现了血管生成素样蛋白4（ANGPTL4）与高密度脂蛋白之间的关联证据（P = 0.02，N = 3210），这在其他分析中未观察到。我们还进行了全基因组分析，以确定这种筛选方式与基因区域的传统方法相比如何。结果表明，根据CADD以及其他称为FATHMM-MKL和DANN的工具进行注释筛选，识别出了按变异结果筛选时未检测到的关联信号，反之亦然。

结论

纳入来自非编码生物信息学工具的变异注释在未来应被证明是罕见变异分析的一项宝贵资产。按变异结果进行筛选仅在基因组的编码区域可行，而利用非编码生物信息学注释也提供了在非编码区域发现未知因果变异的机会。这应使研究能够发现更多复杂性状的因果变异，并有助于阐明它们在疾病中的功能作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88dc/4851421/f05fbd6146f4/pone.0154181.g001.jpg

相似文献

Incorporating Non-Coding Annotations into Rare Variant Analysis.将非编码注释纳入罕见变异分析。

PLoS One. 2016 Apr 29;11(4):e0154181. doi: 10.1371/journal.pone.0154181. eCollection 2016.

A pathway-centric approach to rare variant association analysis.一种以通路为中心的罕见变异关联分析方法。

Eur J Hum Genet. 2016 Jan;25(1):123-129. doi: 10.1038/ejhg.2016.113. Epub 2016 Aug 31.

Unified Sequence-Based Association Tests Allowing for Multiple Functional Annotations and Meta-analysis of Noncoding Variation in Metabochip Data.基于统一序列的关联测试，支持多种功能注释以及代谢芯片数据中非编码变异的荟萃分析。

Am J Hum Genet. 2017 Sep 7;101(3):340-352. doi: 10.1016/j.ajhg.2017.07.011. Epub 2017 Aug 24.

In Silico Functional Annotation of Genomic Variation.基因组变异的计算机功能注释

Curr Protoc Hum Genet. 2016 Jan 1;88:6.15.1-6.15.17. doi: 10.1002/0471142905.hg0615s88.

dbNSFP v3.0: A One-Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice-Site SNVs.dbNSFP v3.0：一个用于人类非同义突变和剪接位点单核苷酸变异的功能预测与注释一站式数据库。

Hum Mutat. 2016 Mar;37(3):235-41. doi: 10.1002/humu.22932. Epub 2016 Jan 5.

FunSPU: A versatile and adaptive multiple functional annotation-based association test of whole-genome sequencing data.FunSPU：一种基于多功能注释的全基因组测序数据关联测试的通用和自适应方法。

PLoS Genet. 2019 Apr 29;15(4):e1008081. doi: 10.1371/journal.pgen.1008081. eCollection 2019 Apr.

An integrative approach to predicting the functional effects of non-coding and coding sequence variation.一种预测非编码和编码序列变异功能效应的综合方法。

Bioinformatics. 2015 May 15;31(10):1536-43. doi: 10.1093/bioinformatics/btv009. Epub 2015 Jan 11.

Functional architecture of low-frequency variants highlights strength of negative selection across coding and non-coding annotations.低频变异的功能结构凸显了负选择在编码和非编码注释上的强大作用。

Nat Genet. 2018 Nov;50(11):1600-1607. doi: 10.1038/s41588-018-0231-8. Epub 2018 Oct 8.

Deep sequencing of Danish Holstein dairy cattle for variant detection and insight into potential loss-of-function variants in protein coding genes.对丹麦荷斯坦奶牛进行深度测序，以检测变异并深入了解蛋白质编码基因中潜在的功能丧失变异。

BMC Genomics. 2015 Dec 9;16:1043. doi: 10.1186/s12864-015-2249-y.

A community-based resource for automatic exome variant-calling and annotation in Mendelian disorders.一个基于社区的用于孟德尔疾病中自动外显子组变异检测和注释的资源。

BMC Genomics. 2014;15 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2164-15-S3-S5. Epub 2014 May 6.

引用本文的文献

Clinical significance of genetic variation in hypertrophic cardiomyopathy: comparison of computational tools to prioritize missense variants.肥厚型心肌病基因变异的临床意义：用于对错义变异进行优先级排序的计算工具比较

Front Cardiovasc Med. 2022 Aug 18;9:975478. doi: 10.3389/fcvm.2022.975478. eCollection 2022.

A phenome-wide association study of 26 mendelian genes reveals phenotypic expressivity of common and rare variants within the general population.一项针对 26 个孟德尔基因的全基因组关联研究揭示了常见和罕见变异在普通人群中的表型表达。

PLoS Genet. 2020 Nov 23;16(11):e1008802. doi: 10.1371/journal.pgen.1008802. eCollection 2020 Nov.

Targeted sequencing to identify novel genetic risk factors for deep vein thrombosis: a study of 734 genes.靶向测序鉴定深静脉血栓形成的新遗传风险因素：一项研究 734 个基因。

J Thromb Haemost. 2018 Dec;16(12):2432-2441. doi: 10.1111/jth.14279. Epub 2018 Oct 16.

Rare variants in drug target genes contributing to complex diseases, phenome-wide.药物靶点基因中的罕见变异与复杂疾病表型全基因组关联研究

Sci Rep. 2018 Mar 15;8(1):4624. doi: 10.1038/s41598-018-22834-4.

Power Analysis for Genetic Association Test (PAGEANT) provides insights to challenges for rare variant association studies.遗传关联测试的功效分析（PAGEANT）为罕见变异关联研究的挑战提供了深入的见解。

Bioinformatics. 2018 May 1;34(9):1506-1513. doi: 10.1093/bioinformatics/btx770.

本文引用的文献

Hum Mutat. 2016 Mar;37(3):235-41. doi: 10.1002/humu.22932. Epub 2016 Jan 5.

The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.基于基因的罕见变异方法在检测疾病相关变异以及检验关于复杂疾病的假设方面的能力。

PLoS Genet. 2015 Apr 23;11(4):e1005165. doi: 10.1371/journal.pgen.1005165. eCollection 2015 Apr.

An integrative approach to predicting the functional effects of non-coding and coding sequence variation.一种预测非编码和编码序列变异功能效应的综合方法。

Bioinformatics. 2015 May 15;31(10):1536-43. doi: 10.1093/bioinformatics/btv009. Epub 2015 Jan 11.

DANN: a deep learning approach for annotating the pathogenicity of genetic variants.DANN：一种用于注释基因变异致病性的深度学习方法。

Bioinformatics. 2015 Mar 1;31(5):761-3. doi: 10.1093/bioinformatics/btu703. Epub 2014 Oct 22.

A rare variant in APOC3 is associated with plasma triglyceride and VLDL levels in Europeans.APOC3基因中的一种罕见变异与欧洲人的血浆甘油三酯和极低密度脂蛋白水平相关。

Nat Commun. 2014 Sep 16;5:4871. doi: 10.1038/ncomms5871.

Rare-variant association analysis: study designs and statistical tests.罕见变异关联分析：研究设计与统计检验。

Am J Hum Genet. 2014 Jul 3;95(1):5-23. doi: 10.1016/j.ajhg.2014.06.009.

Defining functional DNA elements in the human genome.定义人类基因组中的功能 DNA 元件。

Proc Natl Acad Sci U S A. 2014 Apr 29;111(17):6131-8. doi: 10.1073/pnas.1318948111. Epub 2014 Apr 21.

Obesity-associated variants within FTO form long-range functional connections with IRX3.FTO基因内与肥胖相关的变异与IRX3形成远距离功能连接。

Nature. 2014 Mar 20;507(7492):371-5. doi: 10.1038/nature13138. Epub 2014 Mar 12.

Association of low-frequency and rare coding-sequence variants with blood lipids and coronary heart disease in 56,000 whites and blacks.低频和罕见编码序列变异与 56000 名白人和黑人的血脂和冠心病的关联。

Am J Hum Genet. 2014 Feb 6;94(2):223-32. doi: 10.1016/j.ajhg.2014.01.009.

A general framework for estimating the relative pathogenicity of human genetic variants.一种用于估计人类遗传变异相对致病性的通用框架。

Nat Genet. 2014 Mar;46(3):310-5. doi: 10.1038/ng.2892. Epub 2014 Feb 2.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

将非编码注释纳入罕见变异分析。

Incorporating Non-Coding Annotations into Rare Variant Analysis.

作者信息

机构信息

出版信息

BACKGROUND

CONCLUSION

背景

方法与结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献