• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

FAVR(稀有变体的过滤和注释):一种从大规模平行测序数据集中分析罕见种系遗传变异的方法。

FAVR (Filtering and Annotation of Variants that are Rare): methods to facilitate the analysis of rare germline genetic variants from massively parallel sequencing datasets.

机构信息

Victorian Life Sciences Computation Initiative, The University of Melbourne, 187 Grattan Street Carlton, Melbourne, Victoria 3010, Australia.

出版信息

BMC Bioinformatics. 2013 Feb 25;14:65. doi: 10.1186/1471-2105-14-65.

DOI:10.1186/1471-2105-14-65
PMID:23441864
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3599469/
Abstract

BACKGROUND

Characterising genetic diversity through the analysis of massively parallel sequencing (MPS) data offers enormous potential to significantly improve our understanding of the genetic basis for observed phenotypes, including predisposition to and progression of complex human disease. Great challenges remain in resolving genetic variants that are genuine from the millions of artefactual signals.

RESULTS

FAVR is a suite of new methods designed to work with commonly used MPS analysis pipelines to assist in the resolution of some of the issues related to the analysis of the vast amount of resulting data, with a focus on relatively rare genetic variants. To the best of our knowledge, no equivalent method has previously been described. The most important and novel aspect of FAVR is the use of signatures in comparator sequence alignment files during variant filtering, and annotation of variants potentially shared between individuals. The FAVR methods use these signatures to facilitate filtering of (i) platform and/or mapping-specific artefacts, (ii) common genetic variants, and, where relevant, (iii) artefacts derived from imbalanced paired-end sequencing, as well as annotation of genetic variants based on evidence of co-occurrence in individuals. We applied conventional variant calling applied to whole-exome sequencing datasets, produced using both SOLiD and TruSeq chemistries, with or without downstream processing by FAVR methods. We demonstrate a 3-fold smaller rare single nucleotide variant shortlist with no detected reduction in sensitivity. This analysis included Sanger sequencing of rare variant signals not evident in dbSNP131, assessment of known variant signal preservation, and comparison of observed and expected rare variant numbers across a range of first cousin pairs. The principles described herein were applied in our recent publication identifying XRCC2 as a new breast cancer risk gene and have been made publically available as a suite of software tools.

CONCLUSIONS

FAVR is a platform-agnostic suite of methods that significantly enhances the analysis of large volumes of sequencing data for the study of rare genetic variants and their influence on phenotypes.

摘要

背景

通过分析大规模平行测序(MPS)数据来描述遗传多样性,为深入了解观察到的表型的遗传基础,包括对复杂人类疾病的易感性和进展,提供了巨大的潜力。从数百万个人为信号中分辨出真正的遗传变异仍然存在巨大的挑战。

结果

FAVR 是一套新的方法,旨在与常用的 MPS 分析管道一起使用,以帮助解决与分析大量相关数据相关的一些问题,重点是相对罕见的遗传变异。据我们所知,以前没有描述过等效的方法。FAVR 最重要和新颖的方面是在变体过滤和潜在个体间共享变体注释过程中使用比较序列比对文件中的特征。FAVR 方法使用这些特征来促进(i)平台和/或映射特定人为因素的过滤、(ii)常见遗传变异,以及在相关情况下,(iii)来自不平衡的配对末端测序的人为因素的过滤,以及基于个体中共同出现证据的遗传变异注释。我们应用了传统的变体调用方法,对使用 SOLiD 和 TruSeq 化学方法生成的全外显子测序数据集进行了分析,这些数据集要么经过 FAVR 方法的下游处理,要么没有经过处理。我们展示了一个罕见的单核苷酸变异短名单,其数量减少了三分之一,而灵敏度没有降低。该分析包括在 dbSNP131 中未明显显示的稀有变异信号的 Sanger 测序、已知变异信号保存的评估,以及在一系列第一代表亲之间观察到的和预期的稀有变异数量的比较。本文所述的原理应用于我们最近发表的一篇论文中,该论文确定 XRCC2 为一种新的乳腺癌风险基因,并已作为一套软件工具公开发布。

结论

FAVR 是一种与平台无关的方法套件,可显著增强对大量测序数据的分析,以研究稀有遗传变异及其对表型的影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fb9/3599469/5855f3319fdd/1471-2105-14-65-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fb9/3599469/40c1e3af9537/1471-2105-14-65-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fb9/3599469/d814836c41e9/1471-2105-14-65-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fb9/3599469/9509d6121281/1471-2105-14-65-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fb9/3599469/5855f3319fdd/1471-2105-14-65-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fb9/3599469/40c1e3af9537/1471-2105-14-65-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fb9/3599469/d814836c41e9/1471-2105-14-65-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fb9/3599469/9509d6121281/1471-2105-14-65-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fb9/3599469/5855f3319fdd/1471-2105-14-65-4.jpg

相似文献

1
FAVR (Filtering and Annotation of Variants that are Rare): methods to facilitate the analysis of rare germline genetic variants from massively parallel sequencing datasets.FAVR(稀有变体的过滤和注释):一种从大规模平行测序数据集中分析罕见种系遗传变异的方法。
BMC Bioinformatics. 2013 Feb 25;14:65. doi: 10.1186/1471-2105-14-65.
2
UNDR ROVER - a fast and accurate variant caller for targeted DNA sequencing.UNDR ROVER——一种用于靶向DNA测序的快速且准确的变异检测工具。
BMC Bioinformatics. 2016 Apr 16;17:165. doi: 10.1186/s12859-016-1014-9.
3
ROVER variant caller: read-pair overlap considerate variant-calling software applied to PCR-based massively parallel sequencing datasets.ROVER变异体检测程序:一种考虑读段重叠的变异体检测软件,应用于基于PCR的大规模平行测序数据集。
Source Code Biol Med. 2014 Jan 24;9(1):3. doi: 10.1186/1751-0473-9-3.
4
SEQSpark: A Complete Analysis Tool for Large-Scale Rare Variant Association Studies Using Whole-Genome and Exome Sequence Data.SEQSpark:一种使用全基因组和外显子组序列数据进行大规模罕见变异关联研究的完整分析工具。
Am J Hum Genet. 2017 Jul 6;101(1):115-122. doi: 10.1016/j.ajhg.2017.05.017. Epub 2017 Jun 29.
5
AMLVaran: a software approach to implement variant analysis of targeted NGS sequencing data in an oncological care setting.AMLVaran:一种在肿瘤护理环境中对靶向NGS测序数据进行变异分析的软件方法。
BMC Med Genomics. 2020 Feb 4;13(1):17. doi: 10.1186/s12920-020-0668-3.
6
CSN and CAVA: variant annotation tools for rapid, robust next-generation sequencing analysis in the clinical setting.CSN和CAVA:用于临床环境中快速、可靠的下一代测序分析的变异注释工具。
Genome Med. 2015 Jul 28;7(1):76. doi: 10.1186/s13073-015-0195-6.
7
A survey of tools for variant analysis of next-generation genome sequencing data.下一代基因组测序数据变异分析工具综述。
Brief Bioinform. 2014 Mar;15(2):256-78. doi: 10.1093/bib/bbs086. Epub 2013 Jan 21.
8
Genome analysis and knowledge-driven variant interpretation with TGex.基因组分析和基于 TGex 的知识驱动的变异解释。
BMC Med Genomics. 2019 Dec 30;12(1):200. doi: 10.1186/s12920-019-0647-8.
9
A community-based resource for automatic exome variant-calling and annotation in Mendelian disorders.一个基于社区的用于孟德尔疾病中自动外显子组变异检测和注释的资源。
BMC Genomics. 2014;15 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2164-15-S3-S5. Epub 2014 May 6.
10
Variant Call Format-Diagnostic Annotation and Reporting Tool: A Customizable Analysis Pipeline for Identification of Clinically Relevant Genetic Variants in Next-Generation Sequencing Data.变异调用格式-诊断注释和报告工具:一种用于鉴定下一代测序数据中临床相关遗传变异的可定制分析管道。
J Mol Diagn. 2019 Nov;21(6):951-960. doi: 10.1016/j.jmoldx.2019.07.001. Epub 2019 Aug 20.

引用本文的文献

1
Genetic diversity of Plasmodium falciparum reticulocyte binding protein homologue-5, which is a potential malaria vaccine candidate: baseline data from areas of varying malaria endemicity in Mainland Tanzania.恶性疟原虫网织红细胞结合蛋白同源物5的遗传多样性,该蛋白是一种潜在的疟疾疫苗候选物:坦桑尼亚大陆不同疟疾流行程度地区的基线数据
Malar J. 2025 Jan 27;24(1):29. doi: 10.1186/s12936-025-05269-x.
2
Mutation screening of ACKR3 and COPS8 in kidney cancer cases from the CONFIRM study.来自CONFIRM研究的肾癌病例中ACKR3和COPS8的突变筛查
Fam Cancer. 2017 Jul;16(3):411-416. doi: 10.1007/s10689-016-9961-x.
3
Pedigree based DNA sequencing pipeline for germline genomes of cancer families.

本文引用的文献

1
Rare mutations in XRCC2 increase the risk of breast cancer.XRCC2 中的罕见突变会增加乳腺癌的风险。
Am J Hum Genet. 2012 Apr 6;90(4):734-9. doi: 10.1016/j.ajhg.2012.02.027. Epub 2012 Mar 29.
2
The variant call format and VCFtools.变异调用格式和 VCFtools。
Bioinformatics. 2011 Aug 1;27(15):2156-8. doi: 10.1093/bioinformatics/btr330. Epub 2011 Jun 7.
3
Genome partitioning of genetic variation for complex traits using common SNPs.利用常见 SNP 对复杂性状的遗传变异进行基因组分区。
用于癌症家族种系基因组的基于家系的DNA测序流程
Hered Cancer Clin Pract. 2016 Aug 9;14:16. doi: 10.1186/s13053-016-0058-1. eCollection 2016.
4
Rare mutations in RINT1 predispose carriers to breast and Lynch syndrome-spectrum cancers.RINT1基因的罕见突变使携带者易患乳腺癌和林奇综合征谱系癌症。
Cancer Discov. 2014 Jul;4(7):804-15. doi: 10.1158/2159-8290.CD-14-0212. Epub 2014 May 2.
5
ROVER variant caller: read-pair overlap considerate variant-calling software applied to PCR-based massively parallel sequencing datasets.ROVER变异体检测程序:一种考虑读段重叠的变异体检测软件,应用于基于PCR的大规模平行测序数据集。
Source Code Biol Med. 2014 Jan 24;9(1):3. doi: 10.1186/1751-0473-9-3.
Nat Genet. 2011 Jun;43(6):519-25. doi: 10.1038/ng.823. Epub 2011 May 8.
4
A framework for variation discovery and genotyping using next-generation DNA sequencing data.利用下一代 DNA 测序数据进行变异发现和基因分型的框架。
Nat Genet. 2011 May;43(5):491-8. doi: 10.1038/ng.806. Epub 2011 Apr 10.
5
ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data.ANNOVAR:从高通量测序数据中注释遗传变异的功能。
Nucleic Acids Res. 2010 Sep;38(16):e164. doi: 10.1093/nar/gkq603. Epub 2010 Jul 3.
6
Uncovering the roles of rare variants in common disease through whole-genome sequencing.通过全基因组测序揭示常见疾病中罕见变异的作用。
Nat Rev Genet. 2010 Jun;11(6):415-25. doi: 10.1038/nrg2779.
7
Accurate SNP and mutation detection by targeted custom microarray-based genomic enrichment of short-fragment sequencing libraries.通过靶向基于微阵列的短片段测序文库的基因组富集,实现 SNP 和突变的精确检测。
Nucleic Acids Res. 2010 Jun;38(10):e116. doi: 10.1093/nar/gkq072. Epub 2010 Feb 17.
8
Exome sequencing identifies the cause of a mendelian disorder.外显子组测序确定了一种孟德尔疾病的病因。
Nat Genet. 2010 Jan;42(1):30-5. doi: 10.1038/ng.499. Epub 2009 Nov 13.
9
The Sequence Alignment/Map format and SAMtools.序列比对/映射格式和 SAMtools。
Bioinformatics. 2009 Aug 15;25(16):2078-9. doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8.
10
The emerging landscape of breast cancer susceptibility.乳腺癌易感性的新态势
Nat Genet. 2008 Jan;40(1):17-22. doi: 10.1038/ng.2007.53.