• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用同义词库注释对匹配样本中的基因变异进行比较。

Comparison of genetic variants in matched samples using thesaurus annotation.

作者信息

Konopka Tomasz, Nijman Sebastian M B

机构信息

Ludwig Institute for Cancer Research, University of Oxford, Oxford, UK.

出版信息

Bioinformatics. 2016 Mar 1;32(5):657-63. doi: 10.1093/bioinformatics/btv654. Epub 2015 Nov 5.

DOI:10.1093/bioinformatics/btv654
PMID:26545822
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4795618/
Abstract

MOTIVATION

Calling changes in DNA, e.g. as a result of somatic events in cancer, requires analysis of multiple matched sequenced samples. Events in low-mappability regions of the human genome are difficult to encode in variant call files and have been under-reported as a result. However, they can be described accurately through thesaurus annotation-a technique that links multiple genomic loci together to explicate a single variant.

RESULTS

We here describe software and benchmarks for using thesaurus annotation to detect point changes in DNA from matched samples. In benchmarks on matched normal/tumor samples we show that the technique can recover between five and ten percent more true events than conventional approaches, while strictly limiting false discovery and being fully consistent with popular variant analysis workflows. We also demonstrate the utility of the approach for analysis of de novo mutations in parents/child families.

AVAILABILITY AND IMPLEMENTATION

Software performing thesaurus annotation is implemented in java; available in source code on github at GeneticThesaurus (https://github.com/tkonopka/GeneticThesaurus) and as an executable on sourceforge at geneticthesaurus (https://sourceforge.net/projects/geneticthesaurus). Mutation calling is implemented in an R package available on github at RGeneticThesaurus (https://github.com/tkonopka/RGeneticThesaurus).

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

CONTACT

tomasz.konopka@ludwig.ox.ac.uk.

摘要

动机

识别DNA中的变化,例如癌症中的体细胞事件所导致的变化,需要对多个匹配的测序样本进行分析。人类基因组中低映射性区域的事件难以在变异调用文件中进行编码,因此报告不足。然而,它们可以通过词库注释准确描述——这是一种将多个基因组位点联系在一起以阐明单个变异的技术。

结果

我们在此描述了使用词库注释从匹配样本中检测DNA点变化的软件和基准。在匹配的正常/肿瘤样本基准测试中,我们表明该技术比传统方法能够多发现5%到10%的真实事件,同时严格限制错误发现,并且与流行的变异分析工作流程完全一致。我们还展示了该方法在分析亲子家庭中的新生突变方面的效用。

可用性和实现方式

执行词库注释的软件用Java实现;可在GitHub上的GeneticThesaurus(https://github.com/tkonopka/GeneticThesaurus)获取源代码,也可在SourceForge上的geneticthesaurus(https://sourceforge.net/projects/geneticthesaurus)获取可执行文件。变异调用在GitHub上的RGeneticThesaurus(https://github.com/tkonopka/RGeneticThesaurus)的R包中实现。

补充信息

补充数据可在《生物信息学》在线获取。

联系方式

tomasz.konopka@ludwig.ox.ac.uk

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aba/4795618/dd63fc1f6007/btv654f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aba/4795618/29e1a9baa808/btv654f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aba/4795618/2d3724e9b172/btv654f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aba/4795618/141e0b3d11d1/btv654f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aba/4795618/dd63fc1f6007/btv654f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aba/4795618/29e1a9baa808/btv654f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aba/4795618/2d3724e9b172/btv654f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aba/4795618/141e0b3d11d1/btv654f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aba/4795618/dd63fc1f6007/btv654f4p.jpg

相似文献

1
Comparison of genetic variants in matched samples using thesaurus annotation.使用同义词库注释对匹配样本中的基因变异进行比较。
Bioinformatics. 2016 Mar 1;32(5):657-63. doi: 10.1093/bioinformatics/btv654. Epub 2015 Nov 5.
2
ReliableGenome: annotation of genomic regions with high/low variant calling concordance.可靠基因组:具有高/低变异检测一致性的基因组区域注释。
Bioinformatics. 2017 Jan 15;33(2):155-160. doi: 10.1093/bioinformatics/btw587. Epub 2016 Sep 7.
3
A thesaurus of genetic variation for interrogation of repetitive genomic regions.用于重复基因组区域查询的遗传变异词库。
Nucleic Acids Res. 2015 May 26;43(10):e68. doi: 10.1093/nar/gkv178. Epub 2015 Mar 27.
4
Custom workflows to improve joint variant calling from multiple related tumour samples: FreeBayesSomatic and Strelka2Pass.
Bioinformatics. 2021 Nov 5;37(21):3916-3919. doi: 10.1093/bioinformatics/btab606.
5
Jannovar: a java library for exome annotation.Jannovar:一个用于外显子注释的Java库。
Hum Mutat. 2014 May;35(5):548-55. doi: 10.1002/humu.22531. Epub 2014 Apr 9.
6
SAMFIRE: multi-locus variant calling for time-resolved sequence data.SAMFIRE:用于时间分辨序列数据的多位点变异检测
Bioinformatics. 2016 Jul 15;32(14):2208-9. doi: 10.1093/bioinformatics/btw205. Epub 2016 Apr 22.
7
Canvas SPW: calling de novo copy number variants in pedigrees.Canvas SPW:在家系中调用从头发生的拷贝数变异。
Bioinformatics. 2018 Feb 1;34(3):516-518. doi: 10.1093/bioinformatics/btx618.
8
Personal Cancer Genome Reporter: variant interpretation report for precision oncology.个人癌症基因组报告:精准肿瘤学的变异解释报告。
Bioinformatics. 2018 May 15;34(10):1778-1780. doi: 10.1093/bioinformatics/btx817.
9
SNiPA: an interactive, genetic variant-centered annotation browser.SNiPA:一个交互式的、以基因变异为中心的注释浏览器。
Bioinformatics. 2015 Apr 15;31(8):1334-6. doi: 10.1093/bioinformatics/btu779. Epub 2014 Nov 26.
10
VarGen: an R package for disease-associated variant discovery and annotation.VarGen:用于疾病相关变异发现和注释的 R 包。
Bioinformatics. 2020 Apr 15;36(8):2626-2627. doi: 10.1093/bioinformatics/btz930.

引用本文的文献

1
A pan-cancer landscape of somatic mutations in non-unique regions of the human genome.人类基因组非唯一区域体细胞突变的泛癌图谱。
Nat Biotechnol. 2021 Dec;39(12):1589-1596. doi: 10.1038/s41587-021-00971-y. Epub 2021 Jul 19.
2
A Computational Method of Defining Potential Biomarkers based on Differential Sub-Networks.基于差异子网络的潜在生物标志物定义的计算方法。
Sci Rep. 2017 Oct 30;7(1):14339. doi: 10.1038/s41598-017-14682-5.

本文引用的文献

1
Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection.将肿瘤基因组模拟与众包相结合,以评估体细胞单核苷酸变异检测。
Nat Methods. 2015 Jul;12(7):623-30. doi: 10.1038/nmeth.3407. Epub 2015 May 18.
2
RVD2: an ultra-sensitive variant detection model for low-depth heterogeneous next-generation sequencing data.RVD2:一种用于低深度异质下一代测序数据的超灵敏变异检测模型。
Bioinformatics. 2015 Sep 1;31(17):2785-93. doi: 10.1093/bioinformatics/btv275. Epub 2015 Apr 29.
3
Personalized genomic analyses for cancer mutation discovery and interpretation.
用于癌症突变发现与解读的个性化基因组分析。
Sci Transl Med. 2015 Apr 15;7(283):283ra53. doi: 10.1126/scitranslmed.aaa7161.
4
A thesaurus of genetic variation for interrogation of repetitive genomic regions.用于重复基因组区域查询的遗传变异词库。
Nucleic Acids Res. 2015 May 26;43(10):e68. doi: 10.1093/nar/gkv178. Epub 2015 Mar 27.
5
multiSNV: a probabilistic approach for improving detection of somatic point mutations from multiple related tumour samples.多单核苷酸变异:一种用于改进从多个相关肿瘤样本中检测体细胞点突变的概率方法。
Nucleic Acids Res. 2015 May 19;43(9):e61. doi: 10.1093/nar/gkv135. Epub 2015 Feb 26.
6
A Bayesian framework for de novo mutation calling in parents-offspring trios.一种用于亲子三人组中新生突变检测的贝叶斯框架。
Bioinformatics. 2015 May 1;31(9):1375-81. doi: 10.1093/bioinformatics/btu839. Epub 2014 Dec 21.
7
COSMIC: exploring the world's knowledge of somatic mutations in human cancer.COSMIC:探索全球关于人类癌症体细胞突变的知识。
Nucleic Acids Res. 2015 Jan;43(Database issue):D805-11. doi: 10.1093/nar/gku1075. Epub 2014 Oct 29.
8
Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications.整合基于图谱、组装和单倍型的方法以在临床测序应用中进行变异检测。
Nat Genet. 2014 Aug;46(8):912-918. doi: 10.1038/ng.3036. Epub 2014 Jul 13.
9
Joint variant and de novo mutation identification on pedigrees from high-throughput sequencing data.基于高通量测序数据对家系进行联合变异和新生突变鉴定。
J Comput Biol. 2014 Jun;21(6):405-19. doi: 10.1089/cmb.2014.0029.
10
DeNovoGear: de novo indel and point mutation discovery and phasing.DeNovoGear:从头缺失和点突变发现及相位分析。
Nat Methods. 2013 Oct;10(10):985-7. doi: 10.1038/nmeth.2611. Epub 2013 Aug 25.