• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SV-Bay:利用贝叶斯方法检测癌症基因组中的结构变异,并对GC含量和读段可映射性进行校正。

SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability.

作者信息

Iakovishina Daria, Janoueix-Lerosey Isabelle, Barillot Emmanuel, Regnier Mireille, Boeva Valentina

机构信息

INRIA Projet AMIB, Ecole Polytechnique, Palaiseau, France.

Institut Curie, Centre De Recherche, Paris Inserm, U830, Department Genetics and Biology of Cancers, Paris, France.

出版信息

Bioinformatics. 2016 Apr 1;32(7):984-92. doi: 10.1093/bioinformatics/btv751. Epub 2016 Jan 6.

DOI:10.1093/bioinformatics/btv751
PMID:26740523
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4896370/
Abstract

MOTIVATION

Whole genome sequencing of paired-end reads can be applied to characterize the landscape of large somatic rearrangements of cancer genomes. Several methods for detecting structural variants with whole genome sequencing data have been developed. So far, none of these methods has combined information about abnormally mapped read pairs connecting rearranged regions and associated global copy number changes automatically inferred from the same sequencing data file. Our aim was to create a computational method that could use both types of information, i.e. normal and abnormal reads, and demonstrate that by doing so we can highly improve both sensitivity and specificity rates of structural variant prediction.

RESULTS

We developed a computational method, SV-Bay, to detect structural variants from whole genome sequencing mate-pair or paired-end data using a probabilistic Bayesian approach. This approach takes into account depth of coverage by normal reads and abnormalities in read pair mappings. To estimate the model likelihood, SV-Bay considers GC-content and read mappability of the genome, thus making important corrections to the expected read count. For the detection of somatic variants, SV-Bay makes use of a matched normal sample when it is available. We validated SV-Bay on simulated datasets and an experimental mate-pair dataset for the CLB-GA neuroblastoma cell line. The comparison of SV-Bay with several other methods for structural variant detection demonstrated that SV-Bay has better prediction accuracy both in terms of sensitivity and false-positive detection rate.

AVAILABILITY AND IMPLEMENTATION

https://github.com/InstitutCurie/SV-Bay

CONTACT

valentina.boeva@inserm.fr

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

双端测序读段的全基因组测序可用于描绘癌症基因组中大型体细胞重排的格局。已经开发了几种利用全基因组测序数据检测结构变异的方法。到目前为止,这些方法中没有一种能将连接重排区域的异常映射读对信息与从同一测序数据文件中自动推断出的相关全局拷贝数变化信息结合起来。我们的目标是创建一种计算方法,该方法可以同时使用这两种信息,即正常和异常读段,并证明通过这样做可以大幅提高结构变异预测的灵敏度和特异性。

结果

我们开发了一种计算方法SV-Bay,使用概率贝叶斯方法从全基因组测序的配对末端或双端数据中检测结构变异。这种方法考虑了正常读段的覆盖深度和读对映射中的异常情况。为了估计模型似然性,SV-Bay考虑了基因组的GC含量和读段可映射性,从而对预期读段计数进行重要校正。对于体细胞变异的检测,SV-Bay在有匹配的正常样本时会加以利用。我们在模拟数据集和CLB-GA神经母细胞瘤细胞系的实验配对末端数据集上对SV-Bay进行了验证。将SV-Bay与其他几种结构变异检测方法进行比较表明,SV-Bay在灵敏度和假阳性检测率方面都具有更好的预测准确性。

可用性和实现方式

https://github.com/InstitutCurie/SV-Bay

联系方式

valentina.boeva@inserm.fr

补充信息

补充数据可在《生物信息学》在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3975/4896370/b00e1f2a892c/btv751f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3975/4896370/90d848138f79/btv751f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3975/4896370/9c1c4e65a03b/btv751f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3975/4896370/602842d84271/btv751f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3975/4896370/b00e1f2a892c/btv751f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3975/4896370/90d848138f79/btv751f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3975/4896370/9c1c4e65a03b/btv751f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3975/4896370/602842d84271/btv751f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3975/4896370/b00e1f2a892c/btv751f4p.jpg

相似文献

1
SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability.SV-Bay:利用贝叶斯方法检测癌症基因组中的结构变异,并对GC含量和读段可映射性进行校正。
Bioinformatics. 2016 Apr 1;32(7):984-92. doi: 10.1093/bioinformatics/btv751. Epub 2016 Jan 6.
2
Detection of structural variants involving repetitive regions in the reference genome.检测参考基因组中涉及重复区域的结构变异。
J Comput Biol. 2014 Mar;21(3):219-33. doi: 10.1089/cmb.2013.0129. Epub 2014 Feb 19.
3
Seeksv: an accurate tool for somatic structural variation and virus integration detection.SeekSV:一种用于检测体细胞结构变异和病毒整合的精确工具。
Bioinformatics. 2017 Jan 15;33(2):184-191. doi: 10.1093/bioinformatics/btw591. Epub 2016 Sep 14.
4
SECEDO: SNV-based subclone detection using ultra-low coverage single-cell DNA sequencing.SECEDO:基于 SNV 的亚克隆检测,使用超低覆盖度单细胞 DNA 测序。
Bioinformatics. 2022 Sep 15;38(18):4293-4300. doi: 10.1093/bioinformatics/btac510.
5
Meltos: multi-sample tumor phylogeny reconstruction for structural variants.Meltos:用于结构变异的多样本肿瘤系统发育重建。
Bioinformatics. 2020 Feb 15;36(4):1082-1090. doi: 10.1093/bioinformatics/btz737.
6
Robust and exact structural variation detection with paired-end and soft-clipped alignments: SoftSV compared with eight algorithms.利用双端和软剪切比对进行稳健且精确的结构变异检测:SoftSV与八种算法的比较
Brief Bioinform. 2016 Jan;17(1):51-62. doi: 10.1093/bib/bbv028. Epub 2015 May 20.
7
SVJedi: genotyping structural variations with long reads.使用长读长进行基因分型结构变异。
Bioinformatics. 2020 Nov 1;36(17):4568-4575. doi: 10.1093/bioinformatics/btaa527.
8
PRISM: pair-read informed split-read mapping for base-pair level detection of insertion, deletion and structural variants.PRISM:基于双读信息的分读比对算法,用于检测插入、缺失和结构变异的碱基对水平。
Bioinformatics. 2012 Oct 15;28(20):2576-83. doi: 10.1093/bioinformatics/bts484. Epub 2012 Jul 31.
9
Toolkit for automated and rapid discovery of structural variants.用于自动化和快速发现结构变体的工具包。
Methods. 2017 Oct 1;129:3-7. doi: 10.1016/j.ymeth.2017.05.030. Epub 2017 Jun 2.
10
RAPTR-SV: a hybrid method for the detection of structural variants.RAPTR-SV:一种用于检测结构变异的混合方法。
Bioinformatics. 2015 Jul 1;31(13):2084-90. doi: 10.1093/bioinformatics/btv086. Epub 2015 Feb 16.

引用本文的文献

1
What makes TMB an ambivalent biomarker for immunotherapy? A subtle mismatch between the sample-based design of variant callers and real clinical cohort.TMB 为何成为免疫治疗的一个矛盾生物标志物?变异 caller 的基于样本的设计与真实临床队列之间存在微妙的不匹配。
Front Immunol. 2023 May 25;14:1151224. doi: 10.3389/fimmu.2023.1151224. eCollection 2023.
2
DelInsCaller: An Efficient Algorithm for Identifying Delins and Estimating Haplotypes from Long Reads with High Level of Sequencing Errors.DelInsCaller:一种高效算法,用于从具有高测序错误水平的长读段中识别 Delins 并估计单倍型。
Genes (Basel). 2022 Dec 20;14(1):4. doi: 10.3390/genes14010004.
3

本文引用的文献

1
Comprehensive characterization of complex structural variations in cancer by directly comparing genome sequence reads.通过直接比较基因组序列读取,全面描绘癌症中复杂结构变异。
Nat Biotechnol. 2014 Nov;32(11):1106-12. doi: 10.1038/nbt.3027. Epub 2014 Oct 26.
2
Gustaf: Detecting and correctly classifying SVs in the NGS twilight zone.古斯塔夫:在 NGS 暮色区检测和正确分类 SV。
Bioinformatics. 2014 Dec 15;30(24):3484-90. doi: 10.1093/bioinformatics/btu431. Epub 2014 Jul 14.
3
LUMPY: a probabilistic framework for structural variant discovery.
Copy Number Variation Detection by Single-Cell DNA Sequencing with SCOPE.
利用 SCOPE 进行单细胞 DNA 测序的拷贝数变异检测。
Methods Mol Biol. 2022;2493:279-288. doi: 10.1007/978-1-0716-2293-3_18.
4
Identification of Copy Number Alterations from Next-Generation Sequencing Data.从下一代测序数据中鉴定拷贝数改变。
Adv Exp Med Biol. 2022;1361:55-74. doi: 10.1007/978-3-030-91836-1_4.
5
PopDel identifies medium-size deletions simultaneously in tens of thousands of genomes.PopDel 可同时在数万个基因组中识别中等大小的缺失。
Nat Commun. 2021 Feb 1;12(1):730. doi: 10.1038/s41467-020-20850-5.
6
Comprehensive evaluation and characterisation of short read general-purpose structural variant calling software.全面评估和特征分析短读通用结构变异调用软件。
Nat Commun. 2019 Jul 19;10(1):3240. doi: 10.1038/s41467-019-11146-4.
7
Discovery of tandem and interspersed segmental duplications using high-throughput sequencing.利用高通量测序发现串联和散在的片段重复。
Bioinformatics. 2019 Oct 15;35(20):3923-3930. doi: 10.1093/bioinformatics/btz237.
8
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
9
Preprocessing Sequence Coverage Data for More Precise Detection of Copy Number Variations.预处理序列覆盖数据以更精确地检测拷贝数变异。
IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):868-876. doi: 10.1109/TCBB.2018.2869738. Epub 2018 Sep 12.
10
Reconstructing cancer karyotypes from short read data: the half empty and half full glass.从短读长数据重建癌症核型:半空与半满之杯。
BMC Bioinformatics. 2017 Nov 15;18(1):488. doi: 10.1186/s12859-017-1929-9.
LUMPY:一种用于结构变异发现的概率框架。
Genome Biol. 2014 Jun 26;15(6):R84. doi: 10.1186/gb-2014-15-6-r84.
4
Socrates: identification of genomic rearrangements in tumour genomes by re-aligning soft clipped reads.苏格拉底:通过重新比对软剪切读段来鉴定肿瘤基因组中的基因组重排。
Bioinformatics. 2014 Apr 15;30(8):1064-1072. doi: 10.1093/bioinformatics/btt767. Epub 2014 Jan 2.
5
Breakpoint features of genomic rearrangements in neuroblastoma with unbalanced translocations and chromothripsis.神经母细胞瘤中基因组重排的断点特征与不平衡易位和染色体重排有关。
PLoS One. 2013 Aug 26;8(8):e72182. doi: 10.1371/journal.pone.0072182. eCollection 2013.
6
PeSV-Fisher: identification of somatic and non-somatic structural variants using next generation sequencing data.PeSV-Fisher:利用下一代测序数据鉴定体细胞和非体细胞结构变异。
PLoS One. 2013 May 21;8(5):e63377. doi: 10.1371/journal.pone.0063377. Print 2013.
7
Diverse mechanisms of somatic structural variations in human cancer genomes.人类癌症基因组中体细胞结构变异的多种机制。
Cell. 2013 May 9;153(4):919-29. doi: 10.1016/j.cell.2013.04.010.
8
DELLY: structural variant discovery by integrated paired-end and split-read analysis.DELLY:通过整合的 paired-end 和 split-read 分析进行结构变异发现。
Bioinformatics. 2012 Sep 15;28(18):i333-i339. doi: 10.1093/bioinformatics/bts378.
9
PRISM: pair-read informed split-read mapping for base-pair level detection of insertion, deletion and structural variants.PRISM:基于双读信息的分读比对算法,用于检测插入、缺失和结构变异的碱基对水平。
Bioinformatics. 2012 Oct 15;28(20):2576-83. doi: 10.1093/bioinformatics/bts484. Epub 2012 Jul 31.
10
Reconstructing cancer genomes from paired-end sequencing data.从配对末端测序数据中重建癌症基因组。
BMC Bioinformatics. 2012 Apr 19;13 Suppl 6(Suppl 6):S10. doi: 10.1186/1471-2105-13-S6-S10.