• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PostSV:一种用于筛选结构变异的后处理方法。

PostSV: A Post-Processing Approach for Filtering Structural Variations.

作者信息

Alzaid Eman, Allali Achraf El

机构信息

Computer Science Department, King Saud University, Riyadh, Saudi Arabia.

Department of Computer Science, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia.

出版信息

Bioinform Biol Insights. 2020 Jan 20;14:1177932219892957. doi: 10.1177/1177932219892957. eCollection 2020.

DOI:10.1177/1177932219892957
PMID:32009779
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6974750/
Abstract

Genomic structural variations are significant causes of genome diversity and complex diseases. With advances in sequencing technologies, many algorithms have been designed to identify structural differences using next-generation sequencing (NGS) data. Due to repetitions in the human genome and the short reads produced by NGS, the discovery of structural variants (SVs) by state-of-the-art SV callers is not always accurate. To improve performance, multiple SV callers are often used to detect variants. However, most SV callers suffer from high false-positive rates, which diminishes the overall performance, especially in low-coverage genomes. In this article, we propose a post-processing classification-based algorithm that can be used to filter structural variation predictions produced by SV callers. Novel features are defined from putative SV predictions using reads at the local regions around the breakpoints. Several classifiers are employed to classify the candidate predictions and remove false positives. We test our classifier models on simulated and real genomes and show that the proposed approach improves the performance of state-of-the-art algorithms.

摘要

基因组结构变异是基因组多样性和复杂疾病的重要成因。随着测序技术的进步,已设计出许多算法,用于利用下一代测序(NGS)数据识别结构差异。由于人类基因组中的重复序列以及NGS产生的短读长,最先进的结构变异(SV)检测工具发现结构变异并不总是准确的。为了提高性能,通常使用多个SV检测工具来检测变异。然而,大多数SV检测工具存在较高的假阳性率,这降低了整体性能,尤其是在低覆盖度基因组中。在本文中,我们提出了一种基于后处理分类的算法,可用于过滤SV检测工具产生的结构变异预测结果。利用断点周围局部区域的读段,从假定的SV预测中定义了新的特征。采用多个分类器对候选预测进行分类并去除假阳性。我们在模拟基因组和真实基因组上测试了我们的分类器模型,结果表明所提出的方法提高了最先进算法的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3158/6974750/24a72b3d1849/10.1177_1177932219892957-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3158/6974750/2addcd7aba9a/10.1177_1177932219892957-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3158/6974750/7011435d2240/10.1177_1177932219892957-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3158/6974750/46e0aec5ee2d/10.1177_1177932219892957-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3158/6974750/24a72b3d1849/10.1177_1177932219892957-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3158/6974750/2addcd7aba9a/10.1177_1177932219892957-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3158/6974750/7011435d2240/10.1177_1177932219892957-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3158/6974750/46e0aec5ee2d/10.1177_1177932219892957-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3158/6974750/24a72b3d1849/10.1177_1177932219892957-fig4.jpg

相似文献

1
PostSV: A Post-Processing Approach for Filtering Structural Variations.PostSV:一种用于筛选结构变异的后处理方法。
Bioinform Biol Insights. 2020 Jan 20;14:1177932219892957. doi: 10.1177/1177932219892957. eCollection 2020.
2
Structural variation detection using next-generation sequencing data: A comparative technical review.利用下一代测序数据进行结构变异检测:一项比较技术综述。
Methods. 2016 Jun 1;102:36-49. doi: 10.1016/j.ymeth.2016.01.020. Epub 2016 Feb 1.
3
VISTA: an integrated framework for structural variant discovery.VISTA:一个用于结构变异发现的集成框架。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae462.
4
svclassify: a method to establish benchmark structural variant calls.svclassify:一种建立基准结构变异调用的方法。
BMC Genomics. 2016 Jan 16;17:64. doi: 10.1186/s12864-016-2366-2.
5
Structural variation discovery in wheat using PacBio high-fidelity sequencing.利用 PacBio 高保真测序技术发现小麦中的结构变异。
Plant J. 2024 Oct;120(2):687-698. doi: 10.1111/tpj.17011. Epub 2024 Sep 6.
6
CLOVE: classification of genomic fusions into structural variation events.CLOVE:将基因组融合分类为结构变异事件。
BMC Bioinformatics. 2017 Jul 20;18(1):346. doi: 10.1186/s12859-017-1760-3.
7
Comparison of multiple algorithms to reliably detect structural variants in pears.比较多种算法以可靠地检测梨中的结构变异。
BMC Genomics. 2020 Jan 20;21(1):61. doi: 10.1186/s12864-020-6455-x.
8
Automated filtering of genome-wide large deletions through an ensemble deep learning framework.通过集成深度学习框架自动筛选全基因组大片段缺失。
Methods. 2022 Oct;206:77-86. doi: 10.1016/j.ymeth.2022.08.001. Epub 2022 Aug 28.
9
Evaluation of computational genotyping of structural variation for clinical diagnoses.结构变异计算基因分型在临床诊断中的评估。
Gigascience. 2019 Sep 1;8(9). doi: 10.1093/gigascience/giz110.
10
Robust and exact structural variation detection with paired-end and soft-clipped alignments: SoftSV compared with eight algorithms.利用双端和软剪切比对进行稳健且精确的结构变异检测:SoftSV与八种算法的比较
Brief Bioinform. 2016 Jan;17(1):51-62. doi: 10.1093/bib/bbv028. Epub 2015 May 20.

本文引用的文献

1
FusorSV: an algorithm for optimally combining data from multiple structural variation detection methods. FusorSV:一种用于最优组合来自多种结构变异检测方法的数据的算法。
Genome Biol. 2018 Mar 20;19(1):38. doi: 10.1186/s13059-018-1404-6.
2
Alignment of 1000 Genomes Project reads to reference assembly GRCh38.将 1000 基因组计划的读取与参考组装 GRCh38 对齐。
Gigascience. 2017 Jul 1;6(7):1-8. doi: 10.1093/gigascience/gix038.
3
Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.
对GRCh38和从头单倍体基因组组装的评估证明了参考组装的持久质量。
Genome Res. 2017 May;27(5):849-864. doi: 10.1101/gr.213611.116. Epub 2017 Apr 10.
4
Evaluation and assessment of read-mapping by multiple next-generation sequencing aligners based on genome-wide characteristics.基于全基因组特征,对多种新一代测序比对器的读段比对进行评估。
Genomics. 2017 Jul;109(3-4):186-191. doi: 10.1016/j.ygeno.2017.03.001. Epub 2017 Mar 9.
5
Resolving complex structural genomic rearrangements using a randomized approach.使用随机方法解析复杂的结构基因组重排。
Genome Biol. 2016 Jun 10;17(1):126. doi: 10.1186/s13059-016-0993-1.
6
Structural variation detection using next-generation sequencing data: A comparative technical review.利用下一代测序数据进行结构变异检测:一项比较技术综述。
Methods. 2016 Jun 1;102:36-49. doi: 10.1016/j.ymeth.2016.01.020. Epub 2016 Feb 1.
7
svclassify: a method to establish benchmark structural variant calls.svclassify:一种建立基准结构变异调用的方法。
BMC Genomics. 2016 Jan 16;17:64. doi: 10.1186/s12864-016-2366-2.
8
Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications.Manta:用于种系和癌症测序应用的结构变异和插入缺失的快速检测。
Bioinformatics. 2016 Apr 15;32(8):1220-2. doi: 10.1093/bioinformatics/btv710. Epub 2015 Dec 8.
9
Wham: Identifying Structural Variants of Biological Consequence.Wham:识别具有生物学意义的结构变异。
PLoS Comput Biol. 2015 Dec 1;11(12):e1004572. doi: 10.1371/journal.pcbi.1004572. eCollection 2015 Dec.
10
Robust and exact structural variation detection with paired-end and soft-clipped alignments: SoftSV compared with eight algorithms.利用双端和软剪切比对进行稳健且精确的结构变异检测:SoftSV与八种算法的比较
Brief Bioinform. 2016 Jan;17(1):51-62. doi: 10.1093/bib/bbv028. Epub 2015 May 20.