• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SVM²:一种改进的基于配对末端的工具,用于使用高通量单基因组重测序数据检测小型基因组结构变异。

SVM²: an improved paired-end-based tool for the detection of small genomic structural variations using high-throughput single-genome resequencing data.

机构信息

Department of Biomolecular Sciences and Biotechnology, University of Milan, Milan 20133, Italy.

出版信息

Nucleic Acids Res. 2012 Oct;40(18):e145. doi: 10.1093/nar/gks606. Epub 2012 Jun 25.

DOI:10.1093/nar/gks606
PMID:22735696
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3467043/
Abstract

Several bioinformatics methods have been proposed for the detection and characterization of genomic structural variation (SV) from ultra high-throughput genome resequencing data. Recent surveys show that comprehensive detection of SV events of different types between an individual resequenced genome and a reference sequence is best achieved through the combination of methods based on different principles (split mapping, reassembly, read depth, insert size, etc.). The improvement of individual predictors is thus an important objective. In this study, we propose a new method that combines deviations from expected library insert sizes and additional information from local patterns of read mapping and uses supervised learning to predict the position and nature of structural variants. We show that our approach provides greatly increased sensitivity with respect to other tools based on paired end read mapping at no cost in specificity, and it makes reliable predictions of very short insertions and deletions in repetitive and low-complexity genomic contexts that can confound tools based on split mapping of reads.

摘要

已经提出了几种生物信息学方法,用于从超高通量基因组重测序数据中检测和描述基因组结构变异 (SV)。最近的调查表明,通过组合基于不同原理的方法(拆分映射、重新组装、读取深度、插入大小等),可以最好地实现个体重测序基因组和参考序列之间不同类型 SV 事件的全面检测。因此,提高个体预测器的性能是一个重要目标。在本研究中,我们提出了一种新方法,该方法结合了预期库插入大小的偏差以及来自读取映射局部模式的附加信息,并使用监督学习来预测结构变异的位置和性质。我们表明,与其他基于配对末端读取映射的工具相比,我们的方法在不影响特异性的情况下大大提高了灵敏度,并且可以对重复和低复杂度基因组环境中的非常短的插入和缺失进行可靠预测,这可能会干扰基于拆分映射的工具读取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ca7/3467043/fd07aeccb1ff/gks606f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ca7/3467043/b3f5e63de65c/gks606f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ca7/3467043/78d23f9ebd63/gks606f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ca7/3467043/fd07aeccb1ff/gks606f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ca7/3467043/b3f5e63de65c/gks606f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ca7/3467043/78d23f9ebd63/gks606f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ca7/3467043/fd07aeccb1ff/gks606f3.jpg

相似文献

1
SVM²: an improved paired-end-based tool for the detection of small genomic structural variations using high-throughput single-genome resequencing data.SVM²:一种改进的基于配对末端的工具,用于使用高通量单基因组重测序数据检测小型基因组结构变异。
Nucleic Acids Res. 2012 Oct;40(18):e145. doi: 10.1093/nar/gks606. Epub 2012 Jun 25.
2
Identifying structural variation in haploid microbial genomes from short-read resequencing data using breseq.使用breseq从短读重测序数据中识别单倍体微生物基因组中的结构变异。
BMC Genomics. 2014 Nov 29;15(1):1039. doi: 10.1186/1471-2164-15-1039.
3
Detection of structural variants involving repetitive regions in the reference genome.检测参考基因组中涉及重复区域的结构变异。
J Comput Biol. 2014 Mar;21(3):219-33. doi: 10.1089/cmb.2013.0129. Epub 2014 Feb 19.
4
Toolkit for automated and rapid discovery of structural variants.用于自动化和快速发现结构变体的工具包。
Methods. 2017 Oct 1;129:3-7. doi: 10.1016/j.ymeth.2017.05.030. Epub 2017 Jun 2.
5
Robust and exact structural variation detection with paired-end and soft-clipped alignments: SoftSV compared with eight algorithms.利用双端和软剪切比对进行稳健且精确的结构变异检测:SoftSV与八种算法的比较
Brief Bioinform. 2016 Jan;17(1):51-62. doi: 10.1093/bib/bbv028. Epub 2015 May 20.
6
Detecting genomic deletions from high-throughput sequence data with unsupervised learning.使用无监督学习从高通量测序数据中检测基因组缺失。
BMC Bioinformatics. 2023 Jan 27;23(Suppl 8):568. doi: 10.1186/s12859-023-05139-w.
7
PRISM: pair-read informed split-read mapping for base-pair level detection of insertion, deletion and structural variants.PRISM:基于双读信息的分读比对算法,用于检测插入、缺失和结构变异的碱基对水平。
Bioinformatics. 2012 Oct 15;28(20):2576-83. doi: 10.1093/bioinformatics/bts484. Epub 2012 Jul 31.
8
SVmine improves structural variation detection by integrative mining of predictions from multiple algorithms.SVmine 通过整合来自多种算法的预测结果来提高结构变异检测的效果。
Bioinformatics. 2017 Nov 1;33(21):3348-3354. doi: 10.1093/bioinformatics/btx455.
9
Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping.通过基因组重测序和高通量基因分型对牛的基因组变异进行全球评估。
BMC Genomics. 2011 Nov 14;12:557. doi: 10.1186/1471-2164-12-557.
10
Precise characterization of somatic complex structural variations from tumor/control paired long-read sequencing data with nanomonsv.利用纳米蒙斯 v 从肿瘤/对照配对长读测序数据中精确刻画体细胞复杂结构变异。
Nucleic Acids Res. 2023 Aug 11;51(14):e74. doi: 10.1093/nar/gkad526.

引用本文的文献

1
Comparisons of performances of structural variants detection algorithms in solitary or combination strategy.结构变异检测算法在单独或联合策略下的性能比较。
PLoS One. 2025 Feb 6;20(2):e0314982. doi: 10.1371/journal.pone.0314982. eCollection 2025.
2
Comprehensive evaluation and characterisation of short read general-purpose structural variant calling software.全面评估和特征分析短读通用结构变异调用软件。
Nat Commun. 2019 Jul 19;10(1):3240. doi: 10.1038/s41467-019-11146-4.
3
GPA: A Microbial Genetic Polymorphisms Assignments Tool in Metagenomic Analysis by Bayesian Estimation.

本文引用的文献

1
An integrative probabilistic model for identification of structural variation in sequencing data.一种整合概率模型,用于鉴定测序数据中的结构变异。
Genome Biol. 2012;13(3):R22. doi: 10.1186/gb-2012-13-3-r22.
2
Breakpointer: using local mapping artifacts to support sequence breakpoint discovery from single-end reads.Breakpointer:利用局部比对特征支持从单端读段中发现序列断点。
Bioinformatics. 2012 Apr 1;28(7):1024-5. doi: 10.1093/bioinformatics/bts064. Epub 2012 Feb 1.
3
Natural genetic variation caused by small insertions and deletions in the human genome.
GPA:基于贝叶斯估计的宏基因组分析中微生物遗传多态性分配工具。
Genomics Proteomics Bioinformatics. 2019 Feb;17(1):106-117. doi: 10.1016/j.gpb.2018.12.005. Epub 2019 Apr 23.
4
Comprehensive Identification of Fim-Mediated Inversions in Uropathogenic Escherichia coli with Structural Variation Detection Using Relative Entropy.利用相对熵进行结构变异检测,对尿路致病性大肠杆菌中 fim 介导的倒位进行全面鉴定。
mSphere. 2019 Apr 10;4(2):e00693-18. doi: 10.1128/mSphere.00693-18.
5
InDel marker detection by integration of multiple softwares using machine learning techniques.利用机器学习技术整合多种软件进行插入缺失(InDel)标记检测。
BMC Bioinformatics. 2016 Nov 2;17(1):548. doi: 10.1186/s12859-016-1312-2.
6
Identification of copy number variants in whole-genome data using Reference Coverage Profiles.利用参考覆盖谱图鉴定全基因组数据中的拷贝数变异。
Front Genet. 2015 Feb 17;6:45. doi: 10.3389/fgene.2015.00045. eCollection 2015.
7
A gradient-boosting approach for filtering de novo mutations in parent-offspring trios.一种用于筛选亲子三联体中新生突变的梯度提升方法。
Bioinformatics. 2014 Jul 1;30(13):1830-6. doi: 10.1093/bioinformatics/btu141. Epub 2014 Mar 10.
8
MATCHCLIP: locate precise breakpoints for copy number variation using CIGAR string by matching soft clipped reads.MATCHCLIP:通过匹配软剪辑读取,使用 CIGAR 字符串定位拷贝数变异的精确断点。
Front Genet. 2013 Aug 16;4:157. doi: 10.3389/fgene.2013.00157. eCollection 2013.
9
Unraveling overlapping deletions by agglomerative clustering.解析凝聚聚类中的重叠缺失。
BMC Genomics. 2013;14 Suppl 1(Suppl 1):S12. doi: 10.1186/1471-2164-14-S1-S12. Epub 2013 Jan 21.
人类基因组中小的插入和缺失引起的自然遗传变异。
Genome Res. 2011 Jun;21(6):830-9. doi: 10.1101/gr.115907.110. Epub 2011 Apr 1.
4
Genome structural variation discovery and genotyping.基因组结构变异发现与基因分型。
Nat Rev Genet. 2011 May;12(5):363-76. doi: 10.1038/nrg2958. Epub 2011 Mar 1.
5
CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing.CNVnator:一种从家族和人群基因组测序中发现、基因分型和表征典型和非典型 CNV 的方法。
Genome Res. 2011 Jun;21(6):974-84. doi: 10.1101/gr.114876.110. Epub 2011 Feb 7.
6
Discovery and genotyping of genome structural polymorphism by sequencing on a population scale.基于人群规模测序的基因组结构多态性的发现和基因分型。
Nat Genet. 2011 Mar;43(3):269-76. doi: 10.1038/ng.768. Epub 2011 Feb 13.
7
Mapping copy number variation by population-scale genome sequencing.通过群体规模的基因组测序来绘制拷贝数变异图谱。
Nature. 2011 Feb 3;470(7332):59-65. doi: 10.1038/nature09708.
8
A map of human genome variation from population-scale sequencing.人类基因组变异的图谱来自于基于人群的测序。
Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.
9
Dindel: accurate indel calls from short-read data.Dindel:从短读数据中进行精确的插入缺失突变(Indel)调用。
Genome Res. 2011 Jun;21(6):961-73. doi: 10.1101/gr.112326.110. Epub 2010 Oct 27.
10
Functional impact of global rare copy number variation in autism spectrum disorders.自闭症谱系障碍中全球罕见拷贝数变异的功能影响。
Nature. 2010 Jul 15;466(7304):368-72. doi: 10.1038/nature09146. Epub 2010 Jun 9.