文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

SVIM:基于比对的长读段的结构变异识别。

SVIM: structural variant identification using mapped long reads.

机构信息

Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany.

出版信息

Bioinformatics. 2019 Sep 1;35(17):2907-2915. doi: 10.1093/bioinformatics/btz041.


DOI:10.1093/bioinformatics/btz041
PMID:30668829
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6735718/
Abstract

MOTIVATION: Structural variants are defined as genomic variants larger than 50 bp. They have been shown to affect more bases in any given genome than single-nucleotide polymorphisms or small insertions and deletions. Additionally, they have great impact on human phenotype and diversity and have been linked to numerous diseases. Due to their size and association with repeats, they are difficult to detect by shotgun sequencing, especially when based on short reads. Long read, single-molecule sequencing technologies like those offered by Pacific Biosciences or Oxford Nanopore Technologies produce reads with a length of several thousand base pairs. Despite the higher error rate and sequencing cost, long-read sequencing offers many advantages for the detection of structural variants. Yet, available software tools still do not fully exploit the possibilities. RESULTS: We present SVIM, a tool for the sensitive detection and precise characterization of structural variants from long-read data. SVIM consists of three components for the collection, clustering and combination of structural variant signatures from read alignments. It discriminates five different variant classes including similar types, such as tandem and interspersed duplications and novel element insertions. SVIM is unique in its capability of extracting both the genomic origin and destination of duplications. It compares favorably with existing tools in evaluations on simulated data and real datasets from Pacific Biosciences and Nanopore sequencing machines. AVAILABILITY AND IMPLEMENTATION: The source code and executables of SVIM are available on Github: github.com/eldariont/svim. SVIM has been implemented in Python 3 and published on bioconda and the Python Package Index. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

摘要

动机:结构变体被定义为大于 50bp 的基因组变体。它们已被证明在任何给定的基因组中影响的碱基比单核苷酸多态性或小插入和缺失更多。此外,它们对人类表型和多样性有很大的影响,并与许多疾病有关。由于它们的大小和与重复序列的关联,它们很难通过鸟枪法测序检测到,尤其是基于短读长时。Pacific Biosciences 或 Oxford Nanopore Technologies 等提供的长读长、单分子测序技术可产生数千个碱基对长的读长。尽管错误率和测序成本较高,但长读长测序在检测结构变体方面具有许多优势。然而,可用的软件工具仍未充分利用这些可能性。

结果:我们提出了 SVIM,这是一种用于从长读长数据中敏感检测和精确表征结构变体的工具。SVIM 由三个组件组成,用于从读长比对中收集、聚类和组合结构变体特征。它可区分包括串联和散布重复在内的五种不同的变体类型,以及新型元件插入。SVIM 的独特之处在于能够提取重复的基因组起源和目的地。它在模拟数据和来自 Pacific Biosciences 和 Nanopore 测序仪的真实数据集的评估中与现有工具相比表现出色。

可用性和实现:SVIM 的源代码和可执行文件可在 Github 上获得:github.com/eldariont/svim。SVIM 是用 Python 3 实现的,并在 bioconda 和 Python 包索引上发布。

补充信息:补充数据可在 Bioinformatics 在线获得。

相似文献

[1]
SVIM: structural variant identification using mapped long reads.

Bioinformatics. 2019-9-1

[2]
SVIM-asm: structural variant detection from haploid and diploid genome assemblies.

Bioinformatics. 2021-4-1

[3]
lordFAST: sensitive and Fast Alignment Search Tool for LOng noisy Read sequencing Data.

Bioinformatics. 2019-1-1

[4]
Discovery of tandem and interspersed segmental duplications using high-throughput sequencing.

Bioinformatics. 2019-10-15

[5]
BleTIES: annotation of natural genome editing in ciliates using long read sequencing.

Bioinformatics. 2021-11-5

[6]
Evaluation of tools for long read RNA-seq splice-aware alignment.

Bioinformatics. 2018-3-1

[7]
Noise-cancelling repeat finder: uncovering tandem repeats in error-prone long-read sequencing data.

Bioinformatics. 2019-11-1

[8]
MsPAC: a tool for haplotype-phased structural variant detection.

Bioinformatics. 2020-2-1

[9]
SVJedi: genotyping structural variations with long reads.

Bioinformatics. 2020-11-1

[10]
NucBreak: location of structural errors in a genome assembly by using paired-end Illumina reads.

BMC Bioinformatics. 2020-2-21

引用本文的文献

[1]
BVSim: A benchmarking variation simulator mimicking human variation spectrum.

Gigascience. 2025-1-6

[2]
Structural Variants: Mechanisms, Mapping, and Interpretation in Human Genetics.

Genes (Basel). 2025-7-29

[3]
A Hitchhiker Guide to Structural Variant Calling: A Comprehensive Benchmark Through Different Sequencing Technologies.

Biomedicines. 2025-8-9

[4]
Long read whole genome sequencing-based discovery of structural variants and their role in aetiology of non-syndromic autism spectrum disorder in India.

BMC Med Genomics. 2025-8-20

[5]
TRsv: simultaneous detection of tandem repeat variations, structural variations, and short indels using long read sequencing data.

Genome Biol. 2025-8-20

[6]
How Structural Variations Influence Crop Improvement.

Int J Mol Sci. 2025-7-10

[7]
The crosstalk between host and rumen microbiome in cattle: insights from multi-omics approaches and genome-wide association studies.

World J Microbiol Biotechnol. 2025-7-28

[8]
Complex genetic variation in nearly complete human genomes.

Nature. 2025-7-23

[9]
ASVBM: Structural variant benchmarking with local joint analysis for multiple callsets.

Comput Struct Biotechnol J. 2025-6-29

[10]
A haplotype-resolved pangenome of the barley wild relative Hordeum bulbosum.

Nature. 2025-7-9

本文引用的文献

[1]
Multi-platform discovery of haplotype-resolved structural variation in human genomes.

Nat Commun. 2019-4-16

[2]
Minimap2: pairwise alignment for nucleotide sequences.

Bioinformatics. 2018-9-15

[3]
Accurate detection of complex structural variations using single-molecule sequencing.

Nat Methods. 2018-4-30

[4]
Piercing the dark matter: bioinformatics of long-range sequencing and mapping.

Nat Rev Genet. 2018-6

[5]
Nanopore sequencing and assembly of a human genome with ultra-long reads.

Nat Biotechnol. 2018-1-29

[6]
Long-read genome sequencing identifies causal structural variation in a Mendelian disease.

Genet Med. 2017-6-22

[7]
Discovery and genotyping of structural variation from long-read haploid genome sequence data.

Genome Res. 2017-5

[8]
SimLoRD: Simulation of Long Read Data.

Bioinformatics. 2016-9-1

[9]
An Incomplete Understanding of Human Genetic Variation.

Genetics. 2016-4

[10]
Mechanisms underlying structural variant formation in genomic disorders.

Nat Rev Genet. 2016-4

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索