• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用长读长进行基因分型结构变异。

SVJedi: genotyping structural variations with long reads.

机构信息

Univ Rennes, Inria, CNRS, IRISA, F-35000 Rennes, France.

出版信息

Bioinformatics. 2020 Nov 1;36(17):4568-4575. doi: 10.1093/bioinformatics/btaa527.

DOI:10.1093/bioinformatics/btaa527
PMID:32437523
Abstract

MOTIVATION

Studies on structural variants (SVs) are expanding rapidly. As a result, and thanks to third generation sequencing technologies, the number of discovered SVs is increasing, especially in the human genome. At the same time, for several applications such as clinical diagnoses, it is important to genotype newly sequenced individuals on well-defined and characterized SVs. Whereas several SV genotypers have been developed for short read data, there is a lack of such dedicated tool to assess whether known SVs are present or not in a new long read sequenced sample, such as the one produced by Pacific Biosciences or Oxford Nanopore Technologies.

RESULTS

We present a novel method to genotype known SVs from long read sequencing data. The method is based on the generation of a set of representative allele sequences that represent the two alleles of each structural variant. Long reads are aligned to these allele sequences. Alignments are then analyzed and filtered out to keep only informative ones, to quantify and estimate the presence of each SV allele and the allele frequencies. We provide an implementation of the method, SVJedi, to genotype SVs with long reads. The tool has been applied to both simulated and real human datasets and achieves high genotyping accuracy. We show that SVJedi obtains better performances than other existing long read genotyping tools and we also demonstrate that SV genotyping is considerably improved with SVJedi compared to other approaches, namely SV discovery and short read SV genotyping approaches.

AVAILABILITY AND IMPLEMENTATION

https://github.com/llecompte/SVJedi.git.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

结构变异(SVs)的研究正在迅速扩展。因此,由于第三代测序技术的发展,已发现的 SV 数量正在增加,尤其是在人类基因组中。同时,对于临床诊断等几种应用,对新测序的个体进行明确定义和特征化的 SV 基因分型非常重要。虽然已经开发了几种用于短读数据的 SV 基因分型器,但缺乏这样的专用工具来评估新的长读测序样本中是否存在已知的 SV,例如 Pacific Biosciences 或 Oxford Nanopore Technologies 生产的样本。

结果

我们提出了一种从长读测序数据中对已知 SV 进行基因分型的新方法。该方法基于生成一组代表每个结构变异的两个等位基因的代表性等位基因序列。将长读与这些等位基因序列对齐。然后分析和过滤对齐,以保留唯一的信息,以量化和估计每个 SV 等位基因的存在和等位基因频率。我们提供了一种用于对长读进行 SV 基因分型的方法 SVJedi 的实现。该工具已应用于模拟和真实的人类数据集,并实现了高基因分型准确性。我们表明,SVJedi 比其他现有的长读基因分型工具具有更好的性能,并且与其他方法(即 SV 发现和短读 SV 基因分型方法)相比,SV 基因分型得到了极大的改进。

可用性和实现

https://github.com/llecompte/SVJedi.git。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
SVJedi: genotyping structural variations with long reads.使用长读长进行基因分型结构变异。
Bioinformatics. 2020 Nov 1;36(17):4568-4575. doi: 10.1093/bioinformatics/btaa527.
2
SVJedi-graph: improving the genotyping of close and overlapping structural variants with long reads using a variation graph.SVJedi-graph:使用变异图提高长读长对紧密和重叠结构变异的基因分型。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i270-i278. doi: 10.1093/bioinformatics/btad237.
3
GGTyper: genotyping complex structural variants using short-read sequencing data.GGTyper:使用短读测序数据进行基因分型复杂结构变异。
Bioinformatics. 2024 Sep 1;40(Suppl 2):ii11-ii19. doi: 10.1093/bioinformatics/btae391.
4
NPSV-deep: a deep learning method for genotyping structural variants in short read genome sequencing data.NPSV-deep:一种用于在短读长基因组测序数据中进行基因分型结构变体的深度学习方法。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae129.
5
NPSV: A simulation-driven approach to genotyping structural variants in whole-genome sequencing data.NPSV:一种基于模拟的全基因组测序数据分析中结构变异基因分型方法。
Gigascience. 2021 Jul 1;10(7). doi: 10.1093/gigascience/giab046.
6
Comprehensive evaluation of structural variant genotyping methods based on long-read sequencing data.基于长读测序数据的结构变异基因分型方法的综合评估。
BMC Genomics. 2022 Apr 23;23(1):324. doi: 10.1186/s12864-022-08548-y.
7
HQAlign: aligning nanopore reads for SV detection using current-level modeling.HQAlign:使用电流水平建模对齐纳米孔读取以进行 SV 检测。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad580.
8
MAMnet: detecting and genotyping deletions and insertions based on long reads and a deep learning approach.MAMnet:基于长读长和深度学习方法检测和基因分型缺失和插入。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac195.
9
Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data.纳米孔测序数据的种系结构变异检测方法评估
Front Genet. 2021 Nov 18;12:761791. doi: 10.3389/fgene.2021.761791. eCollection 2021.
10
Comparison of structural variants detected by PacBio-CLR and ONT sequencing in pear.梨中 PacBio-CLR 和 ONT 测序检测到的结构变异比较。
BMC Genomics. 2022 Dec 14;23(1):830. doi: 10.1186/s12864-022-09074-7.

引用本文的文献

1
cuteFC: regenotyping structural variants through an accurate and efficient force-calling method.cuteFC:通过一种准确高效的强制调用方法对结构变异进行重新基因分型。
Genome Biol. 2025 Jun 13;26(1):166. doi: 10.1186/s13059-025-03642-2.
2
SVDF: enhancing structural variation detect from long-read sequencing via automatic filtering strategies.SVDF:通过自动过滤策略增强长读测序中的结构变异检测
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae336.
3
Detection of mosaic and population-level structural variants with Sniffles2.使用 Sniffles2 检测嵌合体和群体水平的结构变异。
Nat Biotechnol. 2024 Oct;42(10):1571-1580. doi: 10.1038/s41587-023-02024-y. Epub 2024 Jan 2.
4
Quartet DNA reference materials and datasets for comprehensively evaluating germline variant calling performance. Quartet DNA 参考资料和数据集,用于全面评估种系变异calling 性能。
Genome Biol. 2023 Nov 27;24(1):270. doi: 10.1186/s13059-023-03109-2.
5
SVJedi-graph: improving the genotyping of close and overlapping structural variants with long reads using a variation graph.SVJedi-graph:使用变异图提高长读长对紧密和重叠结构变异的基因分型。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i270-i278. doi: 10.1093/bioinformatics/btad237.
6
A survey of algorithms for the detection of genomic structural variants from long-read sequencing data.长读测序数据中基因组结构变异检测算法研究综述。
Nat Methods. 2023 Aug;20(8):1143-1158. doi: 10.1038/s41592-023-01932-w. Epub 2023 Jun 29.
7
Population-scale genotyping of structural variation in the era of long-read sequencing.长读长测序时代结构变异的群体规模基因分型
Comput Struct Biotechnol J. 2022 May 27;20:2639-2647. doi: 10.1016/j.csbj.2022.05.047. eCollection 2022.
8
Comprehensive evaluation of structural variant genotyping methods based on long-read sequencing data.基于长读测序数据的结构变异基因分型方法的综合评估。
BMC Genomics. 2022 Apr 23;23(1):324. doi: 10.1186/s12864-022-08548-y.
9
Towards population-scale long-read sequencing.迈向大规模长读长测序。
Nat Rev Genet. 2021 Sep;22(9):572-587. doi: 10.1038/s41576-021-00367-3. Epub 2021 May 28.
10
Identification and Genotyping of Transposable Element Insertions From Genome Sequencing Data.从基因组测序数据中鉴定和分型转座元件插入。
Curr Protoc Hum Genet. 2020 Sep;107(1):e102. doi: 10.1002/cphg.102.