• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Aquila_stLFR:用于stLFR连接 reads 的基于二倍体基因组组装的结构变异检测软件包。

Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads.

作者信息

Liu Yichen Henry, Grubbs Griffin L, Zhang Lu, Fang Xiaodong, Dill David L, Sidow Arend, Zhou Xin

机构信息

Department of Computer Science, Vanderbilt University, Nashville, TN 37235, USA.

Department of Biomedical Engineering, Vanderbilt University, Nashville, TN 37235, USA.

出版信息

Bioinform Adv. 2021 Jun 16;1(1):vbab007. doi: 10.1093/bioadv/vbab007. eCollection 2021.

DOI:10.1093/bioadv/vbab007
PMID:36700103
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9710574/
Abstract

MOTIVATION

Identifying structural variants (SVs) is critical in health and disease, however, detecting them remains a challenge. Several linked-read sequencing technologies, including 10X Genomics, TELL-Seq and single tube long fragment read (stLFR), have been recently developed as cost-effective approaches to reconstruct multi-megabase haplotypes (phase blocks) from sequence data of a single sample. These technologies provide an optimal sequencing platform to characterize SVs, though few computational algorithms can utilize them. Thus, we developed Aquila_stLFR, an approach that resolves SVs through haplotype-based assembly of stLFR linked-reads.

RESULTS

Aquila_stLFR first partitions long fragment reads into two haplotype-specific blocks with the assistance of the high-quality reference genome, by taking advantage of the potential phasing ability of the linked-read itself. Each haplotype is then assembled independently, to achieve a complete diploid assembly to finally reconstruct the genome-wide SVs. We benchmarked Aquila_stLFR on a well-studied sample, NA24385, and showed Aquila_stLFR can detect medium to large size deletions (50 bp-10 kb) with high sensitivity and medium-size insertions (50 bp-1 kb) with high specificity.

AVAILABILITY AND IMPLEMENTATION

Source code and documentation are available on https://github.com/maiziex/Aquila_stLFR.

SUPPLEMENTARY INFORMATION

Supplementary data are available at online.

摘要

动机

识别结构变异(SVs)在健康与疾病研究中至关重要,然而,检测它们仍然是一项挑战。最近已开发出几种连接读长测序技术,包括10X基因组学、TELL-Seq和单管长片段读长(stLFR),作为从单个样本的序列数据中重建多兆碱基单倍型(相位块)的经济高效方法。这些技术为表征SVs提供了一个最佳测序平台,尽管很少有计算算法能够利用它们。因此,我们开发了Aquila_stLFR,一种通过基于单倍型的stLFR连接读长组装来解析SVs的方法。

结果

Aquila_stLFR首先借助高质量参考基因组,利用连接读长本身潜在的定相能力,将长片段读长划分为两个特定单倍型块。然后分别对每个单倍型进行组装,以实现完整的二倍体组装,最终重建全基因组的SVs。我们在经过充分研究的样本NA24385上对Aquila_stLFR进行了基准测试,结果表明Aquila_stLFR能够以高灵敏度检测中等至大尺寸的缺失(50 bp - 10 kb),并以高特异性检测中等尺寸的插入(50 bp - 1 kb)。

可用性与实现方式

源代码和文档可在https://github.com/maiziex/Aquila_stLFR获取。

补充信息

补充数据可在网上获取。

相似文献

1
Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads.Aquila_stLFR:用于stLFR连接 reads 的基于二倍体基因组组装的结构变异检测软件包。
Bioinform Adv. 2021 Jun 16;1(1):vbab007. doi: 10.1093/bioadv/vbab007. eCollection 2021.
2
Haplotyping-Assisted Diploid Assembly and Variant Detection with Linked Reads.基于连锁reads 的单体型辅助二倍体组装和变异检测。
Methods Mol Biol. 2023;2590:161-182. doi: 10.1007/978-1-0716-2819-5_11.
3
Automated filtering of genome-wide large deletions through an ensemble deep learning framework.通过集成深度学习框架自动筛选全基因组大片段缺失。
Methods. 2022 Oct;206:77-86. doi: 10.1016/j.ymeth.2022.08.001. Epub 2022 Aug 28.
4
BLR: a flexible pipeline for haplotype analysis of multiple linked-read technologies.BLR:一种用于多种链接读取技术的单倍型分析的灵活管道。
Nucleic Acids Res. 2023 Dec 11;51(22):e114. doi: 10.1093/nar/gkad1010.
5
Efficient and unique cobarcoding of second-generation sequencing reads from long DNA molecules enabling cost-effective and accurate sequencing, haplotyping, and de novo assembly.高效且独特的第二代测序读长 DNA 分子 cobarcoding,实现经济高效、准确的测序、单倍型分析和从头组装。
Genome Res. 2019 May;29(5):798-808. doi: 10.1101/gr.245126.118. Epub 2019 Apr 2.
6
Aquila enables reference-assisted diploid personal genome assembly and comprehensive variant detection based on linked reads.Aquila能够基于连接片段实现参考辅助的二倍体个人基因组组装和全面的变异检测。
Nat Commun. 2021 Feb 17;12(1):1077. doi: 10.1038/s41467-021-21395-x.
7
IterCluster: a barcode clustering algorithm for long fragment read analysis.IterCluster:一种用于长片段读取分析的条形码聚类算法。
PeerJ. 2020 Mar 24;8:e8431. doi: 10.7717/peerj.8431. eCollection 2020.
8
LongPhase: an ultra-fast chromosome-scale phasing algorithm for small and large variants.LongPhase:一种用于小变异和大变异的超快速染色体规模定相算法。
Bioinformatics. 2022 Mar 28;38(7):1816-1822. doi: 10.1093/bioinformatics/btac058.
9
JTK: targeted diploid genome assembler.JTK:靶向二倍体基因组组装器。
Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad398.
10
Structural variant analysis for linked-read sequencing data with gemtools.使用 gemtools 进行链接读取测序数据的结构变异分析。
Bioinformatics. 2019 Nov 1;35(21):4397-4399. doi: 10.1093/bioinformatics/btz239.

引用本文的文献

1
Large indel detection in region-based phased diploid assemblies from linked-reads.基于连接 reads 的区域分阶段二倍体组装中的大片段插入缺失检测
BMC Genomics. 2025 Mar 18;26(Suppl 2):263. doi: 10.1186/s12864-025-11398-z.
2
AsmMix: an efficient haplotype-resolved hybrid genome assembling pipeline.AsmMix:一种高效的单倍型解析混合基因组组装流程。
Front Genet. 2024 Jul 26;15:1421565. doi: 10.3389/fgene.2024.1421565. eCollection 2024.
3
The Bioinformatic Applications of Hi-C and Linked Reads.Hi-C 和链接读取的生物信息学应用。
Genomics Proteomics Bioinformatics. 2024 Oct 15;22(4). doi: 10.1093/gpbjnl/qzae048.
4
BLR: a flexible pipeline for haplotype analysis of multiple linked-read technologies.BLR:一种用于多种链接读取技术的单倍型分析的灵活管道。
Nucleic Acids Res. 2023 Dec 11;51(22):e114. doi: 10.1093/nar/gkad1010.
5
MTG-Link: leveraging barcode information from linked-reads to assemble specific loci.MTG-Link:利用来自链接读取的条形码信息来组装特定的基因座。
BMC Bioinformatics. 2023 Jul 14;24(1):284. doi: 10.1186/s12859-023-05395-w.

本文引用的文献

1
Aquila enables reference-assisted diploid personal genome assembly and comprehensive variant detection based on linked reads.Aquila能够基于连接片段实现参考辅助的二倍体个人基因组组装和全面的变异检测。
Nat Commun. 2021 Feb 17;12(1):1077. doi: 10.1038/s41467-021-21395-x.
2
Efficient and unique cobarcoding of second-generation sequencing reads from long DNA molecules enabling cost-effective and accurate sequencing, haplotyping, and de novo assembly.高效且独特的第二代测序读长 DNA 分子 cobarcoding,实现经济高效、准确的测序、单倍型分析和从头组装。
Genome Res. 2019 May;29(5):798-808. doi: 10.1101/gr.245126.118. Epub 2019 Apr 2.
3
HAPDeNovo: a haplotype-based approach for filtering and phasing de novo mutations in linked read sequencing data.HAPDeNovo:一种基于单倍型的方法,用于过滤和相位-linked read 测序数据中的新突变。
BMC Genomics. 2018 Jun 18;19(1):467. doi: 10.1186/s12864-018-4867-7.
4
Minimap2: pairwise alignment for nucleotide sequences.Minimap2:核苷酸序列的两两比对。
Bioinformatics. 2018 Sep 15;34(18):3094-3100. doi: 10.1093/bioinformatics/bty191.
5
Identifying structural variants using linked-read sequencing data.使用连接读长测序数据鉴定结构变异体。
Bioinformatics. 2018 Jan 15;34(2):353-360. doi: 10.1093/bioinformatics/btx712.
6
Genome-wide reconstruction of complex structural variants using read clouds.利用读取云进行复杂结构变异的全基因组重建。
Nat Methods. 2017 Sep;14(9):915-920. doi: 10.1038/nmeth.4366. Epub 2017 Jul 17.
7
Long Fragment Read (LFR) Technology: Cost-Effective, High-Quality Genome-Wide Molecular Haplotyping.长片段读取(LFR)技术:具有成本效益的高质量全基因组分子单倍型分型
Methods Mol Biol. 2017;1551:191-205. doi: 10.1007/978-1-4939-6750-6_11.
8
Haplotyping germline and cancer genomes with high-throughput linked-read sequencing.利用高通量连锁读长测序对种系和癌症基因组进行单倍型分型
Nat Biotechnol. 2016 Mar;34(3):303-11. doi: 10.1038/nbt.3432. Epub 2016 Feb 1.
9
SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.SPAdes:一种新的基因组组装算法及其在单细胞测序中的应用
J Comput Biol. 2012 May;19(5):455-77. doi: 10.1089/cmb.2012.0021. Epub 2012 Apr 16.