stLFRsv: A Germline Structural Variant Analysis Pipeline Using Co-barcoded Reads.

Authors

Guo Junfu, Shi Chang, Chen Xi, Wang Ou, Liu Ping, Yang Huanming, Xu Xun, Zhang Wenwei, Zhu Hongmei

Affiliations

BGI-Tianjin, BGI-Shenzhen, Tianjin, China.

BGI-Shenzhen, Shenzhen, China.

Publication details

Front Genet. 2021 Mar 18;12:636239. doi: 10.3389/fgene.2021.636239. eCollection 2021.

DOI:10.3389/fgene.2021.636239
PMID:33815469
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8012683/
Abstract

Co-barcoded reads originating from long DNA fragments (mean length >30 kbp) maintain both single base level accuracy and long-range genomic information. We propose a pipeline, stLFRsv, to detect structural variation using co-barcoded reads. stLFRsv identifies abnormal large gaps between co-barcoded reads to detect potential breakpoints and reconstruct complex structural variants (SVs). Haplotype phasing by co-barcoded reads increases the signal to noise ratio, and barcode sharing profiles are used to filter out false positives. We integrate the short read SV caller smoove for smaller variants with stLFRsv. The integrated pipeline was evaluated on the well-characterized genome HG002/NA24385, and 74.5% precision and a 22.4% recall rate were obtained for deletions. stLFRsv revealed some large variants not included in the benchmark set that were verified by long reads or assembly. For the HG001/NA12878 genome, stLFRsv also achieved the best performance for both resource usage and the detection of large variants. Our work indicates that co-barcoded read technology has the potential to improve genome completeness.
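
The gap-based idea in the abstract can be illustrated with a toy sketch. This is not the stLFRsv implementation: the function names, the input layout of (barcode, position) pairs on a single chromosome, the `max_gap` threshold, and the Jaccard-style sharing score are all assumptions made for illustration. The sketch groups read positions by barcode, reports a candidate breakpoint pair wherever consecutive reads with the same barcode are unusually far apart, and scores how strongly two regions share barcodes.

```python
from collections import defaultdict


def split_by_gap(positions, max_gap):
    """Split read positions from one barcode into fragments.

    A new fragment starts whenever two consecutive (sorted) positions
    are more than max_gap apart.
    """
    positions = sorted(positions)
    if not positions:
        return []
    fragments = [[positions[0]]]
    for prev, cur in zip(positions, positions[1:]):
        if cur - prev > max_gap:
            fragments.append([cur])
        else:
            fragments[-1].append(cur)
    return fragments


def large_gaps(reads, max_gap):
    """Return (barcode, left_end, right_start) for every large gap.

    reads: iterable of (barcode, position) pairs on one chromosome.
    Each reported gap is a candidate pair of breakpoints (hypothetical
    simplification of the approach described in the abstract).
    """
    by_barcode = defaultdict(list)
    for barcode, pos in reads:
        by_barcode[barcode].append(pos)

    gaps = []
    for barcode in sorted(by_barcode):
        fragments = split_by_gap(by_barcode[barcode], max_gap)
        for left, right in zip(fragments, fragments[1:]):
            gaps.append((barcode, left[-1], right[0]))
    return gaps


def barcode_sharing(barcodes_a, barcodes_b):
    """Jaccard-style fraction of barcodes shared by two regions.

    An assumed stand-in for a barcode sharing profile: a real SV should
    be supported by many barcodes seen on both sides of the breakpoint.
    """
    a, b = set(barcodes_a), set(barcodes_b)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


reads = [
    ("bc1", 1000), ("bc1", 3000), ("bc1", 90000), ("bc1", 92000),
    ("bc2", 2000), ("bc2", 91000),
    ("bc3", 5000), ("bc3", 7000),
]
candidates = large_gaps(reads, max_gap=50000)
print(candidates)
print(barcode_sharing({"bc1", "bc2", "bc3"}, {"bc1", "bc2"}))
```

In this toy input, barcodes `bc1` and `bc2` both show a gap of more than 50 kbp around the same region while `bc3` does not, so two barcodes support a candidate event there. A real pipeline would additionally cluster nearby gap ends, use phasing, and calibrate thresholds against the observed fragment length distribution.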

