• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于全基因组结构变异检测的二倍体基因组组装

diploid genome assembly for genome-wide structural variant detection.

作者信息

Zhang Lu, Zhou Xin, Weng Ziming, Sidow Arend

机构信息

Department of Computer Science, Hong Kong Baptist University, Kowloon Tong, Hong Kong.

Department of Pathology, 300 Pasteur Dr, Stanford University, Stanford, CA 94305, USA.

出版信息

NAR Genom Bioinform. 2019 Dec 6;2(1):lqz018. doi: 10.1093/nargab/lqz018. eCollection 2020 Mar.

DOI:10.1093/nargab/lqz018
PMID:33575568
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7671403/
Abstract

Detection of structural variants (SVs) on the basis of read alignment to a reference genome remains a difficult problem. assembly, traditionally used to generate reference genomes, offers an alternative for SV detection. However, it has not been applied broadly to human genomes because of fundamental limitations of short-fragment approaches and high cost of long-read technologies. We here show that 10× linked-read sequencing supports accurate SV detection. We examined variants in six 10× assemblies with diverse experimental parameters from two commonly used human cell lines: NA12878 and NA24385. The assemblies are effective for detecting mid-size SVs, which were discovered by simple pairwise alignment of the assemblies' contigs to the reference (hg38). Our study also shows that the base-pair level SV breakpoint accuracy is high, with a majority of SVs having precisely correct sizes and breakpoints. Setting the ancestral state of SV loci by comparing to ape orthologs allows inference of the actual molecular mechanism (insertion or deletion) causing the mutation. In about half of cases, the mechanism is the opposite of the reference-based call. We uncover 214 SVs that may have been maintained as polymorphisms in the human lineage since before our divergence from chimp. Overall, we show that assembly of 10× linked-read data can achieve cost-effective SV detection for personal genomes.

摘要

基于与参考基因组的读段比对来检测结构变异(SVs)仍然是一个难题。传统上用于生成参考基因组的组装方法为SV检测提供了一种替代方案。然而,由于短片段方法的根本局限性和长读长技术的高成本,它尚未广泛应用于人类基因组。我们在此表明,10× 连锁读段测序支持准确的SV检测。我们检查了来自两种常用人类细胞系(NA12878和NA24385)的六个具有不同实验参数的10× 组装中的变异。这些组装对于检测中等大小的SVs是有效的,这些SVs是通过将组装的重叠群与参考基因组(hg38)进行简单的成对比对发现的。我们的研究还表明,碱基对水平的SV断点准确性很高,大多数SVs具有精确正确的大小和断点。通过与猿类直系同源物比较来设定SV位点的祖先状态,可以推断导致突变的实际分子机制(插入或缺失)。在大约一半的情况下,该机制与基于参考的推断相反。我们发现了214个自人类与黑猩猩分化之前可能就作为多态性在人类谱系中保留下来的SVs。总体而言,我们表明10× 连锁读段数据的组装可以实现对个人基因组具有成本效益的SV检测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def3/7671403/a424feba9b6a/lqz018fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def3/7671403/e341ccb2c6ec/lqz018fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def3/7671403/c066161e615b/lqz018fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def3/7671403/222ee69051ca/lqz018fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def3/7671403/9ad96d8b78ef/lqz018fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def3/7671403/a424feba9b6a/lqz018fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def3/7671403/e341ccb2c6ec/lqz018fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def3/7671403/c066161e615b/lqz018fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def3/7671403/222ee69051ca/lqz018fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def3/7671403/9ad96d8b78ef/lqz018fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def3/7671403/a424feba9b6a/lqz018fig5.jpg

相似文献

1
diploid genome assembly for genome-wide structural variant detection.用于全基因组结构变异检测的二倍体基因组组装
NAR Genom Bioinform. 2019 Dec 6;2(1):lqz018. doi: 10.1093/nargab/lqz018. eCollection 2020 Mar.
2
Assessment of human diploid genome assembly with 10x Linked-Reads data.使用 10x 连锁读取数据评估人类二倍体基因组组装。
Gigascience. 2019 Nov 1;8(11). doi: 10.1093/gigascience/giz141.
3
VolcanoSV enables accurate and robust structural variant calling in diploid genomes from single-molecule long read sequencing.VolcanoSV 可实现基于单分子长读测序的二倍体基因组中准确稳健的结构变异 calling。
Nat Commun. 2024 Aug 13;15(1):6956. doi: 10.1038/s41467-024-51282-0.
4
Comparison and benchmark of structural variants detected from long read and long-read assembly.长读与长读组装检测到的结构变异的比较和基准测试。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad188.
5
Small allelic variants are a source of ancestral bias in structural variant breakpoint placement.小等位基因变异是结构变异断点定位中祖先偏差的一个来源。
bioRxiv. 2023 Jun 26:2023.06.25.546295. doi: 10.1101/2023.06.25.546295.
6
Genome sequencing in cytogenetics: Comparison of short-read and linked-read approaches for germline structural variant detection and characterization.基因组测序在细胞遗传学中的应用:短读长和连接读长方法在种系结构变异检测和特征分析中的比较。
Mol Genet Genomic Med. 2020 Mar;8(3):e1114. doi: 10.1002/mgg3.1114. Epub 2020 Jan 27.
7
NovoGraph: Human genome graph construction from multiple long-read assemblies.NovoGraph:基于多个长读长组装构建人类基因组图谱。
F1000Res. 2018 Sep 3;7:1391. doi: 10.12688/f1000research.15895.2. eCollection 2018.
8
Haplotyping-Assisted Diploid Assembly and Variant Detection with Linked Reads.基于连锁reads 的单体型辅助二倍体组装和变异检测。
Methods Mol Biol. 2023;2590:161-182. doi: 10.1007/978-1-0716-2819-5_11.
9
Tradeoffs in alignment and assembly-based methods for structural variant detection with long-read sequencing data.基于比对和组装的方法在长读测序数据结构变异检测中的权衡。
Nat Commun. 2024 Mar 19;15(1):2447. doi: 10.1038/s41467-024-46614-z.
10
Small polymorphisms are a source of ancestral bias in structural variant breakpoint placement.小的多态性是结构变异断点位置中祖先偏见的一个来源。
Genome Res. 2024 Feb 7;34(1):7-19. doi: 10.1101/gr.278203.123.

引用本文的文献

1
Blackbird: structural variant detection using synthetic and low-coverage long-reads.黑鹂:利用合成和低覆盖度长读段进行结构变异检测
Bioinform Adv. 2025 Jul 4;5(1):vbaf151. doi: 10.1093/bioadv/vbaf151. eCollection 2025.
2
Toward a Kinh Vietnamese Reference Genome: Constructing a De Novo Genome Assembly Using Long-Read Sequencing and Optical Mapping.迈向京族越南人参考基因组:利用长读长测序和光学图谱构建从头基因组组装
Genes (Basel). 2025 Apr 29;16(5):536. doi: 10.3390/genes16050536.
3
Large indel detection in region-based phased diploid assemblies from linked-reads.

本文引用的文献

1
Multi-platform discovery of haplotype-resolved structural variation in human genomes.多平台发现人类基因组中单体型分辨率结构变异。
Nat Commun. 2019 Apr 16;10(1):1784. doi: 10.1038/s41467-018-08148-z.
2
An open resource for accurately benchmarking small variant and reference calls.用于准确基准测试小型变体和参考调用的开放资源。
Nat Biotechnol. 2019 May;37(5):561-566. doi: 10.1038/s41587-019-0074-6. Epub 2019 Apr 1.
3
Resolving the full spectrum of human genome variation using Linked-Reads.利用连接读取技术解析人类基因组变异的全貌。
基于连接 reads 的区域分阶段二倍体组装中的大片段插入缺失检测
BMC Genomics. 2025 Mar 18;26(Suppl 2):263. doi: 10.1186/s12864-025-11398-z.
4
Blackbird: structural variant detection using synthetic and low-coverage long-reads.黑鹂:利用合成和低覆盖度长读段进行结构变异检测
bioRxiv. 2024 Nov 18:2024.11.17.624011. doi: 10.1101/2024.11.17.624011.
5
LRTK: a platform agnostic toolkit for linked-read analysis of both human genome and metagenome.LRTK:一个与平台无关的工具包,用于人类基因组和宏基因组的连锁读长分析。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae028.
6
SVDSS: structural variation discovery in hard-to-call genomic regions using sample-specific strings from accurate long reads.SVDSS:使用准确长读段中样本特异性字符串在难以测序的基因组区域发现结构变异。
Nat Methods. 2023 Apr;20(4):550-558. doi: 10.1038/s41592-022-01674-1. Epub 2022 Dec 22.
7
Efficient detection and assembly of non-reference DNA sequences with synthetic long reads.使用合成长读长进行非参考 DNA 序列的高效检测和组装。
Nucleic Acids Res. 2022 Oct 14;50(18):e108. doi: 10.1093/nar/gkac653.
8
Aquila enables reference-assisted diploid personal genome assembly and comprehensive variant detection based on linked reads.Aquila能够基于连接片段实现参考辅助的二倍体个人基因组组装和全面的变异检测。
Nat Commun. 2021 Feb 17;12(1):1077. doi: 10.1038/s41467-021-21395-x.
Genome Res. 2019 Apr;29(4):635-645. doi: 10.1101/gr.234443.118. Epub 2019 Mar 20.
4
Coding mutations in contribute to Parkinson's disease. 编码突变导致帕金森病。
Proc Natl Acad Sci U S A. 2018 Nov 6;115(45):11567-11572. doi: 10.1073/pnas.1809969115. Epub 2018 Oct 22.
5
High-quality genome sequences of uncultured microbes by assembly of read clouds.通过读取云组装获得未培养微生物的高质量基因组序列。
Nat Biotechnol. 2018 Oct 15. doi: 10.1038/nbt.4266.
6
De novo human genome assemblies reveal spectrum of alternative haplotypes in diverse populations.从头人类基因组组装揭示了不同人群中多种替代单倍型的图谱。
Nat Commun. 2018 Aug 2;9(1):3040. doi: 10.1038/s41467-018-05513-w.
7
Minimap2: pairwise alignment for nucleotide sequences.Minimap2:核苷酸序列的两两比对。
Bioinformatics. 2018 Sep 15;34(18):3094-3100. doi: 10.1093/bioinformatics/bty191.
8
Accurate detection of complex structural variations using single-molecule sequencing.利用单分子测序技术准确检测复杂结构变异。
Nat Methods. 2018 Jun;15(6):461-468. doi: 10.1038/s41592-018-0001-7. Epub 2018 Apr 30.
9
Nanopore sequencing and assembly of a human genome with ultra-long reads.纳米孔测序和超长读长组装人类基因组。
Nat Biotechnol. 2018 Apr;36(4):338-345. doi: 10.1038/nbt.4060. Epub 2018 Jan 29.
10
Reference quality assembly of the 3.5-Gb genome of from a single linked-read library.来自单个连接读数文库的35亿碱基基因组的参考质量组装。
Hortic Res. 2018 Jan 12;5:4. doi: 10.1038/s41438-017-0011-0. eCollection 2018.