Extended haplotype-phasing of long-read de novo genome assemblies using Hi-C.

Affiliations

Phase Genomics, Seattle, WA, USA.

Pacific Biosciences, Menlo Park, CA, USA.

Publication information

Nat Commun. 2021 Apr 28;12(1):1935. doi: 10.1038/s41467-020-20536-y.

DOI:10.1038/s41467-020-20536-y
PMID:33911078
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8081726/
Abstract

Haplotype-resolved genome assemblies are important for understanding how combinations of variants impact phenotypes. To date, these assemblies have been best created with complex protocols, such as cultured cells that contain a single-haplotype (haploid) genome, single cells where haplotypes are separated, or co-sequencing of parental genomes in a trio-based approach. These approaches are impractical in most situations. To address this issue, we present FALCON-Phase, a phasing tool that uses ultra-long-range Hi-C chromatin interaction data to extend phase blocks of partially-phased diploid assemblies to chromosome or scaffold scale. FALCON-Phase uses the inherent phasing information in Hi-C reads, skipping variant calling, and reduces the computational complexity of phasing. Our method is validated on three benchmark datasets generated as part of the Vertebrate Genomes Project (VGP), including human, cow, and zebra finch, for which high-quality, fully haplotype-resolved assemblies are available using the trio-based approach. FALCON-Phase is accurate without parental data, and it performs better in samples with higher heterozygosity. For cow and zebra finch the accuracy is 97%, compared to 80-91% for human. FALCON-Phase is applicable to any draft assembly that contains long primary contigs and phased associate contigs.
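
The abstract's central idea, that Hi-C read pairs carry phasing information because they link sequence on the same haplotype more often than across haplotypes, can be illustrated with a toy sketch. This is not FALCON-Phase's algorithm or code: the function name `phase_blocks`, the `links` data layout, and the link counts are all hypothetical, and a simple greedy rule stands in for whatever procedure the tool actually uses. Each phase block has two haplotype copies (0 and 1), and the sketch decides, block by block, whether to swap the copies so that most Hi-C links connect copies with the same label.

```python
from typing import Dict, List, Tuple

# Hypothetical layout: (block_i, hap_i, block_j, hap_j) -> number of Hi-C
# read pairs linking haplotype copy hap_i of block_i to hap_j of block_j.
Links = Dict[Tuple[int, int, int, int], int]


def phase_blocks(n_blocks: int, links: Links) -> List[int]:
    """Greedy toy phasing: return one 0/1 flip per phase block.

    A flip of 1 means the two haplotype copies of that block should be
    swapped to agree with block 0, which fixes the reference orientation.
    """
    flips = [0] * n_blocks
    for j in range(1, n_blocks):
        keep = swap = 0
        for i in range(j):  # compare against every already-phased block
            for hi in (0, 1):
                for hj in (0, 1):
                    n = links.get((i, hi, j, hj), 0)
                    # Label of block i's copy after its chosen flip.
                    phased_i = hi ^ flips[i]
                    if phased_i == hj:
                        keep += n  # supports leaving block j as is
                    else:
                        swap += n  # supports swapping block j's copies
        flips[j] = 1 if swap > keep else 0
    return flips


# Made-up counts: block 1 is stored in the opposite orientation to
# blocks 0 and 2, so most of its links go to the "other" copy.
links: Links = {
    (0, 0, 1, 1): 30, (0, 1, 1, 0): 28, (0, 0, 1, 0): 3, (0, 1, 1, 1): 2,
    (1, 0, 2, 1): 25, (1, 1, 2, 0): 27, (1, 0, 2, 0): 4, (1, 1, 2, 1): 1,
    (0, 0, 2, 0): 10, (0, 1, 2, 1): 12,
}
print(phase_blocks(3, links))  # [0, 1, 0]
```

Because the decision uses only link counts between block copies, no variant calling is involved, which mirrors the property the abstract highlights. A greedy pass like this can propagate an early mistake to later blocks, so it should be read only as an illustration of the signal, not as a practical phasing method.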

Similar articles

1. Extended haplotype-phasing of long-read de novo genome assemblies using Hi-C.
   Nat Commun. 2021 Apr 28;12(1):1935. doi: 10.1038/s41467-020-20536-y.
2. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies.
   BMC Bioinformatics. 2018 Nov 29;19(1):460. doi: 10.1186/s12859-018-2485-7.
3. Physical separation of haplotypes in dikaryons allows benchmarking of phasing accuracy in Nanopore and HiFi assemblies with Hi-C data.
   Genome Biol. 2022 Mar 25;23(1):84. doi: 10.1186/s13059-022-02658-2.
4. Fully phased human genome assembly without parental data using single-cell strand sequencing and long reads.
   Nat Biotechnol. 2021 Mar;39(3):302-308. doi: 10.1038/s41587-020-0719-5. Epub 2020 Dec 7.
5. Genome assembly and haplotyping with Hi-C.
   Nat Biotechnol. 2013 Dec;31(12):1099-101. doi: 10.1038/nbt.2764.
6. HapCUT2: robust and accurate haplotype assembly for diverse sequencing technologies.
   Genome Res. 2017 May;27(5):801-812. doi: 10.1101/gr.213462.116. Epub 2016 Dec 9.
7. Integrating read-based and population-based phasing for dense and accurate haplotyping of individual genomes.
   Bioinformatics. 2019 Jul 15;35(14):i242-i248. doi: 10.1093/bioinformatics/btz329.
8. Benchmarking multi-platform sequencing technologies for human genome assembly.
   Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad300.
9. Graphasing: phasing diploid genome assembly graphs with single-cell strand sequencing.
   Genome Biol. 2024 Oct 10;25(1):265. doi: 10.1186/s13059-024-03409-1.
10. De novo assembly and phasing of a Korean human genome.
   Nature. 2016 Oct 13;538(7624):243-247. doi: 10.1038/nature20098. Epub 2016 Oct 5.

Cited by

1. High-quality, haplotype-resolved reference genomes of the Dutch warmblood horse and Friesian horse using trio binning.
   BMC Genomics. 2025 Sep 1;26(1):790. doi: 10.1186/s12864-025-11985-0.
2. Chromatin dynamics of a large-sized genome provides insights into polyphenism and X0 dosage compensation of locusts.
   Nat Genet. 2025 Sep 1. doi: 10.1038/s41588-025-02330-y.
3. The Genomic Basis of the Tristylous Floral Polymorphism: Evidence for a Role of Gene Duplications in a Region of Restricted Recombination.
   Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf170.
4. The impact of telomere-to-telomere genome assembly in the plant pan-genomics era.
   Breed Sci. 2025 Mar;75(1):3-12. doi: 10.1270/jsbbs.24065. Epub 2025 Feb 21.
5. HPTAS: An Alignment-Free Haplotype Phasing Algorithm Focused on Allele-Specific Studies Using Transcriptome Data.
   Int J Mol Sci. 2025 Jun 13;26(12):5700. doi: 10.3390/ijms26125700.
6. PISAD: reference-free intraspecies sample anomalies detection tool based on k-mer counting.
   Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf061.
7. Verkko2 integrates proximity-ligation data with long-read De Bruijn graphs for efficient telomere-to-telomere genome assembly, phasing, and scaffolding.
   Genome Res. 2025 Jun 12. doi: 10.1101/gr.280383.124.
8. Near-complete Middle Eastern genomes refine autozygosity and enhance disease-causing and population-specific variant discovery.
   Nat Genet. 2025 May;57(5):1119-1131. doi: 10.1038/s41588-025-02173-7. Epub 2025 May 5.
9. Establishing genome sequencing and assembly for non-model and emerging model organisms: a brief guide.
   Front Zool. 2025 Apr 17;22(1):7. doi: 10.1186/s12983-025-00561-7.
10. Genomics Research on the Road of Studying Biology and Virulence of Cereal Rust Fungi.
   Mol Plant Pathol. 2025 Apr;26(4):e70082. doi: 10.1111/mpp.70082.

References

1. Towards complete and error-free genome assemblies of all vertebrate species.
   Nature. 2021 Apr;592(7856):737-746. doi: 10.1038/s41586-021-03451-0. Epub 2021 Apr 28.
2. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm.
   Nat Methods. 2021 Feb;18(2):170-175. doi: 10.1038/s41592-020-01056-5. Epub 2021 Feb 1.
3. Chromosome-scale, haplotype-resolved assembly of human genomes.
   Nat Biotechnol. 2021 Mar;39(3):309-312. doi: 10.1038/s41587-020-0711-0. Epub 2020 Dec 7.
4. Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies.
   Genome Biol. 2020 Sep 14;21(1):245. doi: 10.1186/s13059-020-02134-9.
5. Haplotype-resolved genomes provide insights into structural variation and gene content in Angus and Brahman cattle.
   Nat Commun. 2020 Apr 29;11(1):2071. doi: 10.1038/s41467-020-15848-y.
6. Identifying and removing haplotypic duplication in primary genome assemblies.
   Bioinformatics. 2020 May 1;36(9):2896-2898. doi: 10.1093/bioinformatics/btaa025.
7. breakpointR: an R/Bioconductor package to localize strand state changes in Strand-seq data.
   Bioinformatics. 2020 Feb 15;36(4):1260-1261. doi: 10.1093/bioinformatics/btz681.
8. Multi-platform discovery of haplotype-resolved structural variation in human genomes.
   Nat Commun. 2019 Apr 16;10(1):1784. doi: 10.1038/s41467-018-08148-z.
9. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies.
   BMC Bioinformatics. 2018 Nov 29;19(1):460. doi: 10.1186/s12859-018-2485-7.
10. De novo assembly of haplotype-resolved genomes with trio binning.
   Nat Biotechnol. 2018 Oct 22. doi: 10.1038/nbt.4277.