• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

韩牛全基因组定相分析中Hi-C与10X基因组连接读长测序的比较

A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle.

作者信息

Srikanth Krishnamoorthy, Park Jong-Eun, Lim Dajeong, Cha Jihye, Cho Sang-Rae, Cho In-Cheol, Park Woncheoul

机构信息

Animal Genomics and Bioinformatics Division, National Institute of Animal Science, RDA, Wanju 55365, Korea.

Hanwoo Research Institute, National Institute of Animal Science, RDA, Pyeongchang 25340, Korea.

出版信息

Genes (Basel). 2020 Mar 20;11(3):332. doi: 10.3390/genes11030332.

DOI:10.3390/genes11030332
PMID:32245072
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7140831/
Abstract

Until recently, genome-scale phasing was limited due to the short read sizes of sequence data. Though the use of long-read sequencing can overcome this limitation, they require extensive error correction. The emergence of technologies such as 10X genomics linked read sequencing and Hi-C which uses short-read sequencers along with library preparation protocols that facilitates long-read assemblies have greatly reduced the complexities of genome scale phasing. Moreover, it is possible to accurately assemble phased genome of individual samples using these methods. Therefore, in this study, we compared three phasing strategies which included two sample preparation methods along with the Long Ranger pipeline of 10X genomics and HapCut2 software, namely 10X-LG, 10X-HapCut2, and HiC-HapCut2 and assessed their performance and accuracy. We found that the 10X-LG had the best phasing performance amongst the method analyzed. They had the highest phasing rate (89.6%), longest adjusted N50 (1.24 Mb), and lowest switch error rate (0.07%). Moreover, the phasing accuracy and yield of the 10X-LG stayed over 90% for distances up to 4 Mb and 550 Kb respectively, which were considerably higher than 10X-HapCut2 and Hi-C Hapcut2. The results of this study will serve as a good reference for future benchmarking studies and also for reference-based imputation in Hanwoo.

摘要

直到最近,由于序列数据的短读长,全基因组定相仍受到限制。虽然使用长读长测序可以克服这一限制,但它们需要大量的错误校正。诸如10X基因组学链接读长测序和Hi-C等技术的出现,这些技术使用短读长测序仪以及有助于长读长组装的文库制备方案,极大地降低了全基因组定相的复杂性。此外,使用这些方法可以准确地组装单个样本的定相基因组。因此,在本研究中,我们比较了三种定相策略,包括两种样本制备方法以及10X基因组学的Long Ranger流程和HapCut2软件,即10X-LG、10X-HapCut2和HiC-HapCut2,并评估了它们的性能和准确性。我们发现,在分析的方法中,10X-LG具有最佳的定相性能。它们具有最高的定相率(89.6%)、最长的调整后N50(1.24 Mb)和最低的切换错误率(0.07%)。此外,10X-LG的定相准确性和产量在距离分别达到4 Mb和550 Kb时分别保持在90%以上,这明显高于10X-HapCut2和Hi-C Hapcut2。本研究结果将为未来的基准研究以及韩牛基于参考的插补提供良好的参考。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/336d/7140831/337863947dae/genes-11-00332-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/336d/7140831/bb767b4ecaa3/genes-11-00332-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/336d/7140831/337863947dae/genes-11-00332-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/336d/7140831/bb767b4ecaa3/genes-11-00332-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/336d/7140831/337863947dae/genes-11-00332-g002.jpg

相似文献

1
A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle.韩牛全基因组定相分析中Hi-C与10X基因组连接读长测序的比较
Genes (Basel). 2020 Mar 20;11(3):332. doi: 10.3390/genes11030332.
2
HapCUT2: robust and accurate haplotype assembly for diverse sequencing technologies.HapCUT2:适用于多种测序技术的强大且准确的单倍型组装工具。
Genome Res. 2017 May;27(5):801-812. doi: 10.1101/gr.213462.116. Epub 2016 Dec 9.
3
Benchmarking phasing software with a whole-genome sequenced cattle pedigree.利用全基因组测序的牛系谱对相位软件进行基准测试。
BMC Genomics. 2022 Feb 15;23(1):130. doi: 10.1186/s12864-022-08354-6.
4
Physical separation of haplotypes in dikaryons allows benchmarking of phasing accuracy in Nanopore and HiFi assemblies with Hi-C data.双核体中单倍型的物理分离允许使用 Hi-C 数据对 Nanopore 和 HiFi 组装的相位准确性进行基准测试。
Genome Biol. 2022 Mar 25;23(1):84. doi: 10.1186/s13059-022-02658-2.
5
Integrating read-based and population-based phasing for dense and accurate haplotyping of individual genomes.基于读取和基于群体的相位整合,实现个体基因组的密集和精确单倍型分型。
Bioinformatics. 2019 Jul 15;35(14):i242-i248. doi: 10.1093/bioinformatics/btz329.
6
BLR: a flexible pipeline for haplotype analysis of multiple linked-read technologies.BLR:一种用于多种链接读取技术的单倍型分析的灵活管道。
Nucleic Acids Res. 2023 Dec 11;51(22):e114. doi: 10.1093/nar/gkad1010.
7
HapCUT2: A Method for Phasing Genomes Using Experimental Sequence Data.HapCUT2:一种使用实验序列数据进行基因组相位分析的方法。
Methods Mol Biol. 2023;2590:139-147. doi: 10.1007/978-1-0716-2819-5_9.
8
PhaseME: Automatic rapid assessment of phasing quality and phasing improvement.PhaseME:自动快速评估相位质量和相位改进。
Gigascience. 2020 Jul 1;9(7). doi: 10.1093/gigascience/giaa078.
9
Linked-read sequencing enables haplotype-resolved resequencing at population scale.链接读取测序能够以群体规模实现单倍型分辨率重测序。
Mol Ecol Resour. 2020 Sep;20(5):1311-1322. doi: 10.1111/1755-0998.13192. Epub 2020 Jun 29.
10
Genome sequencing in cytogenetics: Comparison of short-read and linked-read approaches for germline structural variant detection and characterization.基因组测序在细胞遗传学中的应用:短读长和连接读长方法在种系结构变异检测和特征分析中的比较。
Mol Genet Genomic Med. 2020 Mar;8(3):e1114. doi: 10.1002/mgg3.1114. Epub 2020 Jan 27.

引用本文的文献

1
The Bioinformatic Applications of Hi-C and Linked Reads.Hi-C 和链接读取的生物信息学应用。
Genomics Proteomics Bioinformatics. 2024 Oct 15;22(4). doi: 10.1093/gpbjnl/qzae048.

本文引用的文献

1
Accurate, scalable and integrative haplotype estimation.精确、可扩展且综合的单倍型估计。
Nat Commun. 2019 Nov 28;10(1):5436. doi: 10.1038/s41467-019-13225-y.
2
Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data.基于 Hi-C 数据的等位基因感知的染色体规模自倍性基因组组装。
Nat Plants. 2019 Aug;5(8):833-845. doi: 10.1038/s41477-019-0487-8. Epub 2019 Aug 5.
3
Sequencing of human genomes with nanopore technology.纳米孔技术测序人类基因组。
Nat Commun. 2019 Apr 23;10(1):1869. doi: 10.1038/s41467-019-09637-5.
4
Resolving the full spectrum of human genome variation using Linked-Reads.利用连接读取技术解析人类基因组变异的全貌。
Genome Res. 2019 Apr;29(4):635-645. doi: 10.1101/gr.234443.118. Epub 2019 Mar 20.
5
Sliding window haplotype approaches overcome single SNP analysis limitations in identifying genes for meat tenderness in Nelore cattle.滑动窗口单体型方法克服了单 SNP 分析在鉴定内罗尔牛嫩度相关基因方面的局限性。
BMC Genet. 2019 Jan 14;20(1):8. doi: 10.1186/s12863-019-0713-4.
6
1000 Bull Genomes Project to Map Simple and Complex Genetic Traits in Cattle: Applications and Outcomes.“1000 头公牛基因组计划”旨在对牛的简单和复杂遗传特征进行定位:应用与成果。
Annu Rev Anim Biosci. 2019 Feb 15;7:89-102. doi: 10.1146/annurev-animal-020518-115024. Epub 2019 Dec 3.
7
Chromosomal-level assembly of yellow catfish genome using third-generation DNA sequencing and Hi-C analysis.利用第三代DNA测序和Hi-C分析对黄颡鱼基因组进行染色体水平组装
Gigascience. 2018 Nov 1;7(11):giy120. doi: 10.1093/gigascience/giy120.
8
Comparison of phasing strategies for whole human genomes.全人类基因组相位策略比较。
PLoS Genet. 2018 Apr 5;14(4):e1007308. doi: 10.1371/journal.pgen.1007308. eCollection 2018 Apr.
9
Assembly of 913 microbial genomes from metagenomic sequencing of the cow rumen.通过对奶牛瘤胃宏基因组测序组装出913个微生物基因组。
Nat Commun. 2018 Feb 28;9(1):870. doi: 10.1038/s41467-018-03317-6.
10
A genome-wide interactome of DNA-associated proteins in the human liver.人类肝脏中与 DNA 相关的蛋白质的全基因组互作组。
Genome Res. 2017 Nov;27(11):1950-1960. doi: 10.1101/gr.222083.117. Epub 2017 Oct 11.