• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

精确的圆形共识长读测序提高了人类基因组变异检测和组装的准确性。

Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome.

机构信息

Pacific Biosciences, Menlo Park, CA, USA.

Google Inc., Mountain View, CA, USA.

出版信息

Nat Biotechnol. 2019 Oct;37(10):1155-1162. doi: 10.1038/s41587-019-0217-9. Epub 2019 Aug 12.

DOI:10.1038/s41587-019-0217-9
PMID:31406327
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6776680/
Abstract

The DNA sequencing technologies in use today produce either highly accurate short reads or less-accurate long reads. We report the optimization of circular consensus sequencing (CCS) to improve the accuracy of single-molecule real-time (SMRT) sequencing (PacBio) and generate highly accurate (99.8%) long high-fidelity (HiFi) reads with an average length of 13.5 kilobases (kb). We applied our approach to sequence the well-characterized human HG002/NA24385 genome and obtained precision and recall rates of at least 99.91% for single-nucleotide variants (SNVs), 95.98% for insertions and deletions <50 bp (indels) and 95.99% for structural variants. Our CCS method matches or exceeds the ability of short-read sequencing to detect small variants and structural variants. We estimate that 2,434 discordances are correctable mistakes in the 'genome in a bottle' (GIAB) benchmark set. Nearly all (99.64%) variants can be phased into haplotypes, further improving variant detection. De novo genome assembly using CCS reads alone produced a contiguous and accurate genome with a contig N50 of >15 megabases (Mb) and concordance of 99.997%, substantially outperforming assembly with less-accurate long reads.

摘要

目前使用的 DNA 测序技术要么产生高度准确的短读长,要么产生不太准确的长读长。我们报告了对环形一致性测序(CCS)的优化,以提高单分子实时(SMRT)测序(PacBio)的准确性,并生成高度准确(99.8%)的长高保真度(HiFi)读长,平均长度为 13.5 千碱基(kb)。我们应用我们的方法对特征明确的人类 HG002/NA24385 基因组进行测序,并获得至少 99.91%的单核苷酸变异(SNV)、95.98%的<50 bp 插入和缺失(indels)和 95.99%的结构变异的精度和召回率。我们的 CCS 方法与短读长测序检测小变异和结构变异的能力相匹配或超过。我们估计在“瓶中基因组”(GIAB)基准集中,2434 个不一致是可纠正的错误。几乎所有(99.64%)的变体都可以被相位成单倍型,进一步提高了变体检测的准确性。单独使用 CCS 读取进行从头基因组组装生成了一个连续且准确的基因组,其 contig N50 大于 15 兆碱基(Mb),一致性为 99.997%,明显优于使用不太准确的长读长进行的组装。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d81/6776680/1a487e011b4a/nihms-1533949-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d81/6776680/1cdc01f785b4/nihms-1533949-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d81/6776680/a859e89ce978/nihms-1533949-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d81/6776680/156642d0f986/nihms-1533949-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d81/6776680/1a487e011b4a/nihms-1533949-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d81/6776680/1cdc01f785b4/nihms-1533949-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d81/6776680/a859e89ce978/nihms-1533949-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d81/6776680/156642d0f986/nihms-1533949-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d81/6776680/1a487e011b4a/nihms-1533949-f0004.jpg

相似文献

1
Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome.精确的圆形共识长读测序提高了人类基因组变异检测和组装的准确性。
Nat Biotechnol. 2019 Oct;37(10):1155-1162. doi: 10.1038/s41587-019-0217-9. Epub 2019 Aug 12.
2
Highly accurate long reads are crucial for realizing the potential of biodiversity genomics.高质量的长读长序列对于实现生物多样性基因组学的潜力至关重要。
BMC Genomics. 2023 Mar 16;24(1):117. doi: 10.1186/s12864-023-09193-9.
3
Fully phased human genome assembly without parental data using single-cell strand sequencing and long reads.利用单细胞测序和长读长技术进行全相基因组组装,无需父母数据。
Nat Biotechnol. 2021 Mar;39(3):302-308. doi: 10.1038/s41587-020-0719-5. Epub 2020 Dec 7.
4
HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads.HiCanu:从高保真长读段中精确组装片段重复、卫星和等位基因变体。
Genome Res. 2020 Sep;30(9):1291-1305. doi: 10.1101/gr.263566.120. Epub 2020 Aug 14.
5
Benchmarking multi-platform sequencing technologies for human genome assembly.多平台测序技术在人类基因组组装中的基准测试。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad300.
6
BLR: a flexible pipeline for haplotype analysis of multiple linked-read technologies.BLR:一种用于多种链接读取技术的单倍型分析的灵活管道。
Nucleic Acids Res. 2023 Dec 11;51(22):e114. doi: 10.1093/nar/gkad1010.
7
A comprehensive investigation of metagenome assembly by linked-read sequencing.基于链接读取测序的宏基因组组装综合研究。
Microbiome. 2020 Nov 11;8(1):156. doi: 10.1186/s40168-020-00929-3.
8
De novo assembly and phasing of a Korean human genome.韩国人类基因组的从头组装和相位。
Nature. 2016 Oct 13;538(7624):243-247. doi: 10.1038/nature20098. Epub 2016 Oct 5.
9
Nanopore sequencing and assembly of a human genome with ultra-long reads.纳米孔测序和超长读长组装人类基因组。
Nat Biotechnol. 2018 Apr;36(4):338-345. doi: 10.1038/nbt.4060. Epub 2018 Jan 29.
10
Automated filtering of genome-wide large deletions through an ensemble deep learning framework.通过集成深度学习框架自动筛选全基因组大片段缺失。
Methods. 2022 Oct;206:77-86. doi: 10.1016/j.ymeth.2022.08.001. Epub 2022 Aug 28.

引用本文的文献

1
Intron retention regulates STAT2 function and predicts immunotherapy response in lung cancer.内含子保留调控STAT2功能并预测肺癌的免疫治疗反应。
bioRxiv. 2025 Aug 19:2025.08.19.671121. doi: 10.1101/2025.08.19.671121.
2
A practical guide to identifying associations between tandem repeats and complex human traits using consensus genotypes from multiple tools.利用多种工具的一致性基因型识别串联重复序列与复杂人类性状之间关联的实用指南。
Nat Protoc. 2025 Sep 1. doi: 10.1038/s41596-025-01231-y.
3
Why Are Long-Read Sequencing Methods Revolutionizing Microbiome Analysis?

本文引用的文献

1
Fast and accurate long-read assembly with wtdbg2.使用 wtdbg2 实现快速准确的长读长序列组装。
Nat Methods. 2020 Feb;17(2):155-158. doi: 10.1038/s41592-019-0669-3. Epub 2019 Dec 9.
2
Multi-platform discovery of haplotype-resolved structural variation in human genomes.多平台发现人类基因组中单体型分辨率结构变异。
Nat Commun. 2019 Apr 16;10(1):1784. doi: 10.1038/s41467-018-08148-z.
3
An open resource for accurately benchmarking small variant and reference calls.用于准确基准测试小型变体和参考调用的开放资源。
为什么长读长测序方法正在彻底改变微生物组分析?
Microorganisms. 2025 Aug 9;13(8):1861. doi: 10.3390/microorganisms13081861.
4
Genomic Evaluation of AML-Main Techniques and Novel Approaches.急性髓系白血病的基因组评估——主要技术与新方法
J Clin Med. 2025 Aug 11;14(16):5685. doi: 10.3390/jcm14165685.
5
Unveiling the Future of Infective Endocarditis Diagnosis: The Transformative Role of Metagenomic Next-Generation Sequencing in Culture-Negative Cases.揭示感染性心内膜炎诊断的未来:宏基因组下一代测序在血培养阴性病例中的变革性作用
J Epidemiol Glob Health. 2025 Aug 22;15(1):108. doi: 10.1007/s44197-025-00455-1.
6
De novo Genome Assembly Using Long Reads and Chromosome Conformation Capture.使用长读长和染色体构象捕获进行从头基因组组装
Methods Mol Biol. 2025;2935:1-27. doi: 10.1007/978-1-0716-4583-3_1.
7
Topsicle: a method for estimating telomere length from whole genome long-read sequencing data.Topsicle:一种从全基因组长读测序数据估计端粒长度的方法。
bioRxiv. 2025 Jul 15:2025.07.10.664126. doi: 10.1101/2025.07.10.664126.
8
An annotated chromosome-level genome of a troglophilic spider (Pimoa clavata).一种喜穴居蜘蛛(棒络新妇蛛)的带注释的染色体水平基因组。
Sci Data. 2025 Aug 11;12(1):1398. doi: 10.1038/s41597-025-05703-4.
9
Whole-genome variant detection in long-read sequencing data from ultra-low input patient samples.超低输入量患者样本长读长测序数据中的全基因组变异检测
medRxiv. 2025 Jul 27:2025.07.25.25332067. doi: 10.1101/2025.07.25.25332067.
10
Oatk: a de novo assembly tool for complex plant organelle genomes.Oatk:一种用于复杂植物细胞器基因组的从头组装工具。
Genome Biol. 2025 Aug 7;26(1):235. doi: 10.1186/s13059-025-03676-6.
Nat Biotechnol. 2019 May;37(5):561-566. doi: 10.1038/s41587-019-0074-6. Epub 2019 Apr 1.
4
Best practices for benchmarking germline small-variant calls in human genomes.人类基因组中小变异calls 的基准测试最佳实践。
Nat Biotechnol. 2019 May;37(5):555-560. doi: 10.1038/s41587-019-0054-x. Epub 2019 Mar 11.
5
A multi-task convolutional deep neural network for variant calling in single molecule sequencing.一种用于单分子测序中变异调用的多任务卷积深度神经网络。
Nat Commun. 2019 Mar 1;10(1):998. doi: 10.1038/s41467-019-09025-z.
6
Long-read sequence and assembly of segmental duplications.长读序列和串联重复序列的组装。
Nat Methods. 2019 Jan;16(1):88-94. doi: 10.1038/s41592-018-0236-3. Epub 2018 Dec 17.
7
A fast adaptive algorithm for computing whole-genome homology maps.一种用于计算全基因组同源图谱的快速自适应算法。
Bioinformatics. 2018 Sep 1;34(17):i748-i756. doi: 10.1093/bioinformatics/bty597.
8
De novo assembly of haplotype-resolved genomes with trio binning.利用三人分箱法对单倍型解析基因组进行从头组装。
Nat Biotechnol. 2018 Oct 22. doi: 10.1038/nbt.4277.
9
A universal SNP and small-indel variant caller using deep neural networks.使用深度神经网络的通用 SNP 和小插入缺失变体调用器。
Nat Biotechnol. 2018 Nov;36(10):983-987. doi: 10.1038/nbt.4235. Epub 2018 Sep 24.
10
A synthetic-diploid benchmark for accurate variant-calling evaluation.用于准确变异呼叫评估的合成二倍体基准。
Nat Methods. 2018 Aug;15(8):595-597. doi: 10.1038/s41592-018-0054-7. Epub 2018 Jul 16.