• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过高度准确和完整的纠错实现纳米孔读取的高效组装。

Efficient assembly of nanopore reads via highly accurate and intact error correction.

机构信息

State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, #7 Jinsui Road, Tianhe District, Guangzhou, People's Republic of China.

School of Information Science and Engineering, Central South University, Changsha, 410083, People's Republic of China.

出版信息

Nat Commun. 2021 Jan 4;12(1):60. doi: 10.1038/s41467-020-20236-7.

DOI:10.1038/s41467-020-20236-7
PMID:33397900
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7782737/
Abstract

Long nanopore reads are advantageous in de novo genome assembly. However, nanopore reads usually have broad error distribution and high-error-rate subsequences. Existing error correction tools cannot correct nanopore reads efficiently and effectively. Most methods trim high-error-rate subsequences during error correction, which reduces both the length of the reads and contiguity of the final assembly. Here, we develop an error correction, and de novo assembly tool designed to overcome complex errors in nanopore reads. We propose an adaptive read selection and two-step progressive method to quickly correct nanopore reads to high accuracy. We introduce a two-stage assembler to utilize the full length of nanopore reads. Our tool achieves superior performance in both error correction and de novo assembling nanopore reads. It requires only 8122 hours to assemble a 35X coverage human genome and achieves a 2.47-fold improvement in NG50. Furthermore, our assembly of the human WERI cell line shows an NG50 of 22 Mbp. The high-quality assembly of nanopore reads can significantly reduce false positives in structure variation detection.

摘要

长纳米孔读长在从头基因组组装中具有优势。然而,纳米孔读通常具有广泛的错误分布和高错误率的序列。现有的错误校正工具不能有效地校正纳米孔读。大多数方法在错误校正过程中修剪高错误率的序列,这减少了读的长度和最终组装的连续性。在这里,我们开发了一种错误校正和从头组装工具,旨在克服纳米孔读中的复杂错误。我们提出了一种自适应读选择和两步渐进方法,以快速将纳米孔读校正到高精度。我们引入了一个两阶段的组装器来利用纳米孔读的全长。我们的工具在纳米孔读的错误校正和从头组装方面都具有优异的性能。它只需要 8122 小时就能组装一个 35X 覆盖的人类基因组,并且在 NG50 上提高了 2.47 倍。此外,我们对人类 WERI 细胞系的组装结果显示 NG50 为 22 Mbp。纳米孔读的高质量组装可以显著减少结构变异检测中的假阳性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1e0/7782737/78609b90ad2e/41467_2020_20236_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1e0/7782737/70b7db9fe10f/41467_2020_20236_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1e0/7782737/5feb463108d9/41467_2020_20236_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1e0/7782737/78609b90ad2e/41467_2020_20236_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1e0/7782737/70b7db9fe10f/41467_2020_20236_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1e0/7782737/5feb463108d9/41467_2020_20236_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1e0/7782737/78609b90ad2e/41467_2020_20236_Fig3_HTML.jpg

相似文献

1
Efficient assembly of nanopore reads via highly accurate and intact error correction.通过高度准确和完整的纠错实现纳米孔读取的高效组装。
Nat Commun. 2021 Jan 4;12(1):60. doi: 10.1038/s41467-020-20236-7.
2
De novo diploid genome assembly using long noisy reads.从头组装具有长噪声读长的二倍体基因组。
Nat Commun. 2024 Apr 5;15(1):2964. doi: 10.1038/s41467-024-47349-7.
3
Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches.用于纳米孔数据的从头组装算法基准测试揭示了重叠布局一致(OLC)方法的最佳性能。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):507. doi: 10.1186/s12864-016-2895-8.
4
Nanopore sequencing and assembly of a human genome with ultra-long reads.纳米孔测序和超长读长组装人类基因组。
Nat Biotechnol. 2018 Apr;36(4):338-345. doi: 10.1038/nbt.4060. Epub 2018 Jan 29.
5
Oxford Nanopore sequencing, hybrid error correction, and de novo assembly of a eukaryotic genome.牛津纳米孔测序、混合纠错及真核生物基因组的从头组装
Genome Res. 2015 Nov;25(11):1750-6. doi: 10.1101/gr.191395.115. Epub 2015 Oct 7.
6
NPGREAT: assembly of human subtelomere regions with the use of ultralong nanopore reads and linked-reads.NPGREAT:利用超长纳米孔读取和链接读取组装人类亚端粒区域。
BMC Bioinformatics. 2022 Dec 16;23(1):545. doi: 10.1186/s12859-022-05081-3.
7
NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads.NextDenovo:一种用于处理有噪声长读段的高效纠错和精确组装工具。
Genome Biol. 2024 Apr 26;25(1):107. doi: 10.1186/s13059-024-03252-4.
8
De novo Nanopore read quality improvement using deep learning.基于深度学习的从头纳米孔读质量改进。
BMC Bioinformatics. 2019 Nov 6;20(1):552. doi: 10.1186/s12859-019-3103-z.
9
Can we use it? On the utility of de novo and reference-based assembly of Nanopore data for plant plastome sequencing.我们可以使用它吗?从头组装和基于参考的纳米孔数据在植物质体测序中的应用。
PLoS One. 2020 Mar 24;15(3):e0226234. doi: 10.1371/journal.pone.0226234. eCollection 2020.
10
Are we there yet? Benchmarking low-coverage nanopore long-read sequencing for the assembling of mitochondrial genomes using the vulnerable silky shark Carcharhinus falciformis.我们到了吗?使用脆弱的灰鲭鲨(Carcharhinus falciformis)对低覆盖度纳米孔长读测序进行线粒体基因组组装的基准测试。
BMC Genomics. 2022 Apr 22;23(1):320. doi: 10.1186/s12864-022-08482-z.

引用本文的文献

1
Chromosome-level assembly of cv. 'Tokiwa' as a reference genome of Japanese cucumber.栽培品种‘常盘’的染色体水平组装,作为日本黄瓜的参考基因组。
Breed Sci. 2025 Apr;75(2):85-92. doi: 10.1270/jsbbs.24066. Epub 2025 Mar 27.
2
Highly contiguous genome of the medicinal plant Sarcandra glabra (Thunb.) Nakai.药用植物九节龙(学名:Sarcandra glabra (Thunb.) Nakai)的高度连续基因组。
Sci Data. 2025 Aug 28;12(1):1508. doi: 10.1038/s41597-025-05796-x.
3
MicroRNAs in long COVID: roles, diagnostic biomarker potential and detection.长新冠中的微小RNA:作用、诊断生物标志物潜力及检测

本文引用的文献

1
SMARTdenovo: a assembler using long noisy reads.SMARTdenovo:一种使用长的有噪声读段的序列拼接软件。
GigaByte. 2021 Mar 8;2021:gigabyte15. doi: 10.46471/gigabyte.15. eCollection 2021.
2
Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes.纳米孔测序和 Shasta 工具包可实现 11 个人类基因组的高效从头组装。
Nat Biotechnol. 2020 Sep;38(9):1044-1053. doi: 10.1038/s41587-020-0503-6. Epub 2020 May 4.
3
Fast and accurate long-read assembly with wtdbg2.使用 wtdbg2 实现快速准确的长读长序列组装。
Hum Genomics. 2025 Aug 13;19(1):90. doi: 10.1186/s40246-025-00810-0.
4
Hydrogen Oxidation Benefits Alphaproteobacterial Methanotrophs Under Severe Methane Limitation.在严重甲烷限制条件下,氢氧化作用对α-变形菌纲甲烷营养菌有益。
Environ Microbiol. 2025 Aug;27(8):e70163. doi: 10.1111/1462-2920.70163.
5
Identification and Pathogenicity Analysis of Qf-1 in Mink ().水貂中Qf-1的鉴定及致病性分析()。 (括号部分原文缺失具体内容)
Microorganisms. 2025 Jul 8;13(7):1604. doi: 10.3390/microorganisms13071604.
6
Benchmarking of bioinformatics tools for the hybrid assembly of human and non-human whole-genome sequencing data.用于人类和非人类全基因组测序数据混合组装的生物信息学工具的基准测试。
Comput Struct Biotechnol J. 2025 Jul 13;27:3099-3109. doi: 10.1016/j.csbj.2025.07.020. eCollection 2025.
7
Inference of antimicrobial resistance (AMR) from a whole genome database outperforming AMR gene detection.从全基因组数据库推断抗菌药物耐药性(AMR),其性能优于AMR基因检测。
iScience. 2025 Jun 20;28(8):112962. doi: 10.1016/j.isci.2025.112962. eCollection 2025 Aug 15.
8
Telomere-to-telomere genome and resequencing of 231 individuals reveal evolution, genomic footprints in Asian icefish, Protosalanx chinensis.端粒到端粒基因组及231个个体的重测序揭示了亚洲冰鱼(中华小公鱼)的进化和基因组印记。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf067.
9
The telomere-to-telomere gap-free reference genome and taxonomic reassessment of Siniperca roulei.鳜鱼的端粒到端粒无间隙参考基因组及分类学重新评估
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf068.
10
Genomic evolution and ecotype divergence in thraustochytrids: insights from comparative genomics and phylogenomics.破囊壶菌的基因组进化与生态型分化:来自比较基因组学和系统发育基因组学的见解
Front Microbiol. 2025 Jun 30;16:1608951. doi: 10.3389/fmicb.2025.1608951. eCollection 2025.
Nat Methods. 2020 Feb;17(2):155-158. doi: 10.1038/s41592-019-0669-3. Epub 2019 Dec 9.
4
Assembly of long, error-prone reads using repeat graphs.使用重复图组装长的、易错的读取。
Nat Biotechnol. 2019 May;37(5):540-546. doi: 10.1038/s41587-019-0072-8. Epub 2019 Apr 1.
5
Selective single molecule sequencing and assembly of a human Y chromosome of African origin.选择性单分子测序和组装非洲起源的人类 Y 染色体。
Nat Commun. 2019 Jan 2;10(1):4. doi: 10.1038/s41467-018-07885-5.
6
From squiggle to basepair: computational approaches for improving nanopore sequencing read accuracy.从扭曲到碱基对:提高纳米孔测序读取准确性的计算方法。
Genome Biol. 2018 Jul 13;19(1):90. doi: 10.1186/s13059-018-1462-9.
7
Minimap2: pairwise alignment for nucleotide sequences.Minimap2:核苷酸序列的两两比对。
Bioinformatics. 2018 Sep 15;34(18):3094-3100. doi: 10.1093/bioinformatics/bty191.
8
Accurate detection of complex structural variations using single-molecule sequencing.利用单分子测序技术准确检测复杂结构变异。
Nat Methods. 2018 Jun;15(6):461-468. doi: 10.1038/s41592-018-0001-7. Epub 2018 Apr 30.
9
Linear assembly of a human centromere on the Y chromosome.线性组装人类着丝粒于 Y 染色体上。
Nat Biotechnol. 2018 Apr;36(4):321-323. doi: 10.1038/nbt.4109. Epub 2018 Mar 19.
10
Nanopore sequencing and assembly of a human genome with ultra-long reads.纳米孔测序和超长读长组装人类基因组。
Nat Biotechnol. 2018 Apr;36(4):338-345. doi: 10.1038/nbt.4060. Epub 2018 Jan 29.