• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

鉴定和纠正端粒纳米孔测序中的重复调用错误。

Identifying and correcting repeat-calling errors in nanopore sequencing of telomeres.

机构信息

Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA.

Cancer Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

出版信息

Genome Biol. 2022 Aug 26;23(1):180. doi: 10.1186/s13059-022-02751-6.

DOI:10.1186/s13059-022-02751-6
PMID:36028900
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9414165/
Abstract

Nanopore long-read sequencing is an emerging approach for studying genomes, including long repetitive elements like telomeres. Here, we report extensive basecalling induced errors at telomere repeats across nanopore datasets, sequencing platforms, basecallers, and basecalling models. We find that telomeres in many organisms are frequently miscalled. We demonstrate that tuning of nanopore basecalling models leads to improved recovery and analysis of telomeric regions, with minimal negative impact on other genomic regions. We highlight the importance of verifying nanopore basecalls in long, repetitive, and poorly defined regions, and showcase how artefacts can be resolved by improvements in nanopore basecalling models.

摘要

纳米孔长读测序是一种新兴的研究基因组的方法,包括端粒等长重复元件。在这里,我们报告了在纳米孔数据集、测序平台、碱基调用器和碱基调用模型中,端粒重复序列的广泛碱基调用诱导错误。我们发现许多生物的端粒经常被误报。我们证明,纳米孔碱基调用模型的调整可以改善端粒区域的恢复和分析,而对其他基因组区域的负面影响最小。我们强调了在长、重复和定义不明确的区域中验证纳米孔碱基调用的重要性,并展示了如何通过改进纳米孔碱基调用模型来解决伪影问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c0d/9414165/cffc705a32e8/13059_2022_2751_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c0d/9414165/2dcb753a7fc6/13059_2022_2751_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c0d/9414165/cffc705a32e8/13059_2022_2751_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c0d/9414165/2dcb753a7fc6/13059_2022_2751_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c0d/9414165/cffc705a32e8/13059_2022_2751_Fig2_HTML.jpg

相似文献

1
Identifying and correcting repeat-calling errors in nanopore sequencing of telomeres.鉴定和纠正端粒纳米孔测序中的重复调用错误。
Genome Biol. 2022 Aug 26;23(1):180. doi: 10.1186/s13059-022-02751-6.
2
Estimated Nucleotide Reconstruction Quality Symbols of Basecalling Tools for Oxford Nanopore Sequencing.用于 Oxford Nanopore 测序的碱基调用工具的核苷酸重建质量符号估计。
Sensors (Basel). 2023 Jul 29;23(15):6787. doi: 10.3390/s23156787.
3
RODAN: a fully convolutional architecture for basecalling nanopore RNA sequencing data.RODAN:一种用于纳米孔 RNA 测序数据碱基调用的全卷积架构。
BMC Bioinformatics. 2022 Apr 20;23(1):142. doi: 10.1186/s12859-022-04686-y.
4
SACall: A Neural Network Basecaller for Oxford Nanopore Sequencing Data Based on Self-Attention Mechanism.SACall:基于自注意力机制的牛津纳米孔测序数据的神经网络碱基调用程序。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jan-Feb;19(1):614-623. doi: 10.1109/TCBB.2020.3039244. Epub 2022 Feb 3.
5
Benchmarking reveals superiority of deep learning variant callers on bacterial nanopore sequence data.基准测试显示深度学习变异调用程序在细菌纳米孔测序数据上的优越性。
Elife. 2024 Oct 10;13:RP98300. doi: 10.7554/eLife.98300.
6
Performance of neural network basecalling tools for Oxford Nanopore sequencing.基于神经网络的牛津纳米孔测序碱基调用工具的性能。
Genome Biol. 2019 Jun 24;20(1):129. doi: 10.1186/s13059-019-1727-y.
7
Beyond sequencing: machine learning algorithms extract biology hidden in Nanopore signal data.超越测序:机器学习算法从纳米孔信号数据中提取隐藏的生物学信息。
Trends Genet. 2022 Mar;38(3):246-257. doi: 10.1016/j.tig.2021.09.001. Epub 2021 Oct 25.
8
NPGREAT: assembly of human subtelomere regions with the use of ultralong nanopore reads and linked-reads.NPGREAT:利用超长纳米孔读取和链接读取组装人类亚端粒区域。
BMC Bioinformatics. 2022 Dec 16;23(1):545. doi: 10.1186/s12859-022-05081-3.
9
NanoReviser: An Error-Correction Tool for Nanopore Sequencing Based on a Deep Learning Algorithm.NanoReviser:一种基于深度学习算法的纳米孔测序纠错工具。
Front Genet. 2020 Aug 12;11:900. doi: 10.3389/fgene.2020.00900. eCollection 2020.
10
DeepRepeat: direct quantification of short tandem repeats on signal data from nanopore sequencing.DeepRepeat:从纳米孔测序信号数据中直接定量短串联重复序列。
Genome Biol. 2022 Apr 28;23(1):108. doi: 10.1186/s13059-022-02670-6.

引用本文的文献

1
High-resolution metagenome assembly for modern long reads with myloasm.利用肌浆瘤对现代长读长进行高分辨率宏基因组组装。
bioRxiv. 2025 Sep 6:2025.09.05.674543. doi: 10.1101/2025.09.05.674543.
2
Determinants of chromosome-specific telomere lengths among 2,573 All of Us participants.2573名“我们所有人”项目参与者中特定染色体端粒长度的决定因素。
Res Sq. 2025 Aug 18:rs.3.rs-7293781. doi: 10.21203/rs.3.rs-7293781/v1.
3
Topsicle: a method for estimating telomere length from whole genome long-read sequencing data.Topsicle:一种从全基因组长读测序数据估计端粒长度的方法。

本文引用的文献

1
Ectopic expression of BOTRYTIS SUSCEPTIBLE1 reveals its function as a positive regulator of wound-induced cell death and plant susceptibility to Botrytis.异位表达 BOTRYTIS SUSCEPTIBLE1 揭示了其作为伤口诱导细胞死亡和植物对 Botrytis 易感性的正调控因子的功能。
Plant Cell. 2022 Sep 27;34(10):4105-4116. doi: 10.1093/plcell/koac206.
2
Intraspecific gene birth revealed by presence-absence variant genes in .种内基因诞生通过存在-缺失变异基因揭示于…… (原文此处不完整)
NAR Genom Bioinform. 2022 Apr 21;4(2):lqac031. doi: 10.1093/nargab/lqac031. eCollection 2022 Jun.
3
Transposon activity, local duplications and propagation of structural variants across haplotypes drive the evolution of the Drosophila S2 cell line.
bioRxiv. 2025 Jul 15:2025.07.10.664126. doi: 10.1101/2025.07.10.664126.
4
Long-read RNA-sequencing reveals transcript-specific regulation in human-derived cortical neurons.长读长RNA测序揭示了人源皮质神经元中特定转录本的调控机制。
Open Biol. 2025 Jul;15(7):250200. doi: 10.1098/rsob.250200. Epub 2025 Jul 30.
5
Genetic diversity and comparative genomics across Leishmania (Viannia) species.利什曼原虫(维扬亚属)物种间的遗传多样性与比较基因组学
Commun Biol. 2025 Jun 14;8(1):925. doi: 10.1038/s42003-025-08331-1.
6
BonoboFlow: viral genome assembly and haplotype reconstruction from nanopore reads.倭黑猩猩流程:基于纳米孔测序 reads 的病毒基因组组装与单倍型重建
Bioinform Adv. 2025 May 13;5(1):vbaf115. doi: 10.1093/bioadv/vbaf115. eCollection 2025.
7
SVCROWS: A User-Defined Tool for Interpreting Significant Structural Variants in Heterogeneous Datasets.SVCROWS:一种用于解释异构数据集中显著结构变异的用户定义工具。
bioRxiv. 2025 May 19:2025.01.24.634734. doi: 10.1101/2025.01.24.634734.
8
The fate of artificial transgenes in Acanthamoeba castellanii.人工转基因在卡氏棘阿米巴中的命运。
BMC Genomics. 2025 Apr 13;26(1):368. doi: 10.1186/s12864-025-11552-7.
9
Navigating triplet repeats sequencing: concepts, methodological challenges and perspective for Huntington's disease.解读三联体重复序列测序:概念、方法学挑战及亨廷顿舞蹈症研究前景
Nucleic Acids Res. 2025 Jan 7;53(1). doi: 10.1093/nar/gkae1155.
10
Resolution of ring chromosomes, Robertsonian translocations, and complex structural variants from long-read sequencing and telomere-to-telomere assembly.通过长读长测序和端粒到端粒组装解析环状染色体、罗伯逊易位和复杂结构变异。
Am J Hum Genet. 2024 Dec 5;111(12):2693-2706. doi: 10.1016/j.ajhg.2024.10.006. Epub 2024 Nov 8.
转座子活性、局部重复和结构变异在单倍型间的传播驱动了果蝇 S2 细胞系的进化。
BMC Genomics. 2022 Apr 7;23(1):276. doi: 10.1186/s12864-022-08472-1.
4
Extensive sequence divergence between the reference genomes of two zebrafish strains, Tuebingen and AB.两个斑马鱼品系(图宾根和 AB)的参考基因组之间存在广泛的序列差异。
Mol Ecol Resour. 2022 Aug;22(6):2148-2157. doi: 10.1111/1755-0998.13602. Epub 2022 Mar 6.
5
Phased polyploid genomes provide deeper insight into the multiple origins of domesticated Saccharomyces cerevisiae beer yeasts.分阶段的多倍体基因组为深入了解驯化的酿酒酵母啤酒酵母的多种起源提供了更深入的认识。
Curr Biol. 2022 Mar 28;32(6):1350-1361.e3. doi: 10.1016/j.cub.2022.01.068. Epub 2022 Feb 17.
6
Genetic variation at mouse and human ribosomal DNA influences associated epigenetic states.小鼠和人类核糖体 DNA 上的遗传变异影响相关的表观遗传状态。
Genome Biol. 2022 Feb 14;23(1):54. doi: 10.1186/s13059-022-02617-x.
7
CRISPR-Cas9 induces large structural variants at on-target and off-target sites in vivo that segregate across generations.CRISPR-Cas9 在体内诱导靶向和非靶向位点的大结构变体,这些变体在代际间分离。
Nat Commun. 2022 Feb 2;13(1):627. doi: 10.1038/s41467-022-28244-5.
8
Genome-wide detection of cytosine methylations in plant from Nanopore data using deep learning.基于深度学习的纳米孔数据在植物中全基因组检测胞嘧啶甲基化。
Nat Commun. 2021 Oct 13;12(1):5976. doi: 10.1038/s41467-021-26278-9.
9
Haplotype diversity and sequence heterogeneity of human telomeres.人类端粒的单倍型多样性和序列异质性。
Genome Res. 2021 Jul;31(7):1269-1279. doi: 10.1101/gr.274639.120. Epub 2021 Jun 23.
10
The structure, function and evolution of a complete human chromosome 8.完整人类 8 号染色体的结构、功能与进化
Nature. 2021 May;593(7857):101-107. doi: 10.1038/s41586-021-03420-7. Epub 2021 Apr 7.