• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估模拟模型以模拟纳米孔测序仪和分割算法引入的扭曲。

Evaluation of simulation models to mimic the distortions introduced into squiggles by nanopore sequencers and segmentation algorithms.

机构信息

Department of Electrical and Computer Engineering, University of Calgary, Calgary, Alberta, Canada.

Department of Radiology, University of Calgary, Calgary, Alberta, Canada.

出版信息

PLoS One. 2019 Jul 18;14(7):e0219495. doi: 10.1371/journal.pone.0219495. eCollection 2019.

DOI:10.1371/journal.pone.0219495
PMID:31318901
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6638935/
Abstract

Nucleotides ratcheted through the biomolecular pores of nanopore sequencers generate raw picoamperage currents, which are segmented into step-current level signals representing the nucleotide sequence. These 'squiggles' are a noisy, distorted representation of the underlying true stepped current levels due to experimental and algorithmic factors. We were interested in developing a simulation model to support a white-box approach to identify common distortions, rather than relying on commonly used black box neural network techniques for basecalling nanopore signals. Dynamic time warped-space averaging (DTWA) techniques can generate a consensus from multiple noisy signals without introducing key feature distortions that occur with standard averaging. As a preprocessing tool, DTWA could provide cleaner and more accurate current signals for direct RNA or DNA analysis tools. However, DTWA approaches need modification to take advantage of the a-priori knowledge regarding a common, underlying gold-standard RNA / DNA sequence. Using experimental data, we derive a simulation model to provide known squiggle distortion signals to assist in validating the performance of analysis tools such as DTWA. Simulation models were evaluated by comparing mocked and experimental squiggle characteristics from one Enolase mRNA squiggle group produced by an Oxford MinION nanopore sequencer, and cross-validated using other Enolase, Sequin R1_71_1 and Sequin R2_55_3 mRNA studies. New techniques identified high inserted but low deleted base rates, generating consistent x1.7 squiggle event to base called ratios. Similar probability density and cumulative distribution functions, PDF and CDF, were found across all studies. Experimental PDFs were not the normal distributions expected if squiggle distortion arose from segmentation algorithm artefacts, or through individual nucleotides randomly interacting with individual nanopores. Matching experimental and mocked CDFs required the assumption that there are unique features associated with individual raw-current data streams. Z-normalized signal-to-noise ratios suggest intrinsic sensor limitations being responsible for half the gold standard and noisy squiggle DTW differences.

摘要

核苷酸通过纳米孔测序仪的生物分子孔棘轮产生原始皮安电流,这些电流被分割成代表核苷酸序列的阶跃电流水平信号。由于实验和算法因素,这些“扭结”是底层真实阶跃电流水平的嘈杂、扭曲表示。我们有兴趣开发一个模拟模型来支持一种白盒方法来识别常见的扭曲,而不是依赖于常用的黑盒神经网络技术来进行纳米孔信号的碱基调用。动态时间扭曲空间平均(DTWA)技术可以在不引入标准平均会产生的关键特征扭曲的情况下,从多个嘈杂信号中生成共识。作为预处理工具,DTWA 可以为直接的 RNA 或 DNA 分析工具提供更清洁、更准确的电流信号。然而,DTWA 方法需要进行修改,以利用关于常见潜在金标准 RNA/DNA 序列的先验知识。使用实验数据,我们推导出一个模拟模型,为分析工具(如 DTWA)提供已知的扭结失真信号,以协助验证其性能。通过比较由牛津 MinION 纳米孔测序仪产生的一个 Enolase mRNA 扭结组的模拟和实验扭结特征,并使用其他 Enolase、Sequin R1_71_1 和 Sequin R2_55_3 mRNA 研究进行交叉验证,对模拟模型进行了评估。新技术确定了高插入但低缺失碱基率,产生了一致的 x1.7 扭结事件到碱基调用比。在所有研究中都发现了相似的概率密度和累积分布函数(PDF 和 CDF)。如果扭结失真源于分段算法伪影或单个核苷酸与单个纳米孔随机相互作用,则实验 PDF 不应是预期的正态分布。实验和模拟 CDF 的匹配要求假设存在与单个原始电流数据流相关的独特特征。Z 归一化信号噪声比表明内在传感器限制负责金标准和嘈杂扭结 DTW 差异的一半。

相似文献

1
Evaluation of simulation models to mimic the distortions introduced into squiggles by nanopore sequencers and segmentation algorithms.评估模拟模型以模拟纳米孔测序仪和分割算法引入的扭曲。
PLoS One. 2019 Jul 18;14(7):e0219495. doi: 10.1371/journal.pone.0219495. eCollection 2019.
2
Evaluating the effectiveness of ensemble voting in improving the accuracy of consensus signals produced by various DTWA algorithms from step-current signals generated during nanopore sequencing.评估集成投票在提高各种 DTWA 算法从纳米孔测序产生的步电流信号中产生的共识信号准确性方面的有效性。
PLoS Comput Biol. 2021 Sep 10;17(9):e1009350. doi: 10.1371/journal.pcbi.1009350. eCollection 2021 Sep.
3
Evaluating the Accuracy of Consensus Nanosequencer Squiggles Generated by Dynamic Time Warp Barycentre Averaging (DBA).评估通过动态时间规整重心平均法(DBA)生成的一致性纳米测序仪波形的准确性。
Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:233-237. doi: 10.1109/EMBC.2019.8856460.
4
NanoSplicer: accurate identification of splice junctions using Oxford Nanopore sequencing.NanoSplicer:使用 Oxford Nanopore 测序进行准确的剪接接头识别。
Bioinformatics. 2022 Aug 2;38(15):3741-3748. doi: 10.1093/bioinformatics/btac359.
5
Simulation of nanopore sequencing signal data with tunable parameters.可调参数的纳米孔测序信号数据模拟。
Genome Res. 2024 Jun 25;34(5):778-783. doi: 10.1101/gr.278730.123.
6
Estimated Nucleotide Reconstruction Quality Symbols of Basecalling Tools for Oxford Nanopore Sequencing.用于 Oxford Nanopore 测序的碱基调用工具的核苷酸重建质量符号估计。
Sensors (Basel). 2023 Jul 29;23(15):6787. doi: 10.3390/s23156787.
7
Basecalling Using Joint Raw and Event Nanopore Data Sequence-to-Sequence Processing.使用联合原始和事件纳米孔数据序列到序列处理进行碱基调用。
Sensors (Basel). 2022 Mar 15;22(6):2275. doi: 10.3390/s22062275.
8
RNA Modification Detection Using Nanopore Direct RNA Sequencing and nanoDoc2.使用纳米孔直接 RNA 测序和 nanoDoc2 进行 RNA 修饰检测。
Methods Mol Biol. 2023;2632:299-319. doi: 10.1007/978-1-0716-2996-3_21.
9
Beyond sequencing: machine learning algorithms extract biology hidden in Nanopore signal data.超越测序:机器学习算法从纳米孔信号数据中提取隐藏的生物学信息。
Trends Genet. 2022 Mar;38(3):246-257. doi: 10.1016/j.tig.2021.09.001. Epub 2021 Oct 25.
10
Evaluation of Dynamic Time Warp Barycenter Averaging (DBA) for its Potential in Generating a Consensus Nanopore Signal for Genetic and Epigenetic Sequences.评估动态时间规整重心平均法(DBA)在生成遗传和表观遗传序列的一致性纳米孔信号方面的潜力。
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:2821-2824. doi: 10.1109/EMBC.2018.8512873.

引用本文的文献

1
Evaluating the effectiveness of ensemble voting in improving the accuracy of consensus signals produced by various DTWA algorithms from step-current signals generated during nanopore sequencing.评估集成投票在提高各种 DTWA 算法从纳米孔测序产生的步电流信号中产生的共识信号准确性方面的有效性。
PLoS Comput Biol. 2021 Sep 10;17(9):e1009350. doi: 10.1371/journal.pcbi.1009350. eCollection 2021 Sep.

本文引用的文献

1
Evaluation of Dynamic Time Warp Barycenter Averaging (DBA) for its Potential in Generating a Consensus Nanopore Signal for Genetic and Epigenetic Sequences.评估动态时间规整重心平均法(DBA)在生成遗传和表观遗传序列的一致性纳米孔信号方面的潜力。
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:2821-2824. doi: 10.1109/EMBC.2018.8512873.
2
The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community.牛津纳米孔MinION测序仪:将纳米孔测序技术带给基因组学界。
Genome Biol. 2016 Nov 25;17(1):239. doi: 10.1186/s13059-016-1103-0.
3
Spliced synthetic genes as internal controls in RNA sequencing experiments.
拼接合成基因作为 RNA 测序实验中的内参。
Nat Methods. 2016 Sep;13(9):792-8. doi: 10.1038/nmeth.3958. Epub 2016 Aug 8.
4
NanoOK: multi-reference alignment analysis of nanopore sequencing data, quality and error profiles.NanoOK:纳米孔测序数据的多参考比对分析、质量和错误概况
Bioinformatics. 2016 Jan 1;32(1):142-4. doi: 10.1093/bioinformatics/btv540. Epub 2015 Sep 17.
5
Decoding long nanopore sequencing reads of natural DNA.解码天然DNA的长纳米孔测序读数。
Nat Biotechnol. 2014 Aug;32(8):829-33. doi: 10.1038/nbt.2950. Epub 2014 Jun 25.
6
The potential and challenges of nanopore sequencing.纳米孔测序的潜力与挑战。
Nat Biotechnol. 2008 Oct;26(10):1146-53. doi: 10.1038/nbt.1495.