PyPore：一个用于纳米孔测序数据处理的 Python 工具包。

PyPore: a python toolbox for nanopore sequencing data handling.

机构信息

Department of Experimental and Clinical Medicine, University of Florence, Florence, Italy.

出版信息

Bioinformatics. 2019 Nov 1;35(21):4445-4447. doi: 10.1093/bioinformatics/btz269.

DOI:10.1093/bioinformatics/btz269

PMID:30993318

Abstract

MOTIVATION

The recent technological improvement of Oxford Nanopore sequencing pushed the throughput of these devices to 10-20 Gb allowing the generation of millions of reads. For these reasons, the availability of fast software packages for evaluating experimental quality by generating highly informative and interactive summary plots is of fundamental importance.

RESULTS

We developed PyPore, a three module python toolbox designed to handle raw FAST5 files from quality checking to alignment to a reference genome and to explore their features through the generation of browsable HTML files. The first module provides an interface to explore and evaluate the information contained in FAST5 and summarize them into informative quality measures. The second module converts raw data in FASTQ format, while the third module allows to easily use three state-of-the-art aligners and collects mapping statistics.

AVAILABILITY AND IMPLEMENTATION

PyPore is an open-source software and is written in Python2.7, source code is freely available, for all OS platforms, in Github at https://github.com/rsemeraro/PyPore.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

最近牛津纳米孔测序技术的进步将这些设备的通量推至 10-20Gb，允许生成数百万个读数。出于这些原因，开发快速软件包对于通过生成高度信息丰富和交互式摘要图来评估实验质量变得至关重要。

结果

我们开发了 PyPore，这是一个由三个模块组成的 Python 工具包，旨在从质量检查到与参考基因组对齐处理原始 FAST5 文件，并通过生成可浏览的 HTML 文件来探索它们的特征。第一个模块提供了一个接口来探索和评估 FAST5 中包含的信息，并将其总结为有用的质量度量。第二个模块将原始数据转换为 FASTQ 格式，而第三个模块允许轻松使用三种最先进的对齐器并收集映射统计信息。

可用性和实现

PyPore 是一个开源软件，用 Python2.7 编写，源代码可在所有操作系统平台上免费获取，在 Github 上的网址为 https://github.com/rsemeraro/PyPore。

补充信息

补充数据可在生物信息学在线获得。

相似文献

PyPore: a python toolbox for nanopore sequencing data handling.PyPore：一个用于纳米孔测序数据处理的 Python 工具包。

Bioinformatics. 2019 Nov 1;35(21):4445-4447. doi: 10.1093/bioinformatics/btz269.

BulkVis: a graphical viewer for Oxford nanopore bulk FAST5 files.BulkVis：用于牛津纳米孔批量 FAST5 文件的图形查看器。

Bioinformatics. 2019 Jul 1;35(13):2193-2198. doi: 10.1093/bioinformatics/bty841.

Sequoia: an interactive visual analytics platform for interpretation and feature extraction from nanopore sequencing datasets.红杉：一个用于从纳米孔测序数据集中进行解释和特征提取的交互式可视分析平台。

BMC Genomics. 2021 Jul 7;22(1):513. doi: 10.1186/s12864-021-07791-z.

NanoCLUST: a species-level analysis of 16S rRNA nanopore sequencing data.NanoCLUST：基于 16S rRNA 纳米孔测序数据的种水平分析。

Bioinformatics. 2021 Jul 12;37(11):1600-1601. doi: 10.1093/bioinformatics/btaa900.

Poretools: a toolkit for analyzing nanopore sequence data.Poretools：一个用于分析纳米孔序列数据的工具包。

Bioinformatics. 2014 Dec 1;30(23):3399-401. doi: 10.1093/bioinformatics/btu555. Epub 2014 Aug 20.

ENANO: Encoder for NANOpore FASTQ files.ENANO：用于 Nanopore FASTQ 文件的编码器。

Bioinformatics. 2020 Aug 15;36(16):4506-4507. doi: 10.1093/bioinformatics/btaa551.

Real-time mapping of nanopore raw signals.实时纳米孔原始信号映射。

Bioinformatics. 2021 Jul 12;37(Suppl_1):i477-i483. doi: 10.1093/bioinformatics/btab264.

RabbitQC: high-speed scalable quality control for sequencing data.兔 QC：测序数据的高速可扩展质量控制。

Bioinformatics. 2021 May 1;37(4):573-574. doi: 10.1093/bioinformatics/btaa719.

DeepSimulator1.5: a more powerful, quicker and lighter simulator for Nanopore sequencing.DeepSimulator1.5：一款更强大、更快速、更轻量级的纳米孔测序模拟软件。

Bioinformatics. 2020 Apr 15;36(8):2578-2580. doi: 10.1093/bioinformatics/btz963.

ModPhred: an integrative toolkit for the analysis and storage of nanopore sequencing DNA and RNA modification data.ModPhred：一个用于分析和存储纳米孔测序 DNA 和 RNA 修饰数据的集成工具包。

Bioinformatics. 2021 Dec 22;38(1):257-260. doi: 10.1093/bioinformatics/btab539.

引用本文的文献

Resolving complex structural variants via nanopore sequencing.通过纳米孔测序解析复杂结构变异

Front Genet. 2023 Aug 16;14:1213917. doi: 10.3389/fgene.2023.1213917. eCollection 2023.

High-resolution Nanopore methylome-maps reveal random hyper-methylation at CpG-poor regions as driver of chemoresistance in leukemias.高分辨率纳米孔甲基组图谱揭示了 CpG 贫乏区域的随机超甲基化是白血病化疗耐药的驱动因素。

Commun Biol. 2023 Apr 8;6(1):382. doi: 10.1038/s42003-023-04756-8.

Nanopore sequencing technology, bioinformatics and applications.纳米孔测序技术、生物信息学及其应用。

Nat Biotechnol. 2021 Nov;39(11):1348-1365. doi: 10.1038/s41587-021-01108-x. Epub 2021 Nov 8.

Probability distribution of copy number alterations along the genome: an algorithm to distinguish different tumour profiles.基因组上拷贝数改变的概率分布：一种区分不同肿瘤特征的算法。

Sci Rep. 2020 Sep 10;10(1):14868. doi: 10.1038/s41598-020-71859-1.

Versatile Quality Control Methods for Nanopore Sequencing.纳米孔测序的通用质量控制方法

Evol Bioinform Online. 2019 Jul 23;15:1176934319863068. doi: 10.1177/1176934319863068. eCollection 2019.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

PyPore：一个用于纳米孔测序数据处理的 Python 工具包。

PyPore: a python toolbox for nanopore sequencing data handling.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

SUPPLEMENTARY INFORMATION

动机

结果

可用性和实现

补充信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献