A new compression strategy to reduce the size of nanopore sequencing data.

Authors

Jayasooriya Kavindu, Jenner Sasha P, Marasinghe Pasindu, Senanayake Udith, Saadat Hassaan, Taubman David, Ragel Roshan, Gamaarachchi Hasindu, Deveson Ira W

Affiliations

Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales 2010, Australia.

Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Sydney, New South Wales 2010, Australia.

Publication information

Genome Res. 2025 Jul 1;35(7):1574-1582. doi: 10.1101/gr.280090.124.

DOI:10.1101/gr.280090.124
PMID:40374535
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12212073/

Abstract

Nanopore sequencing is an increasingly central tool for genomics. Despite rapid advances in the field, large data volumes and computational bottlenecks continue to pose major challenges. Here, we introduce ex-zd, a new data compression strategy that helps address the large size of raw signal data generated during nanopore experiments. Ex-zd encompasses both a lossless compression method, which modestly outperforms all current methods for nanopore signal data compression, and a 'lossy' method, which can be used to achieve additional savings. The latter component works by reducing the number of bits used to encode signal data. We show that the three least significant bits in signal data generated on instruments from Oxford Nanopore Technologies (ONT) predominantly encode noise. Their removal reduces file sizes by half without impacting downstream analyses, including basecalling and detection of modified DNA or RNA bases. Ex-zd compression saves hundreds of gigabytes on a single ONT sequencing experiment, thereby increasing the scalability, portability, and accessibility of nanopore sequencing.
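The bit-reduction idea in the abstract can be illustrated with a minimal sketch. This is not the ex-zd codec: the helper names `drop_lsbs` and `restore_scale`, the synthetic signal, and the use of `zlib` as a stand-in compressor are all assumptions made for illustration. It only shows that discarding the three least significant bits of 16-bit samples bounds the per-sample error at 7 raw units and leaves the data easier to compress.

```python
import random
import zlib
from array import array

BITS_DROPPED = 3  # the abstract reports that the 3 least significant bits mostly encode noise


def drop_lsbs(samples, bits=BITS_DROPPED):
    """Right-shift each 16-bit sample to discard its `bits` least significant bits."""
    return array("h", (s >> bits for s in samples))


def restore_scale(truncated, bits=BITS_DROPPED):
    """Left-shift back to the original scale; the discarded bits remain zero."""
    return array("h", (s << bits for s in truncated))


# Synthetic stand-in for raw signal (hypothetical, not real ONT data):
# a repeating step pattern plus small random noise.
rng = random.Random(0)
signal = array(
    "h",
    (400 + 50 * ((i // 20) % 5) + rng.randint(-4, 4) for i in range(10_000)),
)

truncated = drop_lsbs(signal)
restored = restore_scale(truncated)

# Largest per-sample difference introduced by the truncation (at most 2**3 - 1).
max_error = max(abs(a - b) for a, b in zip(signal, restored))

# zlib is only a generic stand-in compressor used to compare sizes.
size_full = len(zlib.compress(signal.tobytes()))
size_truncated = len(zlib.compress(truncated.tobytes()))

print(f"max per-sample error: {max_error}")
print(f"compressed size, full precision: {size_full} bytes")
print(f"compressed size, 3 LSBs dropped: {size_truncated} bytes")
```

The size difference printed here depends entirely on the synthetic signal and on `zlib`, so it should not be read as a reproduction of the file-size reductions reported in the abstract.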

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3abb/12212073/818c17feb521/1574f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3abb/12212073/c35c63125545/1574f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3abb/12212073/a32f9fbda169/1574f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3abb/12212073/4c2e9cb261e3/1574f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3abb/12212073/6a045fc9673a/1574f05.jpg
