DNA-m6A calling and integrated long-read epigenetic and genetic analysis with fibertools.

Affiliations

Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA.

Division of Medical Genetics, University of Washington, Seattle, Washington 98195, USA.

Publication information

Genome Res. 2024 Nov 20;34(11):1976-1986. doi: 10.1101/gr.279095.124.

PMID:38849157

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11610455/

Abstract

Long-read DNA sequencing has recently emerged as a powerful tool for studying both genetic and epigenetic architectures at single-molecule and single-nucleotide resolution. Long-read epigenetic studies encompass both the direct identification of native cytosine methylation and the identification of exogenously placed DNA N6-methyladenine (DNA-m6A). However, detecting DNA-m6A modifications using single-molecule sequencing, as well as coprocessing single-molecule genetic and epigenetic architectures, is limited by computational demands and a lack of supporting tools. Here, we introduce fibertools, a state-of-the-art toolkit that features a semisupervised convolutional neural network for fast and accurate identification of m6A-marked bases using Pacific Biosciences (PacBio) single-molecule long-read sequencing, as well as the coprocessing of long-read genetic and epigenetic data produced using either the PacBio or Oxford Nanopore Technologies (ONT) sequencing platforms. We demonstrate accurate DNA-m6A identification (>90% precision and recall) along >20 kb long DNA molecules with an ∼1000-fold improvement in speed. In addition, we demonstrate that fibertools can readily integrate genetic and epigenetic data at single-molecule resolution, including the seamless conversion between molecular and reference coordinate systems, allowing for accurate genetic and epigenetic analyses of long-read data within structurally and somatically variable genomic regions.
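The conversion between molecular and reference coordinate systems mentioned in the abstract can be illustrated with a small sketch. The code below is not the toolkit's implementation and does not use its API; it is a minimal, assumed example of the general technique of walking a SAM-style CIGAR string to lift 0-based positions on a sequenced molecule (for example, m6A calls) onto reference coordinates. The function name `lift_to_reference` and the example alignment are hypothetical, and the sketch assumes positions are already expressed in the orientation of the aligned read.

```python
import re
from typing import Dict, Iterable, Optional

# One CIGAR operation: a length followed by a SAM operation code.
CIGAR_RE = re.compile(r"(\d+)([MIDNSHP=X])")


def lift_to_reference(
    ref_start: int, cigar: str, molecule_positions: Iterable[int]
) -> Dict[int, Optional[int]]:
    """Map 0-based molecule (read) positions to 0-based reference positions.

    Positions that fall in insertions or soft clips have no reference
    coordinate and are mapped to None. Hypothetical helper for illustration.
    """
    wanted = set(molecule_positions)
    lifted: Dict[int, Optional[int]] = {pos: None for pos in wanted}
    read_pos, ref_pos = 0, ref_start
    for length, op in CIGAR_RE.findall(cigar):
        n = int(length)
        if op in "M=X":
            # Aligned block: consumes both read and reference bases.
            for pos in wanted:
                if read_pos <= pos < read_pos + n:
                    lifted[pos] = ref_pos + (pos - read_pos)
            read_pos += n
            ref_pos += n
        elif op in "IS":
            # Insertion or soft clip: consumes read bases only.
            read_pos += n
        elif op in "DN":
            # Deletion or skipped region: consumes reference bases only.
            ref_pos += n
        # H (hard clip) and P (padding) consume neither sequence.
    return lifted


# Made-up alignment starting at reference position 100.
example = lift_to_reference(100, "5S10M2I5M3D4M", [0, 5, 14, 15, 17, 22, 25])
print(example)
```

In this made-up alignment, molecule position 15 lies inside the 2 bp insertion and position 0 lies in the soft clip, so neither receives a reference coordinate, while positions after the 3 bp deletion are shifted accordingly. Keeping such unmapped positions explicit, rather than dropping them, is one way to preserve molecule-level information in structurally variable regions.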

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/406f/11610455/15fb8b2e56e3/1976f01.jpg

Similar articles

DNA-m6A calling and integrated long-read epigenetic and genetic analysis with fibertools.

bioRxiv. 2023 Dec 11:2023.04.20.537673. doi: 10.1101/2023.04.20.537673.

Comparison of Illumina and Oxford Nanopore Technology systems for the genomic characterization of .

Microbiol Spectr. 2025 Jul;13(7):e0129424. doi: 10.1128/spectrum.01294-24. Epub 2025 May 28.

SAKit: An all-in-one analysis pipeline for identifying novel proteins resulting from variant events at both large and small scales.

J Bioinform Comput Biol. 2024 Oct;22(5):2450022. doi: 10.1142/S0219720024500227. Epub 2024 Oct 1.

NANOME: A Nextflow pipeline for haplotype-aware allele-specific consensus DNA methylation detection by nanopore long-read sequencing.

bioRxiv. 2025 Jul 4:2025.06.29.662079. doi: 10.1101/2025.06.29.662079.

Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.

Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.

Accurate and reproducible whole-genome genotyping for bacterial genomic surveillance with Nanopore sequencing data.

J Clin Microbiol. 2025 Jul 9;63(7):e0036925. doi: 10.1128/jcm.00369-25. Epub 2025 Jun 13.

An open-source nanopore-only sequencing workflow for analysis of clonal outbreaks delivers short-read level accuracy.

J Clin Microbiol. 2025 Jul 18:e0066425. doi: 10.1128/jcm.00664-25.

SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification.

Genome Res. 2018 Mar 1;28(3):396-411. doi: 10.1101/gr.222976.117.

Optimizing fungal DNA extraction and purification for Oxford Nanopore untargeted shotgun metagenomic sequencing from simulated hemoculture specimens.

mSystems. 2025 Jun 17;10(6):e0116624. doi: 10.1128/msystems.01166-24. Epub 2025 Apr 8.

Cited by

CUT&TIME captures the history of open chromatin in developing neurons.

bioRxiv. 2025 Sep 6:2025.08.29.673195. doi: 10.1101/2025.08.29.673195.

A haplotype-resolved view of human gene regulation.

bioRxiv. 2025 Jun 2:2024.06.14.599122. doi: 10.1101/2024.06.14.599122.

Revealing long-range heterogeneous organization of nucleoproteins with 6mA footprinting by ipdTrimming.

Genome Biol. 2025 May 21;26(1):136. doi: 10.1186/s13059-025-03592-9.

Advancing chronic myeloid leukemia research with next-generation sequencing: potential benefits, limitations, and future clinical integration.

Hum Genet. 2025 May;144(5):481-503. doi: 10.1007/s00439-025-02745-x. Epub 2025 Apr 21.

A Hitchhiker's Guide to long-read genomic analysis.

Genome Res. 2025 Apr 14;35(4):545-558. doi: 10.1101/gr.279975.124.

Integrating Single-Molecule Sequencing and Deep Learning to Predict Haplotype-Specific 3D Chromatin Organization in a Mendelian Condition.

bioRxiv. 2025 Mar 20:2025.02.26.640261. doi: 10.1101/2025.02.26.640261.

Computational analysis of DNA methylation from long-read sequencing.

Nat Rev Genet. 2025 Mar 28. doi: 10.1038/s41576-025-00822-5.

Conservation of dichromatin organization along regional centromeres.

Cell Genom. 2025 Apr 9;5(4):100819. doi: 10.1016/j.xgen.2025.100819. Epub 2025 Mar 26.

Centromeric transposable elements and epigenetic status drive karyotypic variation in the eastern hoolock gibbon.

Cell Genom. 2025 Apr 9;5(4):100808. doi: 10.1016/j.xgen.2025.100808. Epub 2025 Mar 14.

Synchronized long-read genome, methylome, epigenome and transcriptome profiling resolve a Mendelian condition.

Nat Genet. 2025 Feb;57(2):469-479. doi: 10.1038/s41588-024-02067-0. Epub 2025 Jan 29.
