• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 Pacific Biosciences 和 Oxford Nanopore Technologies 对个体条形码 cDNA 进行测序可揭示特定于平台的错误模式。

Sequencing of individual barcoded cDNAs using Pacific Biosciences and Oxford Nanopore Technologies reveals platform-specific error patterns.

机构信息

Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg 199034, Russia.

Brain and Mind Research Institute and Center for Neurogenetics, Weill Cornell Medicine, New York, New York 10065, USA.

出版信息

Genome Res. 2022 Apr;32(4):726-737. doi: 10.1101/gr.276405.121. Epub 2022 Mar 17.

DOI:10.1101/gr.276405.121
PMID:35301264
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8997348/
Abstract

Long-read transcriptomics require understanding error sources inherent to technologies. Current approaches cannot compare methods for an individual RNA molecule. Here, we present a novel platform-comparison method that combines barcoding strategies and long-read sequencing to sequence cDNA copies representing an individual RNA molecule on both Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT). We compare these long-read pairs in terms of sequence content and isoform patterns. Although individual read pairs show high similarity, we find differences in (1) aligned length, (2) transcription start site (TSS), (3) polyadenylation site (poly(A)-site) assignment, and (4) exon-intron structures. Overall, 25% of read pairs disagree on either TSS, poly(A)-site, or splice site. Intron-chain disagreement typically arises from alignment errors of microexons and complicated splice sites. Our single-molecule technology comparison reveals that inconsistencies are often caused by sequencing error-induced inaccurate ONT alignments, especially to downstream GUNNGU donor motifs. However, annotation-disagreeing upstream shifts in NAGNAG acceptors in ONT are often confirmed by PacBio and are thus likely real. In both barcoded and nonbarcoded ONT reads, we find that intron number and proximity of GU/AGs better predict inconsistencies with the annotation than read quality alone. We summarize these findings in an annotation-based algorithm for spliced alignment correction that improves subsequent transcript construction with ONT reads.

摘要

长读转录组学需要了解技术固有的误差源。目前的方法无法比较单个 RNA 分子的方法。在这里,我们提出了一种新的平台比较方法,该方法结合了条形码策略和长读测序,以对单个 RNA 分子的 cDNA 拷贝进行测序,这些拷贝代表了太平洋生物科学公司(PacBio)和牛津纳米孔技术公司(ONT)的两种技术。我们根据序列内容和同工型模式对这些长读对进行比较。尽管单个读对显示出高度的相似性,但我们发现了以下差异:(1)比对长度,(2)转录起始位点(TSS),(3)多聚腺苷酸化位点(poly(A)-site)分配,以及(4)外显子-内含子结构。总体而言,25%的读对在 TSS、poly(A)-site 或剪接位点上存在分歧。内含子链的分歧通常是由于微外显子和复杂剪接位点的比对错误引起的。我们的单分子技术比较表明,不一致通常是由测序错误引起的 ONT 不准确比对引起的,尤其是下游 GUNNGU 供体基序。然而,ONT 中注释不一致的上游 NAGNAG 受体的移位通常被 PacBio 证实,因此可能是真实的。在带条形码和不带条形码的 ONT 读取中,我们发现,内含子数量和 GU/AG 的接近程度比单独的读取质量更能预测与注释的不一致性。我们将这些发现总结在一个基于注释的剪接对齐校正算法中,该算法可以提高使用 ONT 读取进行后续转录本构建的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adfb/8997348/527a12376790/726f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adfb/8997348/751652eae9eb/726f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adfb/8997348/b2b04a7fe3a3/726f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adfb/8997348/2e9bcd2a058e/726f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adfb/8997348/4e09eb274df8/726f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adfb/8997348/527a12376790/726f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adfb/8997348/751652eae9eb/726f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adfb/8997348/b2b04a7fe3a3/726f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adfb/8997348/2e9bcd2a058e/726f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adfb/8997348/4e09eb274df8/726f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adfb/8997348/527a12376790/726f05.jpg

相似文献

1
Sequencing of individual barcoded cDNAs using Pacific Biosciences and Oxford Nanopore Technologies reveals platform-specific error patterns.使用 Pacific Biosciences 和 Oxford Nanopore Technologies 对个体条形码 cDNA 进行测序可揭示特定于平台的错误模式。
Genome Res. 2022 Apr;32(4):726-737. doi: 10.1101/gr.276405.121. Epub 2022 Mar 17.
2
Improving nanopore read accuracy with the R2C2 method enables the sequencing of highly multiplexed full-length single-cell cDNA.利用 R2C2 方法提高纳米孔读取准确性,使高度多重化全长单细胞 cDNA 的测序成为可能。
Proc Natl Acad Sci U S A. 2018 Sep 25;115(39):9726-9731. doi: 10.1073/pnas.1806447115. Epub 2018 Sep 10.
3
Comparison of the two up-to-date sequencing technologies for genome assembly: HiFi reads of Pacific Biosciences Sequel II system and ultralong reads of Oxford Nanopore.比较两种最新的基因组组装测序技术:太平洋生物科学测序仪二代系统的 HiFi 读取和牛津纳米孔的超长读取。
Gigascience. 2020 Dec 15;9(12). doi: 10.1093/gigascience/giaa123.
4
The newest Oxford Nanopore R10.4.1 full-length 16S rRNA sequencing enables the accurate resolution of species-level microbial community profiling.最新的牛津纳米孔 R10.4.1 全长 16S rRNA 测序可实现精确解析物种水平的微生物群落组成。
Appl Environ Microbiol. 2023 Oct 31;89(10):e0060523. doi: 10.1128/aem.00605-23. Epub 2023 Oct 6.
5
Evaluation of tools for long read RNA-seq splice-aware alignment.长读 RNA-seq 剪接感知比对工具评估。
Bioinformatics. 2018 Mar 1;34(5):748-754. doi: 10.1093/bioinformatics/btx668.
6
High-accuracy long-read amplicon sequences using unique molecular identifiers with Nanopore or PacBio sequencing.使用独特分子标识符结合纳米孔或PacBio测序的高精度长读长扩增子序列。
Nat Methods. 2021 Feb;18(2):165-169. doi: 10.1038/s41592-020-01041-y. Epub 2021 Jan 11.
7
Comparison of ONT and CCS sequencing technologies on the polyploid genome of a medicinal plant showed that high error rate of ONT reads are not suitable for self-correction.对一种药用植物多倍体基因组上的纳米孔测序(ONT)技术和环形一致序列(CCS)测序技术进行比较后发现,ONT读数的高错误率不适用于自我校正。
Chin Med. 2022 Aug 9;17(1):94. doi: 10.1186/s13020-022-00644-1.
8
Analysis and comprehensive comparison of PacBio and nanopore-based RNA sequencing of the transcriptome.基于PacBio和纳米孔的转录组RNA测序分析与综合比较。
Plant Methods. 2020 Jun 12;16:85. doi: 10.1186/s13007-020-00629-x. eCollection 2020.
9
Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes.比较长读测序技术在细菌和果蝇基因组分析中的应用。
G3 (Bethesda). 2021 Jun 17;11(6). doi: 10.1093/g3journal/jkab083.
10
Comparison of Illumina and Oxford Nanopore Technology for genome analysis of Francisella tularensis, Bacillus anthracis, and Brucella suis.Illumina 与 Oxford Nanopore 技术在分析土拉弗朗西斯菌、炭疽芽孢杆菌和猪布鲁氏菌基因组中的比较。
BMC Genomics. 2023 May 12;24(1):258. doi: 10.1186/s12864-023-09343-z.

引用本文的文献

1
Perplexity as a Metric for Isoform Diversity in the Human Transcriptome.将困惑度作为人类转录组中异构体多样性的一个指标。
bioRxiv. 2025 Jul 2:2025.07.02.662769. doi: 10.1101/2025.07.02.662769.
2
Ecological Impacts of Sunflowers on Soil Microbial Communities: Insights from Full-Length 16S rRNA Sequencing.向日葵对土壤微生物群落的生态影响:基于全长16S rRNA测序的见解
Curr Microbiol. 2025 May 21;82(7):297. doi: 10.1007/s00284-025-04273-3.
3
A systematic benchmark of Nanopore long-read RNA sequencing for transcript-level analysis in human cell lines.

本文引用的文献

1
Systematic assessment of long-read RNA-seq methods for transcript identification and quantification.系统评估长读 RNA-seq 方法在转录本鉴定和定量中的应用。
Nat Methods. 2024 Jul;21(7):1349-1363. doi: 10.1038/s41592-024-02298-3. Epub 2024 Jun 7.
2
Single-nuclei isoform RNA sequencing unlocks barcoded exon connectivity in frozen brain tissue.单细胞异构体 RNA 测序揭示了冷冻脑组织中带有条形码的外显子连接性。
Nat Biotechnol. 2022 Jul;40(7):1082-1092. doi: 10.1038/s41587-022-01231-3. Epub 2022 Mar 7.
3
Accurate spliced alignment of long RNA sequencing reads.
用于人类细胞系转录本水平分析的纳米孔长读长RNA测序的系统基准测试。
Nat Methods. 2025 Apr;22(4):801-812. doi: 10.1038/s41592-025-02623-4. Epub 2025 Mar 13.
4
Discovery of Novel Protein-Coding and Long Non-coding Transcripts in Distinct Regions of the Human Brain.在人类大脑不同区域发现新型蛋白质编码和长链非编码转录本
J Mol Neurosci. 2025 Mar 6;75(1):30. doi: 10.1007/s12031-025-02316-9.
5
Challenges in identifying mRNA transcript starts and ends from long-read sequencing data.从长读测序数据中识别 mRNA 转录本起始和结束的挑战。
Genome Res. 2024 Nov 20;34(11):1719-1734. doi: 10.1101/gr.279559.124.
6
Understanding isoform expression by pairing long-read sequencing with single-cell and spatial transcriptomics.通过将长读测序与单细胞和空间转录组学相结合来理解异构体表达。
Genome Res. 2024 Nov 20;34(11):1735-1746. doi: 10.1101/gr.279640.124.
7
SpaDiT: diffusion transformer for spatial gene expression prediction using scRNA-seq.SpaDiT:基于 scRNA-seq 的空间基因表达预测扩散转换器。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae571.
8
Contrasting and combining transcriptome complexity captured by short and long RNA sequencing reads.短读长读 RNA 测序捕获的转录组复杂性的对比和组合。
Genome Res. 2024 Oct 29;34(10):1624-1635. doi: 10.1101/gr.278659.123.
9
Assessing next-generation sequencing-based computational methods for predicting transcriptional regulators with query gene sets.评估基于下一代测序的计算方法,用于预测具有查询基因集的转录调控因子。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae366.
10
Origin and maintenance of large ribosomal RNA gene repeat size in mammals.哺乳动物大亚基核糖体 RNA 基因重复大小的起源和维持。
Genetics. 2024 Sep 4;228(1). doi: 10.1093/genetics/iyae121.
长 RNA 测序reads 的精确拼接比对。
Bioinformatics. 2021 Dec 11;37(24):4643-4651. doi: 10.1093/bioinformatics/btab540.
4
Single-molecule long-read sequencing reveals a conserved intact long RNA profile in sperm.单分子长读测序揭示精子中保守的完整长 RNA 谱。
Nat Commun. 2021 Mar 1;12(1):1361. doi: 10.1038/s41467-021-21524-6.
5
2passtools: two-pass alignment using machine-learning-filtered splice junctions increases the accuracy of intron detection in long-read RNA sequencing.2passtools:使用机器学习过滤的剪接接头的双通比对提高了长读 RNA 测序中内含子检测的准确性。
Genome Biol. 2021 Mar 1;22(1):72. doi: 10.1186/s13059-021-02296-0.
6
Author Correction: Error correction enables use of Oxford Nanopore technology for reference-free transcriptome analysis.作者更正:纠错使牛津纳米孔技术能够用于无参考转录组分析。
Nat Commun. 2021 Feb 8;12(1):992. doi: 10.1038/s41467-021-21424-9.
7
A spatially resolved brain region- and cell type-specific isoform atlas of the postnatal mouse brain.一个具有空间分辨率的、针对出生后小鼠大脑的脑区和细胞类型特异性异构体图谱。
Nat Commun. 2021 Jan 19;12(1):463. doi: 10.1038/s41467-020-20343-5.
8
GENCODE 2021.GENCODE 2021.
Nucleic Acids Res. 2021 Jan 8;49(D1):D916-D923. doi: 10.1093/nar/gkaa1087.
9
Ultralow-input single-tube linked-read library method enables short-read second-generation sequencing systems to routinely generate highly accurate and economical long-range sequencing information.超微量单管连接读取文库方法使短读长第二代测序系统能够常规地生成高度准确和经济的长程测序信息。
Genome Res. 2020 Jun;30(6):898-909. doi: 10.1101/gr.260380.119. Epub 2020 Jun 15.
10
Analysis and comprehensive comparison of PacBio and nanopore-based RNA sequencing of the transcriptome.基于PacBio和纳米孔的转录组RNA测序分析与综合比较。
Plant Methods. 2020 Jun 12;16:85. doi: 10.1186/s13007-020-00629-x. eCollection 2020.