• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

2passtools:使用机器学习过滤的剪接接头的双通比对提高了长读 RNA 测序中内含子检测的准确性。

2passtools: two-pass alignment using machine-learning-filtered splice junctions increases the accuracy of intron detection in long-read RNA sequencing.

机构信息

School of Life Sciences, University of Dundee, Dow Street, Dundee, DD1 5EH, UK.

James Hutton Institute, Invergowrie, DD2 5DA, UK.

出版信息

Genome Biol. 2021 Mar 1;22(1):72. doi: 10.1186/s13059-021-02296-0.

DOI:10.1186/s13059-021-02296-0
PMID:33648554
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7919322/
Abstract

Transcription of eukaryotic genomes involves complex alternative processing of RNAs. Sequencing of full-length RNAs using long reads reveals the true complexity of processing. However, the relatively high error rates of long-read sequencing technologies can reduce the accuracy of intron identification. Here we apply alignment metrics and machine-learning-derived sequence information to filter spurious splice junctions from long-read alignments and use the remaining junctions to guide realignment in a two-pass approach. This method, available in the software package 2passtools ( https://github.com/bartongroup/2passtools ), improves the accuracy of spliced alignment and transcriptome assembly for species both with and without existing high-quality annotations.

摘要

真核生物基因组的转录涉及 RNA 的复杂可变剪接。通过长读长测序完整的 RNA 可揭示剪接过程的真实复杂性。然而,长读长测序技术的相对较高错误率可能会降低内含子识别的准确性。在这里,我们应用比对指标和基于机器学习的序列信息来过滤长读长比对中虚假的剪接位点,并使用剩余的剪接位点通过双步重新比对方法来指导重新比对。该方法可用于软件包 2passtools(https://github.com/bartongroup/2passtools),可提高具有和不具有现有高质量注释的物种的剪接比对和转录组组装的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/69f901a7822f/13059_2021_2296_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/74c86653bb34/13059_2021_2296_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/0cc2d629ad7f/13059_2021_2296_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/1370f1c29aa8/13059_2021_2296_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/7a1e70c24c4a/13059_2021_2296_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/2b32723850d6/13059_2021_2296_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/59cd1da745cf/13059_2021_2296_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/4e2f31c880d7/13059_2021_2296_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/69f901a7822f/13059_2021_2296_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/74c86653bb34/13059_2021_2296_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/0cc2d629ad7f/13059_2021_2296_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/1370f1c29aa8/13059_2021_2296_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/7a1e70c24c4a/13059_2021_2296_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/2b32723850d6/13059_2021_2296_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/59cd1da745cf/13059_2021_2296_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/4e2f31c880d7/13059_2021_2296_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630c/7919322/69f901a7822f/13059_2021_2296_Fig8_HTML.jpg

相似文献

1
2passtools: two-pass alignment using machine-learning-filtered splice junctions increases the accuracy of intron detection in long-read RNA sequencing.2passtools:使用机器学习过滤的剪接接头的双通比对提高了长读 RNA 测序中内含子检测的准确性。
Genome Biol. 2021 Mar 1;22(1):72. doi: 10.1186/s13059-021-02296-0.
2
Read-Split-Run: an improved bioinformatics pipeline for identification of genome-wide non-canonical spliced regions using RNA-Seq data.读取-分割-运行:一种利用RNA测序数据识别全基因组非经典剪接区域的改进型生物信息学流程。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):503. doi: 10.1186/s12864-016-2896-7.
3
RNA-Seq read alignments with PALMapper.使用PALMapper进行RNA-Seq读段比对。
Curr Protoc Bioinformatics. 2010 Dec;Chapter 11:Unit 11.6. doi: 10.1002/0471250953.bi1106s32.
4
RNASequel: accurate and repeat tolerant realignment of RNA-seq reads.RNASeq 序列:准确且具有重复容忍度的 RNA-seq 读段重新比对。
Nucleic Acids Res. 2015 Oct 15;43(18):e122. doi: 10.1093/nar/gkv594. Epub 2015 Jun 16.
5
HSA: a heuristic splice alignment tool.HSA:一种启发式剪接比对工具。
BMC Syst Biol. 2013;7 Suppl 2(Suppl 2):S10. doi: 10.1186/1752-0509-7-S2-S10. Epub 2013 Dec 17.
6
Human splicing diversity and the extent of unannotated splice junctions across human RNA-seq samples on the Sequence Read Archive.人类剪接多样性以及序列读取存档中人类RNA测序样本间未注释剪接位点的程度。
Genome Biol. 2016 Dec 30;17(1):266. doi: 10.1186/s13059-016-1118-6.
7
Multiplexed primer extension sequencing: A targeted RNA-seq method that enables high-precision quantitation of mRNA splicing isoforms and rare pre-mRNA splicing intermediates.多重引物延伸测序:一种靶向 RNA-seq 方法,可实现 mRNA 剪接异构体和罕见前体 mRNA 剪接中间产物的高精度定量。
Methods. 2020 Apr 1;176:34-45. doi: 10.1016/j.ymeth.2019.05.013. Epub 2019 May 21.
8
Mining Arabidopsis thaliana RNA-seq data with Integrated Genome Browser reveals stress-induced alternative splicing of the putative splicing regulator SR45a.利用 Integrated Genome Browser 挖掘拟南芥 RNA-seq 数据揭示了应激诱导的假定剪接调控因子 SR45a 的可变剪接。
Am J Bot. 2012 Feb;99(2):219-31. doi: 10.3732/ajb.1100355. Epub 2012 Jan 30.
9
SplicedFamAlign: CDS-to-gene spliced alignment and identification of transcript orthology groups. splicedFamAlign:CDS 到基因拼接对齐和转录本同源物组的鉴定。
BMC Bioinformatics. 2019 Mar 29;20(Suppl 3):133. doi: 10.1186/s12859-019-2647-2.
10
SpliceJumper: a classification-based approach for calling splicing junctions from RNA-seq data.SpliceJumper:一种基于分类法从RNA测序数据中识别剪接连接的方法。
BMC Bioinformatics. 2015;16 Suppl 17(Suppl 17):S10. doi: 10.1186/1471-2105-16-S17-S10. Epub 2015 Dec 7.

引用本文的文献

1
Transcriptomics in the era of long-read sequencing.长读长测序时代的转录组学
Nat Rev Genet. 2025 Mar 28. doi: 10.1038/s41576-025-00828-z.
2
Discovery of Novel Protein-Coding and Long Non-coding Transcripts in Distinct Regions of the Human Brain.在人类大脑不同区域发现新型蛋白质编码和长链非编码转录本
J Mol Neurosci. 2025 Mar 6;75(1):30. doi: 10.1007/s12031-025-02316-9.
3
Mitochondrial transcriptome of Candida albicans in flagranti - direct RNA sequencing reveals a new layer of information.白色念珠菌活跃状态下的线粒体转录组 - 直接 RNA 测序揭示了新的信息层。

本文引用的文献

1
Co-transcriptional splicing regulates 3' end cleavage during mammalian erythropoiesis.共转录剪接调控哺乳动物红细胞生成过程中的 3' 末端切割。
Mol Cell. 2021 Mar 4;81(5):998-1012.e7. doi: 10.1016/j.molcel.2020.12.018. Epub 2021 Jan 12.
2
Illuminating the dark side of the human transcriptome with long read transcript sequencing.利用长读转录组测序揭示人类转录组的暗面。
BMC Genomics. 2020 Oct 30;21(1):751. doi: 10.1186/s12864-020-07123-7.
3
Nanopore direct RNA sequencing maps the complexity of Arabidopsis mRNA processing and mA modification.
BMC Genomics. 2024 Sep 14;25(1):860. doi: 10.1186/s12864-024-10791-4.
4
U6 snRNA m6A modification is required for accurate and efficient splicing of C. elegans and human pre-mRNAs.U6 snRNA m6A 修饰对于秀丽隐杆线虫和人类前体 mRNA 的准确和高效剪接是必需的。
Nucleic Acids Res. 2024 Aug 27;52(15):9139-9160. doi: 10.1093/nar/gkae447.
5
The Genome-Wide Characterization of Alternative Splicing and RNA Editing in the Development of .发育过程中可变剪接和RNA编辑的全基因组特征分析 。 你提供的原文似乎不完整,句末“in the Development of.”后面缺少具体内容。
J Fungi (Basel). 2023 Sep 9;9(9):915. doi: 10.3390/jof9090915.
6
Long noncoding RNA study: Genome-wide approaches.长链非编码RNA研究:全基因组方法。
Genes Dis. 2022 Nov 29;10(6):2491-2510. doi: 10.1016/j.gendis.2022.10.024. eCollection 2023 Nov.
7
Excretory/Secretory Proteome of Females and Males of the Hookworm .钩虫雌虫和雄虫的排泄/分泌蛋白质组
Pathogens. 2023 Jan 6;12(1):95. doi: 10.3390/pathogens12010095.
8
ESPRESSO: Robust discovery and quantification of transcript isoforms from error-prone long-read RNA-seq data.ESPRESSO:从易错的长读 RNA-seq 数据中稳健地发现和定量转录本异构体。
Sci Adv. 2023 Jan 20;9(3):eabq5072. doi: 10.1126/sciadv.abq5072.
9
mA modification of U6 snRNA modulates usage of two major classes of pre-mRNA 5' splice site.U6 snRNA 的一种修饰调节了两类主要的前体 mRNA 5' 剪接位点的使用。
Elife. 2022 Nov 21;11:e78808. doi: 10.7554/eLife.78808.
10
NanoSplicer: accurate identification of splice junctions using Oxford Nanopore sequencing.NanoSplicer:使用 Oxford Nanopore 测序进行准确的剪接接头识别。
Bioinformatics. 2022 Aug 2;38(15):3741-3748. doi: 10.1093/bioinformatics/btac359.
纳米孔直接 RNA 测序绘制拟南芥 mRNA 加工和 mA 修饰的复杂性图谱。
Elife. 2020 Jan 14;9:e49658. doi: 10.7554/eLife.49658.
4
Transcriptome assembly from long-read RNA-seq alignments with StringTie2.基于长读 RNA-seq 比对的转录组组装与 StringTie2。
Genome Biol. 2019 Dec 16;20(1):278. doi: 10.1186/s13059-019-1910-1.
5
deSALT: fast and accurate long transcriptomic read alignment with de Bruijn graph-based index.deSALT:基于 de Bruijn 图的索引实现快速准确的长转录组读段比对。
Genome Biol. 2019 Dec 16;20(1):274. doi: 10.1186/s13059-019-1895-9.
6
Nanopore native RNA sequencing of a human poly(A) transcriptome.人 poly(A) 转录组的纳米孔天然 RNA 测序。
Nat Methods. 2019 Dec;16(12):1297-1305. doi: 10.1038/s41592-019-0617-2. Epub 2019 Nov 18.
7
Transcriptome profiling of mouse samples using nanopore sequencing of cDNA and RNA molecules.使用 cDNA 和 RNA 分子的纳米孔测序对小鼠样本进行转录组谱分析。
Sci Rep. 2019 Oct 17;9(1):14908. doi: 10.1038/s41598-019-51470-9.
8
Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype.基于图的基因组比对和基因分型与 HISAT2 和 HISAT-genotype。
Nat Biotechnol. 2019 Aug;37(8):907-915. doi: 10.1038/s41587-019-0201-4. Epub 2019 Aug 2.
9
PyRanges: efficient comparison of genomic intervals in Python.PyRanges:在 Python 中高效比较基因组区间。
Bioinformatics. 2020 Feb 1;36(3):918-919. doi: 10.1093/bioinformatics/btz615.
10
RNA sequencing: the teenage years.RNA 测序:青少年时期。
Nat Rev Genet. 2019 Nov;20(11):631-656. doi: 10.1038/s41576-019-0150-2. Epub 2019 Jul 24.