• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TrAnnoScope:用于全长转录组分析和功能注释的模块化Snakemake工作流程

TrAnnoScope: A Modular Snakemake Pipeline for Full-Length Transcriptome Analysis and Functional Annotation.

作者信息

Pektas Aysevil, Panitz Frank, Thomsen Bo

机构信息

Department of Molecular Biology and Genetics, Aarhus University, 8000 Aarhus, Denmark.

Applied Statistical Methods, Natural Resources Institute Finland (Luke), 20520 Turku, Finland.

出版信息

Genes (Basel). 2024 Nov 29;15(12):1547. doi: 10.3390/genes15121547.

DOI:10.3390/genes15121547
PMID:39766814
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11727683/
Abstract

: Transcriptome assembly and functional annotation are essential in understanding gene expression and biological function. Nevertheless, many existing pipelines lack the flexibility to integrate both short- and long-read sequencing data or fail to provide a complete, customizable workflow for transcriptome analysis, particularly for non-model organisms. : We present TrAnnoScope, a transcriptome analysis pipeline designed to process Illumina short-read and PacBio long-read data. The pipeline provides a complete, customizable workflow to generate high-quality, full-length (FL) transcripts with broad functional annotation. Its modular design allows users to adapt specific analysis steps for other sequencing platforms or data types. The pipeline encompasses steps from quality control to functional annotation, employing tools and established databases such as SwissProt, Pfam, Gene Ontology (GO), the Kyoto Encyclopedia of Genes and Genomes (KEGG), and Eukaryotic Orthologous Groups (KOG). As a case study, TrAnnoScope was applied to RNA-Seq and Iso-Seq data from zebra finch brain, ovary, and testis tissue. : The zebra finch transcriptome generated by TrAnnoScope from the brain, ovary, and testis tissue demonstrated strong alignment with the reference genome (99.63%), and it was found that 93.95% of the matched protein sequences in the zebra finch proteome were captured as nearly complete. Functional annotation provided matches to known protein databases and assigned relevant functional terms to the majority of the transcripts. : TrAnnoScope successfully integrates short and long sequencing technologies to generate transcriptomes with minimal user input. Its modularity and ease of use make it a valuable tool for researchers analyzing complex datasets, particularly for non-model organisms.

摘要

转录组组装和功能注释对于理解基因表达和生物学功能至关重要。然而,许多现有的流程缺乏整合短读长和长读长测序数据的灵活性,或者未能提供完整的、可定制的转录组分析工作流程,特别是对于非模式生物。

我们提出了TrAnnoScope,这是一种转录组分析流程,旨在处理Illumina短读长和PacBio长读长数据。该流程提供了一个完整的、可定制的工作流程,以生成具有广泛功能注释的高质量全长(FL)转录本。其模块化设计允许用户针对其他测序平台或数据类型调整特定的分析步骤。该流程涵盖了从质量控制到功能注释的各个步骤,使用了诸如SwissProt、Pfam、基因本体(GO)、京都基因与基因组百科全书(KEGG)以及真核直系同源组(KOG)等工具和既定数据库。作为一个案例研究,TrAnnoScope被应用于斑胸草雀脑、卵巢和睾丸组织的RNA测序和全长转录组测序(Iso-Seq)数据。

TrAnnoScope从脑、卵巢和睾丸组织生成的斑胸草雀转录组与参考基因组显示出高度的比对(99.63%),并且发现斑胸草雀蛋白质组中93.95%的匹配蛋白质序列被捕获为几乎完整。功能注释提供了与已知蛋白质数据库的匹配,并为大多数转录本赋予了相关的功能术语。

TrAnnoScope成功整合了短读长和长读长测序技术,只需最少的用户输入就能生成转录组。其模块化和易用性使其成为研究人员分析复杂数据集,特别是非模式生物的有价值工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/9f77fd6f376d/genes-15-01547-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/902c11868e91/genes-15-01547-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/b89442b36842/genes-15-01547-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/1b1ca6939dc8/genes-15-01547-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/cb8dd87cee47/genes-15-01547-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/57e95f8920d9/genes-15-01547-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/9f77fd6f376d/genes-15-01547-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/902c11868e91/genes-15-01547-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/b89442b36842/genes-15-01547-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/1b1ca6939dc8/genes-15-01547-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/cb8dd87cee47/genes-15-01547-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/57e95f8920d9/genes-15-01547-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3589/11727683/9f77fd6f376d/genes-15-01547-g006.jpg

相似文献

1
TrAnnoScope: A Modular Snakemake Pipeline for Full-Length Transcriptome Analysis and Functional Annotation.TrAnnoScope:用于全长转录组分析和功能注释的模块化Snakemake工作流程
Genes (Basel). 2024 Nov 29;15(12):1547. doi: 10.3390/genes15121547.
2
Illuminating the dark side of the human transcriptome with long read transcript sequencing.利用长读转录组测序揭示人类转录组的暗面。
BMC Genomics. 2020 Oct 30;21(1):751. doi: 10.1186/s12864-020-07123-7.
3
A Full-Length mRNA Transcriptome Generated From Hybrid-Corrected PacBio Long-Reads Improves the Transcript Annotation and Identifies Thousands of Novel Splice Variants in Atlantic Salmon.通过混合校正的PacBio长读长生成的全长mRNA转录组改善了转录本注释并鉴定了大西洋鲑鱼中数千种新的剪接变体。
Front Genet. 2021 Apr 27;12:656334. doi: 10.3389/fgene.2021.656334. eCollection 2021.
4
Improved zebra finch brain transcriptome identifies novel proteins with sex differences.改良的斑马雀脑转录组鉴定出具有性别差异的新型蛋白质。
Gene. 2022 Nov 15;843:146803. doi: 10.1016/j.gene.2022.146803. Epub 2022 Aug 9.
5
HPC-T-Assembly: a pipeline for de novo transcriptome assembly of large multi-specie datasets.HPC-T-Assembly:一种用于大型多物种数据集从头转录组组装的流程。
BMC Bioinformatics. 2025 Apr 28;26(1):113. doi: 10.1186/s12859-025-06121-4.
6
RNA-Seq in Nonmodel Organisms.非模式生物的 RNA-Seq。
Methods Mol Biol. 2021;2243:143-167. doi: 10.1007/978-1-0716-1103-6_8.
7
PARRoT- a homology-based strategy to quantify and compare RNA-sequencing from non-model organisms.PARRoT——一种基于同源性的策略,用于量化和比较非模式生物的RNA测序。
BMC Bioinformatics. 2016 Dec 22;17(Suppl 19):513. doi: 10.1186/s12859-016-1366-1.
8
A high-quality annotated transcriptome of swine peripheral blood.猪外周血的高质量注释转录组。
BMC Genomics. 2017 Jun 24;18(1):479. doi: 10.1186/s12864-017-3863-7.
9
transXpress: a Snakemake pipeline for streamlined de novo transcriptome assembly and annotation.transXpress:用于简化从头转录组组装和注释的 SnakeMake 管道。
BMC Bioinformatics. 2023 Apr 4;24(1):133. doi: 10.1186/s12859-023-05254-8.
10
Comparative transcriptome analysis of ovary and testis reveals potential sex-related genes and pathways in spotted knifejaw Oplegnathus punctatus.斑石鲷卵巢和精巢的比较转录组分析揭示潜在的性别相关基因和通路
Gene. 2017 Dec 30;637:203-210. doi: 10.1016/j.gene.2017.09.055. Epub 2017 Sep 27.

本文引用的文献

1
Trans2express - de novo transcriptome assembly pipeline optimized for gene expression analysis.Trans2express - 针对基因表达分析优化的从头转录组组装流程。
Plant Methods. 2024 Aug 17;20(1):128. doi: 10.1186/s13007-024-01255-7.
2
TAGET: a toolkit for analyzing full-length transcripts from long-read sequencing.TAGET:用于分析长读测序全长转录本的工具包。
Nat Commun. 2023 Sep 23;14(1):5935. doi: 10.1038/s41467-023-41649-0.
3
IsoTools: a flexible workflow for long-read transcriptome sequencing analysis.IsoTools:一种用于长读转录组测序分析的灵活工作流程。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad364.
4
Error analysis of the PacBio sequencing CCS reads.CCS 读段 PacBio 测序错误分析。
Int J Biostat. 2023 May 8;19(2):439-453. doi: 10.1515/ijb-2021-0091. eCollection 2023 Nov 1.
5
transXpress: a Snakemake pipeline for streamlined de novo transcriptome assembly and annotation.transXpress:用于简化从头转录组组装和注释的 SnakeMake 管道。
BMC Bioinformatics. 2023 Apr 4;24(1):133. doi: 10.1186/s12859-023-05254-8.
6
RNA-seq data science: From raw data to effective interpretation.RNA测序数据科学:从原始数据到有效解读
Front Genet. 2023 Mar 13;14:997383. doi: 10.3389/fgene.2023.997383. eCollection 2023.
7
Revealing the History and Mystery of RNA-Seq.揭示RNA测序的历史与奥秘
Curr Issues Mol Biol. 2023 Feb 24;45(3):1860-1874. doi: 10.3390/cimb45030120.
8
nf-core/isoseq: simple gene and isoform annotation with PacBio Iso-Seq long-read sequencing.nf-core/isoseq:使用 PacBio Iso-Seq 长读测序进行简单的基因和异构体注释。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad150.
9
The Gene Ontology knowledgebase in 2023.2023 版基因本体论知识库。
Genetics. 2023 May 4;224(1). doi: 10.1093/genetics/iyad031.
10
The hitchhikers' guide to RNA sequencing and functional analysis.RNA 测序和功能分析的搭便车指南。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac529.