• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

transXpress:用于简化从头转录组组装和注释的 SnakeMake 管道。

transXpress: a Snakemake pipeline for streamlined de novo transcriptome assembly and annotation.

机构信息

Scripps Institution of Oceanography, UC San Diego, 9500 Gilman Dr, La Jolla, CA, 92093, USA.

Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 16000, Prague 6, Czech Republic.

出版信息

BMC Bioinformatics. 2023 Apr 4;24(1):133. doi: 10.1186/s12859-023-05254-8.

DOI:10.1186/s12859-023-05254-8
PMID:37016291
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10074830/
Abstract

BACKGROUND

RNA-seq followed by de novo transcriptome assembly has been a transformative technique in biological research of non-model organisms, but the computational processing of RNA-seq data entails many different software tools. The complexity of these de novo transcriptomics workflows therefore presents a major barrier for researchers to adopt best-practice methods and up-to-date versions of software.

RESULTS

Here we present a streamlined and universal de novo transcriptome assembly and annotation pipeline, transXpress, implemented in Snakemake. transXpress supports two popular assembly programs, Trinity and rnaSPAdes, and allows parallel execution on heterogeneous cluster computing hardware.

CONCLUSIONS

transXpress simplifies the use of best-practice methods and up-to-date software for de novo transcriptome assembly, and produces standardized output files that can be mined using SequenceServer to facilitate rapid discovery of new genes and proteins in non-model organisms.

摘要

背景

RNA-seq 结合从头转录组组装已成为非模式生物生物学研究中的一项变革性技术,但 RNA-seq 数据的计算处理需要许多不同的软件工具。因此,这些从头转录组学工作流程的复杂性对研究人员采用最佳实践方法和最新版本的软件构成了重大障碍。

结果

在这里,我们展示了一个简化和通用的从头转录组组装和注释管道 transXpress,它是在 Snakemake 中实现的。transXpress 支持两个流行的组装程序 Trinity 和 rnaSPAdes,并允许在异构集群计算硬件上并行执行。

结论

transXpress 简化了最佳实践方法和最新软件在从头转录组组装中的使用,并生成了标准化的输出文件,可以使用 SequenceServer 进行挖掘,以促进非模式生物中新基因和蛋白质的快速发现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03fe/10074830/bd6a56ff8589/12859_2023_5254_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03fe/10074830/7498ef7afdaa/12859_2023_5254_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03fe/10074830/25e81e666008/12859_2023_5254_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03fe/10074830/440a4a295ec3/12859_2023_5254_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03fe/10074830/bd6a56ff8589/12859_2023_5254_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03fe/10074830/7498ef7afdaa/12859_2023_5254_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03fe/10074830/25e81e666008/12859_2023_5254_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03fe/10074830/440a4a295ec3/12859_2023_5254_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03fe/10074830/bd6a56ff8589/12859_2023_5254_Fig4_HTML.jpg

相似文献

1
transXpress: a Snakemake pipeline for streamlined de novo transcriptome assembly and annotation.transXpress:用于简化从头转录组组装和注释的 SnakeMake 管道。
BMC Bioinformatics. 2023 Apr 4;24(1):133. doi: 10.1186/s12859-023-05254-8.
2
RNA-Seq in Nonmodel Organisms.非模式生物的 RNA-Seq。
Methods Mol Biol. 2021;2243:143-167. doi: 10.1007/978-1-0716-1103-6_8.
3
De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers.从头转录组组装:短读 RNA-Seq 组装器的全面跨物种比较。
Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz039.
4
Assembly-free rapid differential gene expression analysis in non-model organisms using DNA-protein alignment.无组装的快速差异基因表达分析在非模式生物中使用 DNA 蛋白比对。
BMC Genomics. 2022 Feb 4;23(1):97. doi: 10.1186/s12864-021-08278-7.
5
Inferring bona fide transfrags in RNA-Seq derived-transcriptome assemblies of non-model organisms.在非模式生物的RNA测序衍生转录组组装中推断真正的转录片段
BMC Bioinformatics. 2015 Feb 21;16(1):58. doi: 10.1186/s12859-015-0492-5.
6
Comprehensive evaluation of de novo transcriptome assembly programs and their effects on differential gene expression analysis.从头转录组组装程序的综合评估及其对差异基因表达分析的影响。
Bioinformatics. 2017 Feb 1;33(3):327-333. doi: 10.1093/bioinformatics/btw625.
7
HPC-T-Annotator: an HPC tool for de novo transcriptome assembly annotation.HPC-T-Annotator:用于从头转录组组装注释的 HPC 工具。
BMC Bioinformatics. 2024 Aug 21;25(1):272. doi: 10.1186/s12859-024-05887-3.
8
Pincho: A Modular Approach to High Quality De Novo Transcriptomics.Pincho:一种用于高质量从头转录组学的模块化方法。
Genes (Basel). 2021 Jun 22;12(7):953. doi: 10.3390/genes12070953.
9
RNA-Seq Data Analysis Pipeline for Plants: Transcriptome Assembly, Alignment, and Differential Expression Analysis.植物RNA-Seq数据分析流程:转录组组装、比对及差异表达分析
Methods Mol Biol. 2022;2396:47-60. doi: 10.1007/978-1-0716-1822-6_5.
10
Challenges and strategies in transcriptome assembly and differential gene expression quantification. A comprehensive in silico assessment of RNA-seq experiments.转录组组装和差异基因表达定量中的挑战与策略。RNA-seq 实验的综合计算机评估。
Mol Ecol. 2013 Feb;22(3):620-34. doi: 10.1111/mec.12014. Epub 2012 Sep 24.

引用本文的文献

1
Proteogenomic approach to immunopeptidomics of ovarian tumors identifies shared peptide vaccine candidates.卵巢肿瘤免疫肽组学的蛋白质基因组学方法鉴定出共享的肽疫苗候选物。
NPJ Vaccines. 2025 Aug 16;10(1):195. doi: 10.1038/s41541-025-01234-6.
2
Emerging functions within the enzyme families of plant alkaloid biosynthesis.植物生物碱生物合成酶家族中的新功能。
Phytochem Rev. 2023 Dec 12. doi: 10.1007/s11101-023-09901-z.
3
HPC-T-Assembly: a pipeline for de novo transcriptome assembly of large multi-specie datasets.HPC-T-Assembly:一种用于大型多物种数据集从头转录组组装的流程。

本文引用的文献

1
SignalP 6.0 predicts all five types of signal peptides using protein language models.SignalP 6.0 使用蛋白质语言模型预测所有五种类型的信号肽。
Nat Biotechnol. 2022 Jul;40(7):1023-1025. doi: 10.1038/s41587-021-01156-3. Epub 2022 Jan 3.
2
BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes.BUSCO 更新:用于真核生物、原核生物和病毒基因组评分的新颖且简化的工作流程以及更广泛和更深的系统发育覆盖范围。
Mol Biol Evol. 2021 Sep 27;38(10):4647-4654. doi: 10.1093/molbev/msab199.
3
Pincho: A Modular Approach to High Quality De Novo Transcriptomics.
BMC Bioinformatics. 2025 Apr 28;26(1):113. doi: 10.1186/s12859-025-06121-4.
4
The emergence and loss of cyclic peptides in illuminate dynamics and mechanisms of plant metabolic evolution.环肽的出现与消失揭示了植物代谢进化的动态过程和机制。
Proc Natl Acad Sci U S A. 2025 Apr 22;122(16):e2425055122. doi: 10.1073/pnas.2425055122. Epub 2025 Apr 14.
5
Semblans: automated assembly and processing of RNA-seq data.Semblans:RNA测序数据的自动化组装与处理
Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btaf003.
6
TrAnnoScope: A Modular Snakemake Pipeline for Full-Length Transcriptome Analysis and Functional Annotation.TrAnnoScope:用于全长转录组分析和功能注释的模块化Snakemake工作流程
Genes (Basel). 2024 Nov 29;15(12):1547. doi: 10.3390/genes15121547.
7
Comprehensive Analysis of the Influence of Technical and Biological Variations on De Novo Assembly of RNA-Seq Datasets.技术和生物学变异对RNA-Seq数据集从头组装的影响的综合分析
Bioinform Biol Insights. 2024 Dec 5;18:11779322241274957. doi: 10.1177/11779322241274957. eCollection 2024.
8
HPC-T-Annotator: an HPC tool for de novo transcriptome assembly annotation.HPC-T-Annotator:用于从头转录组组装注释的 HPC 工具。
BMC Bioinformatics. 2024 Aug 21;25(1):272. doi: 10.1186/s12859-024-05887-3.
9
TAGADA: a scalable pipeline to improve genome annotations with RNA-seq data.TAGADA:一种利用RNA测序数据改进基因组注释的可扩展流程。
NAR Genom Bioinform. 2023 Oct 16;5(4):lqad089. doi: 10.1093/nargab/lqad089. eCollection 2023 Dec.
Pincho:一种用于高质量从头转录组学的模块化方法。
Genes (Basel). 2021 Jun 22;12(7):953. doi: 10.3390/genes12070953.
4
RNAflow: An Effective and Simple RNA-Seq Differential Gene Expression Pipeline Using Nextflow.RNAflow:一种使用 Nextflow 的高效、简单的 RNA-Seq 差异基因表达分析流程。
Genes (Basel). 2020 Dec 10;11(12):1487. doi: 10.3390/genes11121487.
5
UniProt: the universal protein knowledgebase in 2021.UniProt:2021 年的通用蛋白质知识库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D480-D489. doi: 10.1093/nar/gkaa1100.
6
Rfam 14: expanded coverage of metagenomic, viral and microRNA families.Rfam 14:扩展了对宏基因组、病毒和 miRNA 家族的覆盖范围。
Nucleic Acids Res. 2021 Jan 8;49(D1):D192-D200. doi: 10.1093/nar/gkaa1047.
7
RASflow: an RNA-Seq analysis workflow with Snakemake.RASflow:一个基于 Snakemake 的 RNA-Seq 分析工作流程。
BMC Bioinformatics. 2020 Mar 18;21(1):110. doi: 10.1186/s12859-020-3433-x.
8
The nf-core framework for community-curated bioinformatics pipelines.用于社区策划生物信息学流程的nf-core框架。
Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x.
9
BISR-RNAseq: an efficient and scalable RNAseq analysis workflow with interactive report generation.BISR-RNAseq:一个高效、可扩展的 RNAseq 分析工作流程,具有交互式报告生成功能。
BMC Bioinformatics. 2019 Dec 20;20(Suppl 24):670. doi: 10.1186/s12859-019-3251-1.
10
The chromosome-scale reference genome of black pepper provides insight into piperine biosynthesis.黑胡椒的染色体级参考基因组为胡椒碱生物合成提供了线索。
Nat Commun. 2019 Oct 16;10(1):4702. doi: 10.1038/s41467-019-12607-6.