• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重新组装、质量评估和 678 个微生物真核参考转录组的注释。

Re-assembly, quality evaluation, and annotation of 678 microbial eukaryotic reference transcriptomes.

机构信息

Department of Population Health, and Reproduction, School of Veterinary Medicine, University of California Davis, One Shields Ave, Davis, CA 95616, USA.

Molecular, Cellular, and Integrative Physiology Graduate Group, University of California Davis, One Shields Ave, Davis, CA 95616, USA.

出版信息

Gigascience. 2019 Apr 1;8(4). doi: 10.1093/gigascience/giy158.

DOI:10.1093/gigascience/giy158
PMID:30544207
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6481552/
Abstract

BACKGROUND

De novo transcriptome assemblies are required prior to analyzing RNA sequencing data from a species without an existing reference genome or transcriptome. Despite the prevalence of transcriptomic studies, the effects of using different workflows, or "pipelines," on the resulting assemblies are poorly understood. Here, a pipeline was programmatically automated and used to assemble and annotate raw transcriptomic short-read data collected as part of the Marine Microbial Eukaryotic Transcriptome Sequencing Project. The resulting transcriptome assemblies were evaluated and compared against assemblies that were previously generated with a different pipeline developed by the National Center for Genome Research.

RESULTS

New transcriptome assemblies contained the majority of previous contigs as well as new content. On average, 7.8% of the annotated contigs in the new assemblies were novel gene names not found in the previous assemblies. Taxonomic trends were observed in the assembly metrics. Assemblies from the Dinoflagellata showed a higher number of contigs and unique k-mers than transcriptomes from other phyla, while assemblies from Ciliophora had a lower percentage of open reading frames compared to other phyla.

CONCLUSIONS

Given current bioinformatics approaches, there is no single "best" reference transcriptome for a particular set of raw data. As the optimum transcriptome is a moving target, improving (or not) with new tools and approaches, automated and programmable pipelines are invaluable for managing the computationally intensive tasks required for re-processing large sets of samples with revised pipelines and ensuring a common evaluation workflow is applied to all samples. Thus, re-assembling existing data with new tools using automated and programmable pipelines may yield more accurate identification of taxon-specific trends across samples in addition to novel and useful products for the community.

摘要

背景

在分析没有现有参考基因组或转录组的物种的 RNA 测序数据之前,需要从头转录组组装。尽管转录组研究很普遍,但对于不同工作流程或“管道”对组装结果的影响了解甚少。在这里,我们通过编程自动化了一个管道,用于组装和注释作为海洋微生物真核转录组测序项目一部分收集的原始转录组短读数据。评估了由此产生的转录组组装,并将其与先前使用国家基因组研究中心开发的不同管道生成的组装进行了比较。

结果

新的转录组组装包含了以前的大多数拼接体以及新的内容。在新组装中,平均有 7.8%的注释拼接体是以前组装中未发现的新基因名称。在组装指标中观察到了分类趋势。与其他门的转录组相比,甲藻门的组装具有更多的拼接体和独特的 k-mer,而纤毛门的组装与其他门相比,开放阅读框的比例较低。

结论

鉴于当前的生物信息学方法,对于特定的原始数据集,没有一个单一的“最佳”参考转录组。由于最佳转录组是一个移动的目标,随着新工具和方法的不断改进(或不改进),自动化和可编程的管道对于管理重新处理具有修订管道的大量样本所需的计算密集型任务以及确保对所有样本应用通用的评估工作流程非常有价值。因此,使用自动化和可编程的管道,用新工具重新组装现有数据,除了为社区提供新的和有用的产品外,还可能更准确地识别样本中特定于分类群的趋势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/bb71119fe09a/giy158fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/af106266fd5f/giy158fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/70896d8bcd5d/giy158fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/bf394f7010ec/giy158fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/75b849de44f2/giy158fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/885b0155a7ff/giy158fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/597a255318af/giy158fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/cb93f6e93f0b/giy158fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/bb71119fe09a/giy158fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/af106266fd5f/giy158fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/70896d8bcd5d/giy158fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/bf394f7010ec/giy158fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/75b849de44f2/giy158fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/885b0155a7ff/giy158fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/597a255318af/giy158fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/cb93f6e93f0b/giy158fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c135/6481552/bb71119fe09a/giy158fig8.jpg

相似文献

1
Re-assembly, quality evaluation, and annotation of 678 microbial eukaryotic reference transcriptomes.重新组装、质量评估和 678 个微生物真核参考转录组的注释。
Gigascience. 2019 Apr 1;8(4). doi: 10.1093/gigascience/giy158.
2
Optimization of de novo transcriptome assembly from high-throughput short read sequencing data improves functional annotation for non-model organisms.优化从头转录组组装从高通量短读测序数据提高非模式生物的功能注释。
BMC Bioinformatics. 2012 Jul 18;13:170. doi: 10.1186/1471-2105-13-170.
3
Combining independent de novo assemblies optimizes the coding transcriptome for nonconventional model eukaryotic organisms.结合独立的从头组装可优化非传统模式真核生物的编码转录组。
BMC Bioinformatics. 2016 Dec 9;17(1):525. doi: 10.1186/s12859-016-1406-x.
4
Improved annotation with de novo transcriptome assembly in four social amoeba species.通过对四种社会性变形虫物种进行从头转录组组装来改进注释。
BMC Genomics. 2017 Jan 31;18(1):120. doi: 10.1186/s12864-017-3505-0.
5
Comparative performance of transcriptome assembly methods for non-model organisms.非模式生物转录组组装方法的比较性能
BMC Genomics. 2016 Jul 27;17:523. doi: 10.1186/s12864-016-2923-8.
6
Challenges and advances for transcriptome assembly in non-model species.非模式物种转录组组装面临的挑战与进展
PLoS One. 2017 Sep 20;12(9):e0185020. doi: 10.1371/journal.pone.0185020. eCollection 2017.
7
Construction of a public CHO cell line transcript database using versatile bioinformatics analysis pipelines.利用多功能生物信息学分析管道构建公共 CHO 细胞系转录数据库。
PLoS One. 2014 Jan 10;9(1):e85568. doi: 10.1371/journal.pone.0085568. eCollection 2014.
8
A pipeline for the de novo assembly of the Themira biloba (Sepsidae: Diptera) transcriptome using a multiple k-mer length approach.使用多 k -mer 长度方法从头组装 Themira biloba(双翅目:Sepsidae)转录组的流水线。
BMC Genomics. 2014 Mar 12;15(1):188. doi: 10.1186/1471-2164-15-188.
9
TransFlow: a modular framework for assembling and assessing accurate de novo transcriptomes in non-model organisms.TransFlow:一种用于在非模式生物中组装和评估准确从头转录组的模块化框架。
BMC Bioinformatics. 2018 Nov 20;19(Suppl 14):416. doi: 10.1186/s12859-018-2384-y.
10
"Escalibur"-A practical pipeline for the de novo analysis of nucleotide variation in nonmodel eukaryotes.“Escalibur”-一种用于非模式真核生物核苷酸变异从头分析的实用管道。
Mol Ecol Resour. 2022 Jul;22(5):2120-2126. doi: 10.1111/1755-0998.13600. Epub 2022 Mar 2.

引用本文的文献

1
Proteome remodeling in the zoospore-to-vegetative cell transition of the stramenopile Aurantiochytrium limacinum reveals candidate ectoplasmic network proteins.硅藻金藻在游动孢子向营养细胞转变过程中的蛋白质组重塑揭示了候选胞外网络蛋白。
PLoS One. 2025 Jul 2;20(7):e0326651. doi: 10.1371/journal.pone.0326651. eCollection 2025.
2
Unveiling the functional nature of retrogenes in dinoflagellates.揭示甲藻中反转录基因的功能本质。
Open Biol. 2025 Apr;15(4):240221. doi: 10.1098/rsob.240221. Epub 2025 Apr 23.
3
Heat shock transcription factor-mediated thermal tolerance and cell size plasticity in marine diatoms.

本文引用的文献

1
Keeping it light: (re)analyzing community-wide datasets without major infrastructure.保持轻松:(重新)分析无需主要基础设施的全社区数据集。
Gigascience. 2019 Feb 1;8(2):giy159. doi: 10.1093/gigascience/giy159.
2
The Oyster River Protocol: a multi-assembler and kmer approach for de novo transcriptome assembly.牡蛎河协议:一种用于从头转录组组装的多组装器和kmer方法。
PeerJ. 2018 Aug 3;6:e5428. doi: 10.7717/peerj.5428. eCollection 2018.
3
De novo transcriptome assembly, annotation and comparison of four ecological and evolutionary model salmonid fish species.
热休克转录因子介导的海洋硅藻的热耐受性和细胞大小可塑性
Nat Commun. 2025 Apr 10;16(1):3404. doi: 10.1038/s41467-025-58547-2.
4
A multifunctional organelle coordinates phagocytosis and chlorophagy in a marine eukaryote phytoplankton Scyphosphaera apsteinii.一种多功能细胞器在海洋真核浮游植物阿氏鞘藻中协调吞噬作用和叶绿体吞噬作用。
New Phytol. 2025 May;246(3):1096-1112. doi: 10.1111/nph.20388. Epub 2025 Mar 4.
5
Missing microbial eukaryotes and misleading meta-omic conclusions.缺失的微生物真核生物和误导性的宏基因组学结论。
Nat Commun. 2024 Nov 14;15(1):9873. doi: 10.1038/s41467-024-52212-w.
6
Phylogenetic proximity drives temporal succession of marine giant viruses in a five-year metagenomic time-series.在一个为期五年的宏基因组时间序列中,系统发育亲缘关系驱动着海洋巨型病毒的时间演替。
bioRxiv. 2024 Aug 14:2024.08.12.607631. doi: 10.1101/2024.08.12.607631.
7
Microeukaryote metabolism across the western North Atlantic Ocean revealed through autonomous underwater profiling.通过自主水下剖面技术揭示北大西洋西部的微型真核生物代谢。
Nat Commun. 2024 Aug 25;15(1):7325. doi: 10.1038/s41467-024-51583-4.
8
First regional reference database of northern Adriatic diatom transcriptomes.首个亚得里亚海北部硅藻转录组区域参考数据库。
Sci Rep. 2024 Jul 13;14(1):16209. doi: 10.1038/s41598-024-67043-4.
9
Seasonal metabolic dynamics of microeukaryotic plankton: a year-long metatranscriptomic study in a temperate sea.季节代谢动态的微型浮游生物:在温带海洋一年的宏转录组学研究。
mBio. 2024 Aug 14;15(8):e0038324. doi: 10.1128/mbio.00383-24. Epub 2024 Jul 9.
10
DeepMicroClass sorts metagenomic contigs into prokaryotes, eukaryotes and viruses.DeepMicroClass将宏基因组重叠群分类为原核生物、真核生物和病毒。
NAR Genom Bioinform. 2024 May 6;6(2):lqae044. doi: 10.1093/nargab/lqae044. eCollection 2024 Jun.
从头组装、注释和比较四种生态和进化模式的鲑鱼科鱼类的转录组。
BMC Genomics. 2018 Jan 8;19(1):32. doi: 10.1186/s12864-017-4379-x.
4
De novo transcriptome assembly and analysis of differential gene expression in response to drought in European beech.欧洲山毛榉响应干旱的从头转录组组装及差异基因表达分析
PLoS One. 2017 Sep 5;12(9):e0184167. doi: 10.1371/journal.pone.0184167. eCollection 2017.
5
De novo transcriptome assembly and developmental mode specific gene expression of Pygospio elegans.秀丽磷虫的从头转录组组装及发育模式特异性基因表达
Evol Dev. 2017 Jul;19(4-5):205-217. doi: 10.1111/ede.12230.
6
De novo assembly and characterization of breast cancer transcriptomes identifies large numbers of novel fusion-gene transcripts of potential functional significance.乳腺癌转录组的从头组装和特征分析鉴定出大量具有潜在功能意义的新型融合基因转录本。
BMC Med Genomics. 2017 Aug 29;10(1):53. doi: 10.1186/s12920-017-0289-7.
7
Conserved Transcriptional Responses to Nutrient Stress in Bloom-Forming Algae.形成水华的藻类对营养胁迫的保守转录反应
Front Microbiol. 2017 Jul 18;8:1279. doi: 10.3389/fmicb.2017.01279. eCollection 2017.
8
Tissue resolved, gene structure refined equine transcriptome.组织解析、基因结构优化的马转录组。
BMC Genomics. 2017 Jan 20;18(1):103. doi: 10.1186/s12864-016-3451-2.
9
Reassigning stop codons via translation termination: How a few eukaryotes broke the dogma.通过翻译终止重新分配终止密码子:少数真核生物如何打破这一教条。
Bioessays. 2017 Mar;39(3). doi: 10.1002/bies.201600213. Epub 2016 Dec 23.
10
Genomes of coral dinoflagellate symbionts highlight evolutionary adaptations conducive to a symbiotic lifestyle.珊瑚共生甲藻基因组揭示了有利于共生生活方式的进化适应性。
Sci Rep. 2016 Dec 22;6:39734. doi: 10.1038/srep39734.