多种模式生物选择用于非模式生物的转录组学分析。

Multiple model species selection for transcriptomics analysis of non-model organisms.

机构信息

Department of Computer Science and Engineering, National Taiwan Ocean University, Keelung, Taiwan.

Department of Computer Science and Information Engineering, National Taipei University of Technology, Taipei, Taiwan.

出版信息

BMC Bioinformatics. 2018 Aug 13;19(Suppl 9):284. doi: 10.1186/s12859-018-2278-z.

DOI:10.1186/s12859-018-2278-z

PMID:30367568

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6101069/

Abstract

BACKGROUND

Transcriptomic sequencing (RNA-seq) related applications allow for rapid explorations due to their high-throughput and relatively fast experimental capabilities, providing unprecedented progress in gene functional annotation, gene regulation analysis, and environmental factor verification. However, with increasing amounts of sequenced reads and reference model species, the selection of appropriate reference species for gene annotation has become a new challenge.

METHODS

We proposed a novel approach for finding the most effective reference model species through taxonomic associations and ultra-conserved orthologous (UCO) gene comparisons among species. An online system for multiple species selection (MSS) for RNA-seq differential expression analysis was developed, and comprehensive genomic annotations from 291 reference model eukaryotic species were retrieved from the RefSeq, KEGG, and UniProt databases.

RESULTS

Using the proposed MSS pipeline, gene ontology and biological pathway enrichment analysis can be efficiently achieved, especially in the case of transcriptomic analysis of non-model organisms. The results showed that the proposed method solved problems related to limitations in annotation information and provided a roughly twenty-fold reduction in computational time, resulting in more accurate results than those of traditional approaches of using a single model reference species or the large non-redundant reference database.

CONCLUSIONS

Selection of appropriate reference model species helps to reduce missing annotation information, allowing for more comprehensive results than those obtained with a single model reference species. In addition, adequate model species selection reduces the computational time significantly while retaining the same order of accuracy. The proposed system indeed provides superior performance by selecting appropriate multiple species for transcriptomic analysis compared to traditional approaches.

摘要

背景

转录组测序（RNA-seq）相关应用因其高通量和相对较快的实验能力而能够快速探索，为基因功能注释、基因调控分析和环境因素验证提供了前所未有的进展。然而，随着测序读段和参考模型物种数量的增加，为基因注释选择合适的参考物种已成为新的挑战。

方法

我们提出了一种通过分类群关联和物种间超保守直系同源（UCO）基因比较来寻找最有效参考模型物种的新方法。开发了一个用于 RNA-seq 差异表达分析的多物种选择（MSS）在线系统，并从 RefSeq、KEGG 和 UniProt 数据库中检索了 291 个参考真核模型物种的综合基因组注释。

结果

使用所提出的 MSS 管道，可以有效地进行基因本体论和生物途径富集分析，特别是在非模型生物的转录组分析中。结果表明，该方法解决了注释信息有限的问题，并将计算时间减少了约二十倍，与使用单个模型参考物种或大型非冗余参考数据库的传统方法相比，结果更准确。

结论

选择合适的参考模型物种有助于减少缺失的注释信息，提供比使用单个模型参考物种更全面的结果。此外，适当的模型物种选择可以大大减少计算时间，同时保持相同的准确性。与传统方法相比，该系统通过为转录组分析选择合适的多个物种，确实提供了卓越的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d85/6101069/19d5e1c93251/12859_2018_2278_Fig1_HTML.jpg

相似文献

Multiple model species selection for transcriptomics analysis of non-model organisms.多种模式生物选择用于非模式生物的转录组学分析。

BMC Bioinformatics. 2018 Aug 13;19(Suppl 9):284. doi: 10.1186/s12859-018-2278-z.

PARRoT- a homology-based strategy to quantify and compare RNA-sequencing from non-model organisms.PARRoT——一种基于同源性的策略，用于量化和比较非模式生物的RNA测序。

BMC Bioinformatics. 2016 Dec 22;17(Suppl 19):513. doi: 10.1186/s12859-016-1366-1.

A robust (re-)annotation approach to generate unbiased mapping references for RNA-seq-based analyses of differential expression across closely related species.一种强大的（重新）注释方法，用于为基于RNA测序的密切相关物种间差异表达分析生成无偏映射参考。

BMC Genomics. 2016 May 24;17:392. doi: 10.1186/s12864-016-2646-x.

The aquatic animals' transcriptome resource for comparative functional analysis.水产动物比较功能分析转录组资源。

BMC Genomics. 2018 May 9;19(Suppl 2):103. doi: 10.1186/s12864-018-4463-x.

Comparative performance of transcriptome assembly methods for non-model organisms.非模式生物转录组组装方法的比较性能

BMC Genomics. 2016 Jul 27;17:523. doi: 10.1186/s12864-016-2923-8.

Designing a transcriptome next-generation sequencing project for a nonmodel plant species.为非模式植物物种设计转录组二代测序项目。

Am J Bot. 2012 Feb;99(2):257-66. doi: 10.3732/ajb.1100292. Epub 2012 Jan 19.

Issues with RNA-seq analysis in non-model organisms: A salmonid example.非模式生物中RNA测序分析的问题：一个鲑科鱼类的例子。

Dev Comp Immunol. 2017 Oct;75:38-47. doi: 10.1016/j.dci.2017.02.006. Epub 2017 Feb 20.

Comparative transcriptomics of elasmobranchs and teleosts highlight important processes in adaptive immunity and regional endothermy.软骨鱼和硬骨鱼的比较转录组学突出了适应性免疫和区域性体温调节中的重要过程。

BMC Genomics. 2017 Jan 30;18(1):87. doi: 10.1186/s12864-016-3411-x.

Blast2Fish: a reference-based annotation web tool for transcriptome analysis of non-model teleost fish.Blast2Fish：一种基于参考的非模式硬骨鱼类转录组分析注释网络工具。

BMC Bioinformatics. 2020 May 4;21(1):174. doi: 10.1186/s12859-020-3507-9.

TOA: A software package for automated functional annotation in non-model plant species.TOA：用于非模式植物物种自动功能注释的软件包。

Mol Ecol Resour. 2021 Feb;21(2):621-636. doi: 10.1111/1755-0998.13285. Epub 2020 Nov 18.

引用本文的文献

Comparative Transcriptome Analyses of Different Tissues Reveal Differentially Expressed Genes Associated with Anthraquinone, Catechin, and Gallic Acid Biosynthesis.不同组织的比较转录组分析揭示了与蒽醌、儿茶素和没食子酸生物合成相关的差异表达基因。

Genes (Basel). 2022 Sep 5;13(9):1592. doi: 10.3390/genes13091592.

Insights into the species evolution of copepods in the northern seas revealed by transcriptome sequencing.转录组测序揭示北海桡足类动物的物种进化见解

Ecol Evol. 2022 Feb 22;12(2):e8606. doi: 10.1002/ece3.8606. eCollection 2022 Feb.

Juxtapose: a gene-embedding approach for comparing co-expression networks.并列：一种用于比较共表达网络的基因嵌入方法。

BMC Bioinformatics. 2021 Mar 16;22(1):125. doi: 10.1186/s12859-021-04055-1.

Blast2Fish: a reference-based annotation web tool for transcriptome analysis of non-model teleost fish.Blast2Fish：一种基于参考的非模式硬骨鱼类转录组分析注释网络工具。

BMC Bioinformatics. 2020 May 4;21(1):174. doi: 10.1186/s12859-020-3507-9.

本文引用的文献

Transcriptomic Analysis of Metabolic Pathways in Milkfish That Respond to Salinity and Temperature Changes.虱目鱼中响应盐度和温度变化的代谢途径的转录组分析

PLoS One. 2015 Aug 11;10(8):e0134959. doi: 10.1371/journal.pone.0134959. eCollection 2015.

UniProt: a hub for protein information.通用蛋白质数据库（UniProt）：蛋白质信息中心。

Nucleic Acids Res. 2015 Jan;43(Database issue):D204-12. doi: 10.1093/nar/gku989. Epub 2014 Oct 27.

Transcriptome analysis reveals the same 17 S-locus F-box genes in two haplotypes of the self-incompatibility locus of Petunia inflata.转录组分析揭示了矮牵牛自交不亲和位点的两个单倍型中相同的17个S-位点F-box基因。

Plant Cell. 2014 Jul;26(7):2873-88. doi: 10.1105/tpc.114.126920. Epub 2014 Jul 28.

RNA sequencing analysis of the gametophyte transcriptome from the liverwort, Marchantia polymorpha.RNA 测序分析地钱配子体转录组。

PLoS One. 2014 May 19;9(5):e97497. doi: 10.1371/journal.pone.0097497. eCollection 2014.

Sequencing and de novo assembly of the Asian clam (Corbicula fluminea) transcriptome using the Illumina GAIIx method.使用Illumina GAIIx方法对亚洲蚬（河蚬，Corbicula fluminea）转录组进行测序和从头组装。

PLoS One. 2013 Nov 7;8(11):e79516. doi: 10.1371/journal.pone.0079516. eCollection 2013.

De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.利用 Trinity 平台从 RNA-seq 进行从头转录序列重建，用于参考生成和分析。

Nat Protoc. 2013 Aug;8(8):1494-512. doi: 10.1038/nprot.2013.084. Epub 2013 Jul 11.

Phylogenomic analyses of nuclear genes reveal the evolutionary relationships within the BEP clade and the evidence of positive selection in Poaceae.核基因的系统基因组学分析揭示了 BEP 分支内的进化关系以及禾本科中阳性选择的证据。

PLoS One. 2013 May 29;8(5):e64642. doi: 10.1371/journal.pone.0064642. Print 2013.

Transcriptome analysis and SSR/SNP markers information of the blunt snout bream (Megalobrama amblycephala).转录组分析和短串联重复/单核苷酸多态性标记信息的团头鲂（Megalobrama amblycephala）。

PLoS One. 2012;7(8):e42637. doi: 10.1371/journal.pone.0042637. Epub 2012 Aug 6.

The NCBI Taxonomy database.NCBI 分类数据库。

Nucleic Acids Res. 2012 Jan;40(Database issue):D136-43. doi: 10.1093/nar/gkr1178. Epub 2011 Dec 1.

NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy.NCBI 参考序列（RefSeq）：现状、新特性和基因组注释政策。

Nucleic Acids Res. 2012 Jan;40(Database issue):D130-5. doi: 10.1093/nar/gkr1079. Epub 2011 Nov 24.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

多种模式生物选择用于非模式生物的转录组学分析。

Multiple model species selection for transcriptomics analysis of non-model organisms.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献