Suppr超能文献

TomExpress,一个统一的番茄 RNA-Seq 平台,用于可视化表达数据、聚类和相关网络。

TomExpress, a unified tomato RNA-Seq platform for visualization of expression data, clustering and correlation networks.

机构信息

University of Toulouse, INPT, Laboratory of Genomics and Biotechnology of Fruit, Avenue de l'Agrobiopole BP 32607, Castanet-Tolosan, F-31326, France.

INRA, UMR990 Génomique et Biotechnologie des Fruits, Chemin de Borde Rouge, Castanet-Tolosan, F-31326, France.

出版信息

Plant J. 2017 Nov;92(4):727-735. doi: 10.1111/tpj.13711. Epub 2017 Oct 25.

Abstract

The TomExpress platform was developed to provide the tomato research community with a browser and integrated web tools for public RNA-Seq data visualization and data mining. To avoid major biases that can result from the use of different mapping and statistical processing methods, RNA-Seq raw sequence data available in public databases were mapped de novo on a unique tomato reference genome sequence and post-processed using the same pipeline with accurate parameters. Following the calculation of the number of counts per gene in each RNA-Seq sample, a communal global normalization method was applied to all expression values. This unifies the whole set of expression data and makes them comparable. A database was designed where each expression value is associated with corresponding experimental annotations. Sample details were manually curated to be easily understandable by biologists. To make the data easily searchable, a user-friendly web interface was developed that provides versatile data mining web tools via on-the-fly generation of output graphics, such as expression bar plots, comprehensive in planta representations and heatmaps of hierarchically clustered expression data. In addition, it allows for the identification of co-expressed genes and the visualization of correlation networks of co-regulated gene groups. TomExpress provides one of the most complete free resources of publicly available tomato RNA-Seq data, and allows for the immediate interrogation of transcriptional programs that regulate vegetative and reproductive development in tomato under diverse conditions. The design of the pipeline developed in this project enables easy updating of the database with newly published RNA-Seq data, thereby allowing for continuous enrichment of the resource.

摘要

TomExpress 平台旨在为番茄研究界提供一个浏览器和集成的网络工具,用于公共 RNA-Seq 数据可视化和数据挖掘。为了避免因使用不同的映射和统计处理方法而产生的主要偏差,我们将公共数据库中可用的 RNA-Seq 原始序列数据从头映射到一个独特的番茄参考基因组序列上,并使用相同的具有准确参数的管道进行后处理。在计算每个 RNA-Seq 样本中每个基因的计数后,应用了一种公共全局标准化方法对所有表达值进行归一化。这统一了整个表达数据集,并使它们具有可比性。设计了一个数据库,其中每个表达值都与相应的实验注释相关联。样本细节经过精心编辑,以便生物学家易于理解。为了使数据易于搜索,开发了一个用户友好的网络界面,通过即时生成输出图形(如表达条形图、综合的体内表达图谱和层次聚类表达数据的热图)提供多功能的数据挖掘网络工具。此外,它还允许识别共表达基因,并可视化共调控基因组的相关网络。TomExpress 提供了最完整的免费番茄 RNA-Seq 数据集资源之一,可立即查询转录程序,以了解番茄在不同条件下的营养和生殖发育情况。本项目中开发的管道设计使数据库能够轻松更新新发布的 RNA-Seq 数据,从而不断丰富资源。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验