Lefol Yohan, Korfage Tom, Mjelle Robin, Prebensen Christian, Lüders Torben, Müller Bruno, Krokan Hans, Sarno Antonio, Alsøe Lene, Berdal Jan-Erik, Sætrom Pål, Nilsen Hilde, Domanska Diana
Institute of Clinical Medicine, University of Oslo, PO Box 1171, Blindern 0318, Norway.
Department of Microbiology, University of Oslo, Rikshospitalet, Oslo 0424, Norway.
NAR Genom Bioinform. 2023 Mar 3;5(1):lqad020. doi: 10.1093/nargab/lqad020. eCollection 2023 Mar.
Improved transcriptomic sequencing technologies now make it possible to perform longitudinal experiments, thus generating a large amount of data. Currently, there are no dedicated or comprehensive methods for the analysis of these experiments. In this article, we describe our TimeSeries Analysis pipeline (TiSA) which combines differential gene expression, clustering based on recursive thresholding, and a functional enrichment analysis. Differential gene expression is performed for both the temporal and conditional axes. Clustering is performed on the identified differentially expressed genes, with each cluster being evaluated using a functional enrichment analysis. We show that TiSA can be used to analyse longitudinal transcriptomic data from both microarrays and RNA-seq, as well as small, large, and/or datasets with missing data points. The tested datasets ranged in complexity, some originating from cell lines while another was from a longitudinal experiment of severity in COVID-19 patients. We have also included custom figures to aid with the biological interpretation of the data, these plots include Principal Component Analyses, Multi Dimensional Scaling plots, functional enrichment dotplots, trajectory plots, and complex heatmaps showing the broad overview of results. To date, TiSA is the first pipeline to provide an easy solution to the analysis of longitudinal transcriptomics experiments.
改进的转录组测序技术现在使进行纵向实验成为可能,从而产生大量数据。目前,尚无专门或全面的方法来分析这些实验。在本文中,我们描述了我们的时间序列分析流程(TiSA),它结合了差异基因表达、基于递归阈值的聚类和功能富集分析。对时间轴和条件轴都进行差异基因表达分析。对鉴定出的差异表达基因进行聚类,并使用功能富集分析对每个聚类进行评估。我们表明,TiSA可用于分析来自微阵列和RNA测序的纵向转录组数据,以及具有缺失数据点的小型、大型和/或数据集。测试的数据集在复杂性上各不相同,一些来自细胞系,另一些来自COVID-19患者严重程度的纵向实验。我们还包括了自定义图形以帮助对数据进行生物学解释,这些图包括主成分分析、多维标度图、功能富集点图、轨迹图以及显示结果概述的复杂热图。迄今为止,TiSA是第一个为纵向转录组学实验分析提供简便解决方案的流程。