• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TimesVector:一种用于分析来自多种表型的时间序列转录组数据的向量化聚类方法。

TimesVector: a vectorized clustering approach to the analysis of time series transcriptome data from multiple phenotypes.

机构信息

Interdisciplinary Program in Bioinformatics, Seoul National University, Gwanak-Gu, Seoul, 151-747, Republic of Korea.

Department of Computer Science and Engineering.

出版信息

Bioinformatics. 2017 Dec 1;33(23):3827-3835. doi: 10.1093/bioinformatics/btw780.

DOI:10.1093/bioinformatics/btw780
PMID:28096084
Abstract

MOTIVATION

Identifying biologically meaningful gene expression patterns from time series gene expression data is important to understand the underlying biological mechanisms. To identify significantly perturbed gene sets between different phenotypes, analysis of time series transcriptome data requires consideration of time and sample dimensions. Thus, the analysis of such time series data seeks to search gene sets that exhibit similar or different expression patterns between two or more sample conditions, constituting the three-dimensional data, i.e. gene-time-condition. Computational complexity for analyzing such data is very high, compared to the already difficult NP-hard two dimensional biclustering algorithms. Because of this challenge, traditional time series clustering algorithms are designed to capture co-expressed genes with similar expression pattern in two sample conditions.

RESULTS

We present a triclustering algorithm, TimesVector, specifically designed for clustering three-dimensional time series data to capture distinctively similar or different gene expression patterns between two or more sample conditions. TimesVector identifies clusters with distinctive expression patterns in three steps: (i) dimension reduction and clustering of time-condition concatenated vectors, (ii) post-processing clusters for detecting similar and distinct expression patterns and (iii) rescuing genes from unclassified clusters. Using four sets of time series gene expression data, generated by both microarray and high throughput sequencing platforms, we demonstrated that TimesVector successfully detected biologically meaningful clusters of high quality. TimesVector improved the clustering quality compared to existing triclustering tools and only TimesVector detected clusters with differential expression patterns across conditions successfully.

AVAILABILITY AND IMPLEMENTATION

The TimesVector software is available at http://biohealth.snu.ac.kr/software/TimesVector/.

CONTACT

sunkim.bioinfo@snu.ac.kr.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

从时间序列基因表达数据中识别有生物学意义的基因表达模式对于理解潜在的生物学机制非常重要。为了识别不同表型之间显著扰动的基因集,需要考虑时间和样本维度来分析时间序列转录组数据。因此,此类时间序列数据的分析旨在搜索在两个或更多样本条件之间表现出相似或不同表达模式的基因集,构成三维数据,即基因-时间-条件。与已经困难的二维二分聚类算法相比,分析此类数据的计算复杂度非常高。由于这一挑战,传统的时间序列聚类算法旨在捕获具有相似表达模式的两个样本条件下共表达的基因。

结果

我们提出了一种三聚类算法 TimesVector,专门用于聚类三维时间序列数据,以捕获两个或更多样本条件之间独特相似或不同的基因表达模式。TimesVector 通过三个步骤识别具有独特表达模式的聚类:(i)将时间-条件串联向量降维和聚类,(ii)对聚类进行后处理以检测相似和不同的表达模式,以及(iii)从未分类的聚类中恢复基因。使用四个时间序列基因表达数据集,这些数据集由微阵列和高通量测序平台生成,我们证明了 TimesVector 成功地检测到了高质量的生物学有意义的聚类。与现有的三聚类工具相比,TimesVector 提高了聚类质量,并且只有 TimesVector 成功地检测到了条件之间具有差异表达模式的聚类。

可用性和实现

TimesVector 软件可在 http://biohealth.snu.ac.kr/software/TimesVector/ 获得。

联系人

sunkim.bioinfo@snu.ac.kr。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
TimesVector: a vectorized clustering approach to the analysis of time series transcriptome data from multiple phenotypes.TimesVector:一种用于分析来自多种表型的时间序列转录组数据的向量化聚类方法。
Bioinformatics. 2017 Dec 1;33(23):3827-3835. doi: 10.1093/bioinformatics/btw780.
2
Detecting clusters of different geometrical shapes in microarray gene expression data.在微阵列基因表达数据中检测不同几何形状的聚类。
Bioinformatics. 2005 May 1;21(9):1927-34. doi: 10.1093/bioinformatics/bti251. Epub 2005 Jan 12.
3
TimesVector-Web: A Web Service for Analysing Time Course Transcriptome Data with Multiple Conditions.TimesVector-Web:一个用于分析具有多种条件的时间进程转录组数据的网络服务。
Genes (Basel). 2021 Dec 28;13(1):73. doi: 10.3390/genes13010073.
4
Multiobjective triclustering of time-series transcriptome data reveals key genes of biological processes.时间序列转录组数据的多目标三聚类揭示生物过程的关键基因。
BMC Bioinformatics. 2015 Jun 26;16:200. doi: 10.1186/s12859-015-0635-8.
5
CLIC: clustering analysis of large microarray datasets with individual dimension-based clustering.CLIC:基于个体维度聚类的大型微阵列数据集聚类分析。
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W246-53. doi: 10.1093/nar/gkq516. Epub 2010 Jun 6.
6
Influence maximization in time bounded network identifies transcription factors regulating perturbed pathways.时间受限网络中的影响力最大化识别调控受扰通路的转录因子。
Bioinformatics. 2016 Jun 15;32(12):i128-i136. doi: 10.1093/bioinformatics/btw275.
7
Analysis of a Gibbs sampler method for model-based clustering of gene expression data.一种基于模型的基因表达数据聚类的吉布斯采样器方法分析。
Bioinformatics. 2008 Jan 15;24(2):176-83. doi: 10.1093/bioinformatics/btm562. Epub 2007 Nov 22.
8
TriRNSC: triclustering of gene expression microarray data using restricted neighbourhood search.TriRNSC:基于受限邻域搜索的基因表达微阵列数据的三重聚类。
IET Syst Biol. 2020 Dec;14(6):323-333. doi: 10.1049/iet-syb.2020.0024.
9
Combining Pareto-optimal clusters using supervised learning for identifying co-expressed genes.使用监督学习组合帕累托最优聚类以识别共表达基因。
BMC Bioinformatics. 2009 Jan 20;10:27. doi: 10.1186/1471-2105-10-27.
10
EDISA: extracting biclusters from multiple time-series of gene expression profiles.EDISA:从多个基因表达谱时间序列中提取双聚类
BMC Bioinformatics. 2007 Sep 12;8:334. doi: 10.1186/1471-2105-8-334.

引用本文的文献

1
Transcriptomic profiling of gastrointestinal tracts in dairy cattle during lactation reveals molecular adaptations for milk synthesis.泌乳期奶牛胃肠道的转录组分析揭示了牛奶合成的分子适应性。
J Adv Res. 2025 May;71:67-80. doi: 10.1016/j.jare.2024.06.020. Epub 2024 Jun 24.
2
A clustering procedure for three-way RNA sequencing data using data transformations and matrix-variate Gaussian mixture models.基于数据变换和矩阵变量高斯混合模型的三方 RNA 测序数据聚类方法。
BMC Bioinformatics. 2024 Mar 1;25(1):90. doi: 10.1186/s12859-024-05717-6.
3
An auxin-mediated ultradian rhythm positively influences root regeneration in .
生长素介导的超日节律对[具体对象]的根再生有积极影响。 (原文中“in”后面缺少具体内容)
Front Plant Sci. 2023 Jun 7;14:1136445. doi: 10.3389/fpls.2023.1136445. eCollection 2023.
4
Identification of novel anti-ZIKV drugs from viral-infection temporal gene expression profiles.从病毒感染时间基因表达谱中鉴定新型抗 ZIKV 药物。
Emerg Microbes Infect. 2023 Dec;12(1):2174777. doi: 10.1080/22221751.2023.2174777.
5
Comparative network analysis reveals the dynamics of organic acid diversity during fruit ripening in peach (Prunus persica L. Batsch).比较网络分析揭示了桃(Prunus persica L. Batsch)果实成熟过程中有机酸多样性的动态变化。
BMC Plant Biol. 2023 Jan 9;23(1):16. doi: 10.1186/s12870-023-04037-w.
6
Comparative transcriptome and metabolome analyses identified the mode of sucrose degradation as a metabolic marker for early vegetative propagation in bulbs of Lycoris.比较转录组和代谢组分析确定了蔗糖降解的模式作为石蒜鳞茎早期营养繁殖的代谢标记。
Plant J. 2022 Oct;112(1):115-134. doi: 10.1111/tpj.15935. Epub 2022 Sep 8.
7
Time Series Transcriptome Analysis in Shoot and Root Tissue During Early Nodulation.早期结瘤过程中茎和根组织的时间序列转录组分析
Front Plant Sci. 2022 Apr 7;13:861639. doi: 10.3389/fpls.2022.861639. eCollection 2022.
8
TimesVector-Web: A Web Service for Analysing Time Course Transcriptome Data with Multiple Conditions.TimesVector-Web:一个用于分析具有多种条件的时间进程转录组数据的网络服务。
Genes (Basel). 2021 Dec 28;13(1):73. doi: 10.3390/genes13010073.
9
Charting Shifts in Saccharomyces cerevisiae Gene Expression across Asynchronous Time Trajectories with Diffusion Maps.利用扩散映射绘制酿酒酵母基因表达在异步时间轨迹上的变化。
mBio. 2021 Oct 26;12(5):e0234521. doi: 10.1128/mBio.02345-21. Epub 2021 Oct 5.
10
Temporal Dynamic Methods for Bulk RNA-Seq Time Series Data.批量 RNA-Seq 时间序列数据的时间动态方法。
Genes (Basel). 2021 Feb 27;12(3):352. doi: 10.3390/genes12030352.