Multiple gene expression profile alignment for microarray time-series data clustering.

Affiliation

School of Computer Science, University of Windsor, Windsor, Ontario, Canada.

Publication information

Bioinformatics. 2010 Sep 15;26(18):2281-8. doi: 10.1093/bioinformatics/btq422. Epub 2010 Jul 16.

Abstract

MOTIVATION

Clustering gene expression data given as time-series is a challenging problem with its own particular constraints. Traditional clustering methods based on conventional similarity measures are not always suitable for time-series data. A few methods that take the temporal dimension of the data into account have recently been proposed for clustering microarray time-series. These methods either define a similarity measure appropriate for temporal expression data, or pre-process the data so that the temporal relationships between and within the time-series are considered during the subsequent clustering phase.

RESULTS

We introduce pairwise gene expression profile alignment, which vertically shifts two profiles in such a way that the area between their corresponding curves is minimal. Based on the pairwise alignment operation, we define a new distance function that is appropriate for time-series profiles. We also introduce a new clustering method that involves multiple expression profile alignment, which generalizes pairwise alignment to a set of profiles. Extensive experiments on well-known datasets yield encouraging results of at least 80% classification accuracy.
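
The pairwise alignment idea can be illustrated with a minimal sketch. The abstract does not say how profiles are interpolated or exactly how the "area" is measured, so the code below makes two assumptions that are not taken from the source: profiles are treated as piecewise-linear curves through the sampled time points, and the area is measured as the integral of the squared difference between the curves. Under those assumptions the best vertical shift is simply the average gap between the two curves. The function names are hypothetical.

```python
def trapezoid_integral(times, values):
    """Exact integral of the piecewise-linear curve through (times, values)."""
    total = 0.0
    for i in range(len(times) - 1):
        h = times[i + 1] - times[i]
        total += h * (values[i] + values[i + 1]) / 2.0
    return total


def pairwise_alignment(times, x, y):
    """Vertically shift profile y to best match profile x.

    ASSUMPTION (not from the abstract): curves are piecewise linear and the
    quantity minimized is the integral of (x(t) - y(t) - shift)^2.
    Returns (shift, distance), where distance is that integral at the
    optimal shift.
    """
    diff = [xi - yi for xi, yi in zip(x, y)]
    span = times[-1] - times[0]
    # Setting the derivative with respect to the shift to zero gives the
    # mean of x(t) - y(t) over the whole time span.
    shift = trapezoid_integral(times, diff) / span
    residual = [d - shift for d in diff]
    distance = 0.0
    for i in range(len(times) - 1):
        h = times[i + 1] - times[i]
        u, v = residual[i], residual[i + 1]
        # Closed form for the integral of a squared linear segment.
        distance += h * (u * u + u * v + v * v) / 3.0
    return shift, distance


if __name__ == "__main__":
    t = [0.0, 1.0, 2.0]
    print(pairwise_alignment(t, [1.0, 2.0, 3.0], [0.0, 1.0, 2.0]))
```

Two profiles with the same shape but different baseline levels get a distance of zero under this sketch, which is the property that makes a shift-based distance attractive for clustering by temporal pattern. A multiple-profile version would choose one shift per profile against a common reference, but the abstract gives no details, so that is not attempted here.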
