• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用基于局部形状的相似性度量对基因表达数据进行聚类。

Clustering of gene expression data using a local shape-based similarity measure.

作者信息

Balasubramaniyan Rajarajeswari, Hüllermeier Eyke, Weskamp Nils, Kämper Jörg

机构信息

Max-Planck Institute for Terrestrial Microbiology, Department of Organismic Interactions Karl-von-Frisch-Strasse, 35043 Marburg, Germany.

出版信息

Bioinformatics. 2005 Apr 1;21(7):1069-77. doi: 10.1093/bioinformatics/bti095. Epub 2004 Oct 28.

DOI:10.1093/bioinformatics/bti095
PMID:15513997
Abstract

MOTIVATION

Microarray technology enables the study of gene expression in large scale. The application of methods for data analysis then allows for grouping genes that show a similar expression profile and that are thus likely to be co-regulated. A relationship among genes at the biological level often presents itself by locally similar and potentially time-shifted patterns in their expression profiles.

RESULTS

Here, we propose a new method (CLARITY; Clustering with Local shApe-based similaRITY) for the analysis of microarray time course experiments that uses a local shape-based similarity measure based on Spearman rank correlation. This measure does not require a normalization of the expression data and is comparably robust towards noise. It is also able to detect similar and even time-shifted sub-profiles. To this end, we implemented an approach motivated by the BLAST algorithm for sequence alignment. We used CLARITY to cluster the times series of gene expression data during the mitotic cell cycle of the yeast Saccharomyces cerevisiae. The obtained clusters were related to the MIPS functional classification to assess their biological significance. We found that several clusters were significantly enriched with genes that share similar or related functions.

摘要

动机

微阵列技术能够大规模地研究基因表达。数据分析方法的应用使得能够对那些显示出相似表达谱且因此可能受到共同调控的基因进行分组。基因在生物学层面的关系通常通过其表达谱中局部相似且可能存在时间偏移的模式表现出来。

结果

在此,我们提出一种用于微阵列时间进程实验分析的新方法(CLARITY;基于局部形状相似性的聚类),该方法使用基于斯皮尔曼等级相关的基于局部形状的相似性度量。这种度量不需要对表达数据进行归一化,并且对噪声具有相当的鲁棒性。它还能够检测相似甚至时间偏移的子谱。为此,我们实现了一种受BLAST序列比对算法启发的方法。我们使用CLARITY对酿酒酵母有丝分裂细胞周期中的基因表达数据时间序列进行聚类。将获得的聚类与MIPS功能分类相关联以评估其生物学意义。我们发现几个聚类显著富集了具有相似或相关功能的基因。

相似文献

1
Clustering of gene expression data using a local shape-based similarity measure.使用基于局部形状的相似性度量对基因表达数据进行聚类。
Bioinformatics. 2005 Apr 1;21(7):1069-77. doi: 10.1093/bioinformatics/bti095. Epub 2004 Oct 28.
2
Beyond synexpression relationships: local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions.超越共表达关系:时移和反向基因表达谱的局部聚类可识别新的生物学相关相互作用。
J Mol Biol. 2001 Dec 14;314(5):1053-66. doi: 10.1006/jmbi.2000.5219.
3
Identifying time-lagged gene clusters using gene expression data.利用基因表达数据识别时间滞后的基因簇。
Bioinformatics. 2005 Feb 15;21(4):509-16. doi: 10.1093/bioinformatics/bti026. Epub 2004 Sep 16.
4
Correcting the loss of cell-cycle synchrony in clustering analysis of microarray data using weights.利用权重校正微阵列数据聚类分析中细胞周期同步性的损失。
Bioinformatics. 2004 Jul 22;20(11):1766-71. doi: 10.1093/bioinformatics/bth169. Epub 2004 May 27.
5
Detecting biological associations between genes based on the theory of phase synchronization.基于相位同步理论检测基因之间的生物学关联。
Biosystems. 2008 May;92(2):99-113. doi: 10.1016/j.biosystems.2007.12.006. Epub 2008 Jan 11.
6
Clustering of diverse genomic data using information fusion.利用信息融合对多样的基因组数据进行聚类分析。
Bioinformatics. 2005 Feb 15;21(4):423-9. doi: 10.1093/bioinformatics/bti186. Epub 2004 Dec 17.
7
Clustering of change patterns using Fourier coefficients.使用傅里叶系数对变化模式进行聚类。
Bioinformatics. 2008 Jan 15;24(2):184-91. doi: 10.1093/bioinformatics/btm568. Epub 2007 Nov 19.
8
A new clustering method for microarray data analysis.一种用于微阵列数据分析的新聚类方法。
Proc IEEE Comput Soc Bioinform Conf. 2002;1:268-75.
9
Interpolation based consensus clustering for gene expression time series.基于插值的基因表达时间序列一致性聚类
BMC Bioinformatics. 2015 Apr 16;16:117. doi: 10.1186/s12859-015-0541-0.
10
Gene function prediction by a combined analysis of gene expression data and protein-protein interaction data.通过基因表达数据和蛋白质-蛋白质相互作用数据的联合分析进行基因功能预测。
J Bioinform Comput Biol. 2005 Dec;3(6):1371-89. doi: 10.1142/s0219720005001612.

引用本文的文献

1
Molecular and Phytochemical Characteristics of Flower Color and Scent Compounds in Dog Rose ( L.).狗蔷薇(L.)花色和香气化合物的分子和植物化学特征。
Molecules. 2024 Jul 2;29(13):3145. doi: 10.3390/molecules29133145.
2
Clustering algorithm based on DINNSM and its application in gene expression data analysis.基于 DINNSM 的聚类算法及其在基因表达数据分析中的应用。
Technol Health Care. 2024;32(S1):229-239. doi: 10.3233/THC-248020.
3
Efficient Approximation of Statistical Significance in Local Trend Analysis of Dependent Time Series.
相依时间序列局部趋势分析中统计显著性的有效近似
Front Genet. 2022 Apr 26;13:729011. doi: 10.3389/fgene.2022.729011. eCollection 2022.
4
Statistical significance approximation for local similarity analysis of dependent time series data.相依时间序列数据局部相似性分析的统计显著性逼近。
BMC Bioinformatics. 2019 Jan 28;20(1):53. doi: 10.1186/s12859-019-2595-x.
5
GEsture: an online hand-drawing tool for gene expression pattern search.GEsture:一种用于基因表达模式搜索的在线手绘工具。
PeerJ. 2018 Jun 20;6:e4927. doi: 10.7717/peerj.4927. eCollection 2018.
6
Generalized correlation measure using count statistics for gene expression data with ordered samples.基于有序样本的基因表达数据的广义相关度量的计数统计
Bioinformatics. 2018 Feb 15;34(4):617-624. doi: 10.1093/bioinformatics/btx641.
7
Statistical significance approximation in local trend analysis of high-throughput time-series data using the theory of Markov chains.利用马尔可夫链理论对高通量时间序列数据进行局部趋势分析时的统计显著性近似
BMC Bioinformatics. 2015 Sep 21;16:301. doi: 10.1186/s12859-015-0732-8.
8
On the selection of appropriate distances for gene expression data clustering.基因表达数据聚类中适当距离的选择。
BMC Bioinformatics. 2014;15 Suppl 2(Suppl 2):S2. doi: 10.1186/1471-2105-15-S2-S2. Epub 2014 Jan 24.
9
Efficient statistical significance approximation for local similarity analysis of high-throughput time series data.高通量时间序列数据局部相似性分析的高效统计显著性逼近。
Bioinformatics. 2013 Jan 15;29(2):230-7. doi: 10.1093/bioinformatics/bts668. Epub 2012 Nov 23.
10
Clustering of High Throughput Gene Expression Data.高通量基因表达数据的聚类
Comput Oper Res. 2012 Dec;39(12):3046-3061. doi: 10.1016/j.cor.2012.03.008.