• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多自组织映射聚类方法在巨噬细胞基因表达分析中的应用。

Application of Multi-SOM clustering approach to macrophage gene expression analysis.

作者信息

Ghouila Amel, Yahia Sadok Ben, Malouche Dhafer, Jmel Haifa, Laouini Dhafer, Guerfali Fatma Z, Abdelhak Sonia

机构信息

Research Unit on Molecular Investigation of Genetic Orphan Diseases, Institut Pasteur de Tunis, 13 Place Pasteur, BP 74, Tunis Belvédère 1002, Tunisia.

出版信息

Infect Genet Evol. 2009 May;9(3):328-36. doi: 10.1016/j.meegid.2008.09.009. Epub 2008 Oct 17.

DOI:10.1016/j.meegid.2008.09.009
PMID:18992849
Abstract

The production of increasingly reliable and accessible gene expression data has stimulated the development of computational tools to interpret such data and to organize them efficiently. The clustering techniques are largely recognized as useful exploratory tools for gene expression data analysis. Genes that show similar expression patterns over a wide range of experimental conditions can be clustered together. This relies on the hypothesis that genes that belong to the same cluster are coregulated and involved in related functions. Nevertheless, clustering algorithms still show limits, particularly for the estimation of the number of clusters and the interpretation of hierarchical dendrogram, which may significantly influence the outputs of the analysis process. We propose here a multi level SOM based clustering algorithm named Multi-SOM. Through the use of clustering validity indices, Multi-SOM overcomes the problem of the estimation of clusters number. To test the validity of the proposed clustering algorithm, we first tested it on supervised training data sets. Results were evaluated by computing the number of misclassified samples. We have then used Multi-SOM for the analysis of macrophage gene expression data generated in vitro from the same individual blood infected with 5 different pathogens. This analysis led to the identification of sets of tightly coregulated genes across different pathogens. Gene Ontology tools were then used to estimate the biological significance of the clustering, which showed that the obtained clusters are coherent and biologically significant.

摘要

越来越可靠且易于获取的基因表达数据的产生,刺激了用于解释此类数据并对其进行有效组织的计算工具的发展。聚类技术在很大程度上被认为是用于基因表达数据分析的有用探索工具。在广泛的实验条件下显示出相似表达模式的基因可以被聚类在一起。这依赖于这样的假设,即属于同一聚类的基因是共调控的且参与相关功能。然而,聚类算法仍然存在局限性,特别是在聚类数量的估计和层次树状图的解释方面,这可能会显著影响分析过程的输出。我们在此提出一种基于多层自组织映射的聚类算法,名为Multi - SOM。通过使用聚类有效性指标,Multi - SOM克服了聚类数量估计的问题。为了测试所提出聚类算法的有效性,我们首先在监督训练数据集上对其进行测试。通过计算错误分类样本的数量来评估结果。然后我们使用Multi - SOM对来自同一个体感染5种不同病原体的血液体外产生的巨噬细胞基因表达数据进行分析。该分析导致识别出跨不同病原体的紧密共调控基因集。然后使用基因本体工具来估计聚类的生物学意义,结果表明所获得的聚类是连贯的且具有生物学意义。

相似文献

1
Application of Multi-SOM clustering approach to macrophage gene expression analysis.多自组织映射聚类方法在巨噬细胞基因表达分析中的应用。
Infect Genet Evol. 2009 May;9(3):328-36. doi: 10.1016/j.meegid.2008.09.009. Epub 2008 Oct 17.
2
Clustering of change patterns using Fourier coefficients.使用傅里叶系数对变化模式进行聚类。
Bioinformatics. 2008 Jan 15;24(2):184-91. doi: 10.1093/bioinformatics/btm568. Epub 2007 Nov 19.
3
Detecting clusters of different geometrical shapes in microarray gene expression data.在微阵列基因表达数据中检测不同几何形状的聚类。
Bioinformatics. 2005 May 1;21(9):1927-34. doi: 10.1093/bioinformatics/bti251. Epub 2005 Jan 12.
4
An iterative data mining approach for mining overlapping coexpression patterns in noisy gene expression data.一种用于在嘈杂基因表达数据中挖掘重叠共表达模式的迭代数据挖掘方法。
IEEE Trans Nanobioscience. 2009 Sep;8(3):252-8. doi: 10.1109/TNB.2009.2026747. Epub 2009 Jul 14.
5
A new algorithm for comparing and visualizing relationships between hierarchical and flat gene expression data clusterings.一种用于比较和可视化层次化与平面化基因表达数据聚类之间关系的新算法。
Bioinformatics. 2005 Nov 1;21(21):3993-9. doi: 10.1093/bioinformatics/bti644. Epub 2005 Sep 1.
6
Divisive Correlation Clustering Algorithm (DCCA) for grouping of genes: detecting varying patterns in expression profiles.用于基因分组的分裂相关聚类算法(DCCA):检测表达谱中的变化模式。
Bioinformatics. 2008 Jun 1;24(11):1359-66. doi: 10.1093/bioinformatics/btn133. Epub 2008 Apr 10.
7
Improving cluster visualization in self-organizing maps: application in gene expression data analysis.改进自组织映射中的聚类可视化:在基因表达数据分析中的应用。
Comput Biol Med. 2007 Dec;37(12):1677-89. doi: 10.1016/j.compbiomed.2007.04.003. Epub 2007 Jun 4.
8
TA-clustering: cluster analysis of gene expression profiles through Temporal Abstractions.TA聚类:通过时间抽象对基因表达谱进行聚类分析。
Int J Med Inform. 2005 Aug;74(7-8):505-17. doi: 10.1016/j.ijmedinf.2005.03.014.
9
A mixture model with random-effects components for clustering correlated gene-expression profiles.一种具有随机效应成分的混合模型,用于对相关基因表达谱进行聚类。
Bioinformatics. 2006 Jul 15;22(14):1745-52. doi: 10.1093/bioinformatics/btl165. Epub 2006 May 3.
10
Analysis of a Gibbs sampler method for model-based clustering of gene expression data.一种基于模型的基因表达数据聚类的吉布斯采样器方法分析。
Bioinformatics. 2008 Jan 15;24(2):176-83. doi: 10.1093/bioinformatics/btm562. Epub 2007 Nov 22.

引用本文的文献

1
Comparative transcriptomic analysis of contrasting hybrid cultivars reveal key drought-responsive genes and metabolic pathways regulating drought stress tolerance in maize at various stages.对比分析不同杂交品种的转录组学研究揭示了调控玉米在不同阶段对干旱胁迫耐受性的关键抗旱响应基因和代谢途径。
PLoS One. 2020 Oct 15;15(10):e0240468. doi: 10.1371/journal.pone.0240468. eCollection 2020.
2
Pomegranate seed clustering by machine vision.基于机器视觉的石榴籽聚类
Food Sci Nutr. 2017 Nov 12;6(1):18-26. doi: 10.1002/fsn3.475. eCollection 2018 Jan.
3
Clustering of High Throughput Gene Expression Data.
高通量基因表达数据的聚类
Comput Oper Res. 2012 Dec;39(12):3046-3061. doi: 10.1016/j.cor.2012.03.008.