Villmann Thomas, Schleif Frank-Michael, Kostrzewa Markus, Walch Axel, Hammer Barbara
Medical Department, University Leipzig Germany.
Brief Bioinform. 2008 Mar;9(2):129-43. doi: 10.1093/bib/bbn009. Epub 2008 Mar 11.
In the present contribution we propose two recently developed classification algorithms for the analysis of mass-spectrometric data-the supervised neural gas and the fuzzy-labeled self-organizing map. The algorithms are inherently regularizing, which is recommended, for these spectral data because of its high dimensionality and the sparseness for specific problems. The algorithms are both prototype-based such that the principle of characteristic representants is realized. This leads to an easy interpretation of the generated classifcation model. Further, the fuzzy-labeled self-organizing map is able to process uncertainty in data, and classification results can be obtained as fuzzy decisions. Moreover, this fuzzy classification together with the property of topographic mapping offers the possibility of class similarity detection, which can be used for class visualization. We demonstrate the power of both methods for two exemplary examples: the classification of bacteria (listeria types) and neoplastic and non-neoplastic cell populations in breast cancer tissue sections.
在本论文中,我们提出了两种最近开发的用于质谱数据分析的分类算法——监督神经气体算法和模糊标记自组织映射算法。由于这些光谱数据具有高维度和特定问题的稀疏性,这两种算法本质上是正则化的,这对于此类光谱数据是推荐使用的。这两种算法都是基于原型的,从而实现了特征代表原理。这使得生成的分类模型易于解释。此外,模糊标记自组织映射能够处理数据中的不确定性,并且可以作为模糊决策获得分类结果。此外,这种模糊分类与拓扑映射特性一起提供了类相似性检测的可能性,可用于类可视化。我们通过两个示例展示了这两种方法的强大功能:细菌(李斯特菌类型)的分类以及乳腺癌组织切片中肿瘤和非肿瘤细胞群体的分类。