Khan Mohd Shoaib, Lohani Qm Danish
Department of Mathematics, South Asian University, New Delhi, 110021 India.
J Inequal Appl. 2017;2017(1):300. doi: 10.1186/s13660-017-1541-6. Epub 2017 Dec 6.
In the field of pattern recognition, clustering groups the data into different clusters on the basis of similarity among them. Many a time, the similarity level between data points is derived through a distance measure; so, a number of clustering techniques reliant on such a measure are developed. Clustering algorithms are modified by employing an appropriate distance measure due to the high versatility of a data set. The distance measure becomes appropriate in clustering algorithm if weights assigned at the components of the distance measure are in concurrence to the problem. In this paper, we propose a new sequence space [Formula: see text] related to [Formula: see text] using an Orlicz function. Many interesting properties of the sequence space [Formula: see text] are established by the help of a distance measure, which is also used to modify the -means clustering algorithm. To show the efficacy of the modified -means clustering algorithm over the standard -means clustering algorithm, we have implemented them for two real-world data set, viz. a two-moon data set and a path-based data set (borrowed from the UCI repository). The clustering accuracy obtained by our proposed clustering algoritm outperformes the standard -means clustering algorithm.
在模式识别领域,聚类是根据数据之间的相似性将其分组到不同的簇中。很多时候,数据点之间的相似性水平是通过距离度量得出的;因此,开发了许多依赖这种度量的聚类技术。由于数据集具有高度的通用性,通过采用适当的距离度量来修改聚类算法。如果在距离度量的组件上分配的权重与问题一致,那么该距离度量在聚类算法中就会变得合适。在本文中,我们使用一个奥利奇函数提出了一个与[公式:见原文]相关的新序列空间[公式:见原文]。借助一个距离度量建立了序列空间[公式:见原文]的许多有趣性质,该距离度量也用于修改K均值聚类算法。为了展示改进后的K均值聚类算法相对于标准K均值聚类算法的有效性,我们针对两个真实世界的数据集实现了它们,即一个双月数据集和一个基于路径的数据集(从UCI存储库借用)。我们提出的聚类算法获得的聚类准确率优于标准K均值聚类算法。