Maji Pradipta, Garai Partha
IEEE Trans Cybern. 2021 Jul;51(7):3641-3652. doi: 10.1109/TCYB.2019.2925130. Epub 2021 Jun 23.
One of the important issues in pattern recognition and machine learning is how to find natural groups present in a dataset. In this regard, this paper presents a novel clustering algorithm, called rough hypercuboid-based interval type-2 fuzzy c -means (RIT2FCM). It judiciously integrates the merits of the rough hypercuboid approach, c -means algorithm, and interval type-2 fuzzy set, to address the uncertainty associated with real-life datasets. Using the concept of hypercuboid equivalence partition matrix (HEM) of rough hypercuboid approach, the lower approximation and boundary region of each cluster are implicitly defined, without using any prespecified threshold parameter. The interval-valued fuzzifier is applied to address the uncertainty coupled with different parameters of rough-fuzzy clustering algorithms, where the determination of the appropriate value of fuzzifier is a difficult task. An analytical formulation on the convergence analysis of the proposed RIT2FCM algorithm, along with a theoretical bound of its fuzzifier, is also introduced. The efficacy of the proposed RIT2FCM method is extensively compared with that of several existing clustering algorithms, using some cluster validity and classification rate indices on various real-life datasets. The proposed algorithm performs better than the state-of-the-art c -means algorithms in 92.59% cases, with respect to different cluster validity indices, in lesser computation time.
模式识别和机器学习中的一个重要问题是如何在数据集中找到自然形成的组。在这方面,本文提出了一种新颖的聚类算法,称为基于粗糙超长方体的区间二型模糊c均值(RIT2FCM)。它明智地整合了粗糙超长方体方法、c均值算法和区间二型模糊集的优点,以解决与现实生活数据集相关的不确定性。利用粗糙超长方体方法的超长方体等价划分矩阵(HEM)概念,隐式定义每个聚类的下近似和边界区域,而无需使用任何预先指定的阈值参数。应用区间值模糊化器来处理与粗糙模糊聚类算法不同参数相关的不确定性,其中确定模糊化器的合适值是一项艰巨的任务。还介绍了所提出的RIT2FCM算法收敛性分析的解析公式及其模糊化器的理论界限。使用各种现实生活数据集上的一些聚类有效性和分类率指标,将所提出的RIT2FCM方法的有效性与几种现有聚类算法进行了广泛比较。在所提出的算法在92.59%的情况下,相对于不同的聚类有效性指标,在更短的计算时间内,比当前最先进的c均值算法表现更好。