Bian Zekang, Zhang Jin, Chung Fu-Lai, Wang Shitong
IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):10461-10474. doi: 10.1109/TNNLS.2023.3242049. Epub 2024 Aug 5.
Motivated by both the commonly used "from wholly coarse to locally fine" cognitive behavior and the recent finding that simple yet interpretable linear regression model should be a basic component of a classifier, a novel hybrid ensemble classifier called hybrid Takagi-Sugeno-Kang fuzzy classifier (H-TSK-FC) and its residual sketch learning (RSL) method are proposed. H-TSK-FC essentially shares the virtues of both deep and wide interpretable fuzzy classifiers and simultaneously has both feature-importance-based and linguistic-based interpretabilities. RSL method is featured as follows: 1) a global linear regression subclassifier on all original features of all training samples is generated quickly by the sparse representation-based linear regression subclassifier training procedure to identify/understand the importance of each feature and partition the output residuals of the incorrectly classified training samples into several residual sketches; 2) by using both the enhanced soft subspace clustering method (ESSC) for the linguistically interpretable antecedents of fuzzy rules and the least learning machine (LLM) for the consequents of fuzzy rules on residual sketches, several interpretable Takagi-Sugeno-Kang (TSK) fuzzy subclassifiers are stacked in parallel through residual sketches and accordingly generated to achieve local refinements; and 3) the final predictions are made to further enhance H-TSK-FC's generalization capability and decide which interpretable prediction route should be used by taking the minimal-distance-based priority for all the constructed subclassifiers. In contrast to existing deep or wide interpretable TSK fuzzy classifiers, benefiting from the use of feature-importance-based interpretability, H-TSK-FC has been experimentally witnessed to have faster running speed and better linguistic interpretability (i.e., fewer rules and/or TSK fuzzy subclassifiers and smaller model complexities) yet keep at least comparable generalization capability.
受常用的“从整体粗糙到局部精细”认知行为以及最近发现的简单且可解释的线性回归模型应作为分类器的基本组成部分的启发,提出了一种名为混合高木-菅野-康模糊分类器(H-TSK-FC)的新型混合集成分类器及其残差草图学习(RSL)方法。H-TSK-FC本质上兼具深度和广度可解释模糊分类器的优点,同时具有基于特征重要性和基于语言的可解释性。RSL方法的特点如下:1)通过基于稀疏表示的线性回归子分类器训练过程,快速生成关于所有训练样本的所有原始特征的全局线性回归子分类器,以识别/理解每个特征的重要性,并将错误分类的训练样本的输出残差划分为几个残差草图;2)通过对模糊规则的语言可解释前件使用增强软子空间聚类方法(ESSC),以及对残差草图上的模糊规则后件使用最小学习机(LLM),通过残差草图并行堆叠多个可解释的高木-菅野-康(TSK)模糊子分类器,并相应生成以实现局部细化;3)进行最终预测以进一步增强H-TSK-FC的泛化能力,并通过为所有构建的子分类器采用基于最小距离的优先级来决定应使用哪条可解释的预测路径。与现有的深度或广度可解释TSK模糊分类器相比,受益于基于特征重要性的可解释性的使用,实验证明H-TSK-FC具有更快的运行速度和更好的语言可解释性(即更少的规则和/或TSK模糊子分类器以及更小的模型复杂度),同时保持至少相当的泛化能力。