Przybyła-Kasperek Małgorzata, Marfo Kwabena Frimpong
Institute of Computer Science, University of Silesia in Katowice, Sosnowiec, Poland.
PLoS One. 2024 Dec 2;19(12):e0311041. doi: 10.1371/journal.pone.0311041. eCollection 2024.
The paper introduces a novel approach for constructing a global model utilizing multilayer perceptron (MLP) neural networks and dispersed data sources. These dispersed data are independently gathered in various local tables, each potentially containing different objects and attributes, albeit with some shared elements (objects and attributes). Our approach involves the development of local models based on these local tables imputed with some artificial objects. Subsequently, local models are aggregated using weighted techniques. To complete, the global model is retrained using some global objects. In this study, the proposed method is compared with two existing approaches from the literature-homogeneous and heterogeneous multi-model classifiers. The analysis reveals that the proposed approach consistently outperforms these existing methods across multiple evaluation criteria including classification accuracy, balanced accuracy, F1-score, and precision. The results demonstrate that the proposed method significantly outperforms traditional ensemble classifiers and homogeneous ensembles of MLPs. Specifically, the proposed approach achieves an average classification accuracy improvement of 15% and a balanced accuracy enhancement of 12% over the baseline methods mentioned above. Moreover, in practical applications such as healthcare and smart agriculture, the model showcases superior properties by providing a single model that is easier to use and interpret. These improvements underscore the model's robustness and adaptability, making it a valuable tool for diverse real-world applications.
本文介绍了一种利用多层感知器(MLP)神经网络和分散数据源构建全局模型的新方法。这些分散的数据独立收集在各种本地表中,每个本地表可能包含不同的对象和属性,尽管有一些共享元素(对象和属性)。我们的方法包括基于这些填充了一些人工对象的本地表开发局部模型。随后,使用加权技术聚合局部模型。最后,使用一些全局对象对全局模型进行重新训练。在本研究中,将所提出的方法与文献中的两种现有方法——同质和异质多模型分类器进行了比较。分析表明,在所提出的方法在包括分类准确率、平衡准确率、F1分数和精确率在内的多个评估标准上始终优于这些现有方法。结果表明,所提出的方法显著优于传统的集成分类器和MLP的同质集成。具体而言,与上述基线方法相比,所提出的方法平均分类准确率提高了15%,平衡准确率提高了12%。此外,在医疗保健和智能农业等实际应用中,该模型通过提供一个更易于使用和解释的单一模型展示了优越的性能。这些改进突出了该模型的稳健性和适应性,使其成为各种实际应用中的宝贵工具。