Department of Computer Science and Informatics, University of Energy and Natural Resources, Sunyani, Ghana.
PLoS One. 2024 Mar 15;19(3):e0300133. doi: 10.1371/journal.pone.0300133. eCollection 2024.
Convolutional Neural Networks (CNNs) are frequently used algorithms because of their propensity to learn relevant and hierarchical features through their feature extraction technique. However, the availability of enormous volumes of data in various variations is crucial for their performance. Capsule networks (CapsNets) perform well on a small amount of data but perform poorly on complex images. To address this, we proposed a new Capsule Network architecture called Tri Texton-Dense CapsNet (TTDCapsNet) for better complex and medical image classification. The TTDCapsNet is made up of three hierarchic blocks of Texton-Dense CapsNet (TDCapsNet) models. A single TDCapsNet is a CapsNet architecture composed of a texton detection layer to extract essential features, which are passed onto an eight-layered block of dense convolution that further extracts features, and then the output feature map is given as input to a Primary Capsule (PC), and then to a Class Capsule (CC) layer for classification. The resulting feature map from the first PC serves as input into the second-level TDCapsNet, and that from the second PC serves as input into the third-level TDCapsNet. The routing algorithm receives feature maps from each PC for the various CCs. Routing the concatenation of the three PCs creates an additional CC layer. All these four feature maps combined, help to achieve better classification. On fashion-MNIST, CIFAR-10, Breast Cancer, and Brain Tumor datasets, the proposed model is evaluated and achieved validation accuracies of 94.90%, 89.09%, 95.01%, and 97.71% respectively. Findings from this work indicate that TTDCapsNet outperforms the baseline and performs comparatively well with the state-of-the-art CapsNet models using different performance metrics. This work clarifies the viability of using Capsule Network on complex tasks in the real world. Thus, the proposed model can be used as an intelligent system, to help oncologists in diagnosing cancerous diseases and administering treatment required.
卷积神经网络 (CNNs) 因其通过特征提取技术学习相关和分层特征的倾向而被频繁使用。然而,大量不同变体的数据的可用性对其性能至关重要。胶囊网络 (CapsNets) 在少量数据上表现良好,但在复杂图像上表现不佳。为了解决这个问题,我们提出了一种新的胶囊网络架构,称为三纹理-密集胶囊网络 (TTDCapsNet),用于更好地进行复杂和医学图像分类。TTDCapsNet 由三个层次结构的纹理-密集胶囊网络 (TDCapsNet) 模型组成。单个 TDCapsNet 是一种由纹理检测层组成的胶囊网络架构,用于提取基本特征,这些特征被传递到一个由 8 层密集卷积组成的块中,以进一步提取特征,然后将输出特征图作为输入提供给主胶囊 (PC),然后提供给分类胶囊 (CC) 层进行分类。来自第一 PC 的生成特征图作为输入进入第二级 TDCapsNet,来自第二 PC 的生成特征图作为输入进入第三级 TDCapsNet。路由算法从每个 PC 接收用于各种 CC 的特征图。路由三个 PC 的串联生成一个附加的 CC 层。所有这四个特征图的组合有助于实现更好的分类。在时尚-MNIST、CIFAR-10、乳腺癌和脑肿瘤数据集上,对所提出的模型进行了评估,并分别实现了 94.90%、89.09%、95.01%和 97.71%的验证准确率。这项工作的结果表明,TTDCapsNet 优于基线,并且使用不同的性能指标与最先进的胶囊网络模型相比表现相当出色。这项工作阐明了在现实世界中的复杂任务中使用胶囊网络的可行性。因此,所提出的模型可以用作智能系统,帮助肿瘤学家诊断癌症疾病并提供所需的治疗。