Yotsu Rie, Ding Zhengming, Hamm Jihun, Blanton Ronald
Department of Tropical Medicine, Tulane University School of Public Health and Tropical Medicine, 1440 Canal Street, New Orleans, LA 70112.
Department of Computer Science, Tulane University School of Science and Engineering, 201 Lindy Claiborne Boggs Center, 6823 St. Charles Avenue, New Orleans, LA 70118.
medRxiv. 2023 Mar 15:2023.03.14.23287243. doi: 10.1101/2023.03.14.23287243.
Deep learning, which is a part of a broader concept of artificial intelligence (AI) and/or machine learning has achieved remarkable success in vision tasks. While there is growing interest in the use of this technology in diagnostic support for skin-related neglected tropical diseases (skin NTDs), there have been limited studies in this area and fewer focused on dark skin. In this study, we aimed to develop deep learning based AI models with clinical images we collected for five skin NTDs, namely, Buruli ulcer, leprosy, mycetoma, scabies, and yaws, to understand how diagnostic accuracy can or cannot be improved using different models and training patterns.
This study used photographs collected prospectively in Côte d'Ivoire and Ghana through our ongoing studies with use of digital health tools for clinical data documentation and for teledermatology. Our dataset included a total of 1,709 images from 506 patients. Two convolutional neural networks, ResNet-50 and VGG-16 models were adopted to examine the performance of different deep learning architectures and validate their feasibility in diagnosis of the targeted skin NTDs.
The two models were able to correctly predict over 70% of the diagnoses, and there was a consistent performance improvement with more training samples. The ResNet-50 model performed better than the VGG-16 model. Model trained with PCR confirmed cases of Buruli ulcer yielded 1-3% increase in prediction accuracy over training sets including unconfirmed cases.
Our approach was to have the deep learning model distinguish between multiple pathologies simultaneously - which is close to real-world practice. The more images used for training, the more accurate the diagnosis became. The percentages of correct diagnosis increased with PCR-positive cases of Buruli ulcer. This demonstrated that it may be better to input images from the more accurately diagnosed cases in the training models also for achieving better accuracy in the generated AI models. However, the increase was marginal which may be an indication that the accuracy of clinical diagnosis alone is reliable to an extent for Buruli ulcer. Diagnostic tests also have its flaws, and they are not always reliable. One hope for AI is that it will objectively resolve this gap between diagnostic tests and clinical diagnoses with addition of another tool. While there are still challenges to be overcome, there is a potential for AI to address the unmet needs where access to medical care is limited, like for those affected by skin NTDs.
The diagnosis of skin diseases depends in large part, though not exclusively on visual inspection. The diagnosis and management of these diseases is thus particularly amenable to teledermatology approaches. The widespread availability of cell phone technology and electronic information transfer provides new potential for access to health care in low-income countries, yet there are limited efforts targeting these neglected populations with dark skin and consequently limited availability of tools. In this study, we leveraged a collection of skin images gathered through a system of teledermatology in the West African countries of Côte d'Ivoire and Ghana, and applied deep learning, a form of artificial intelligence (AI) - to see if deep learning models can distinguish between different diseases and support their diagnosis. Skin-related neglected tropical diseases, or skin NTDs, prevail in these regions and were our target conditions: Buruli ulcer, leprosy, mycetoma, scabies, and yaws. The accuracy of prediction depended on the number of images that were fed into the model for training with marginal improvement using laboratory confirmed cases in training. Using more images and greater efforts in this area, it is possible that AI can help address the unmet needs where access to medical care is limited.
深度学习作为人工智能(AI)和/或机器学习这一更广泛概念的一部分,在视觉任务中取得了显著成功。尽管人们对将该技术用于皮肤相关被忽视热带病(皮肤NTDs)的诊断支持的兴趣与日俱增,但该领域的研究有限,且较少关注深色皮肤。在本研究中,我们旨在利用我们为五种皮肤NTDs(即布鲁里溃疡、麻风病、足菌肿、疥疮和雅司病)收集的临床图像开发基于深度学习的AI模型,以了解使用不同模型和训练模式时诊断准确性如何能够或无法得到提高。
本研究使用了通过我们正在进行的研究在前瞻性地在科特迪瓦和加纳收集的照片,这些研究使用数字健康工具进行临床数据记录和远程皮肤病学诊断。我们的数据集包括来自506名患者的总共1709张图像。采用了两个卷积神经网络,即ResNet - 50和VGG - 16模型,来检验不同深度学习架构的性能,并验证它们在诊断目标皮肤NTDs方面的可行性。
这两个模型能够正确预测超过70%的诊断结果,并且随着训练样本的增加,性能持续提高。ResNet - 50模型的表现优于VGG - 16模型。使用聚合酶链反应(PCR)确诊的布鲁里溃疡病例进行训练的模型,相较于包含未确诊病例的训练集,预测准确率提高了1% - 3%。
我们的方法是让深度学习模型同时区分多种病理情况——这接近实际临床实践。用于训练的图像越多,诊断就越准确。布鲁里溃疡PCR阳性病例的正确诊断百分比有所增加。这表明,为了在生成的AI模型中获得更高的准确性,在训练模型时输入来自诊断更准确病例的图像可能会更好。然而,这种增加幅度很小,这可能表明仅临床诊断的准确性在一定程度上对于布鲁里溃疡是可靠的。诊断测试也有其缺陷,并不总是可靠的。AI的一个希望是,它将通过增加另一种工具来客观地解决诊断测试与临床诊断之间的这一差距。尽管仍有挑战需要克服,但AI有潜力满足医疗服务获取受限地区(如受皮肤NTDs影响的人群)未得到满足的需求。
皮肤疾病的诊断在很大程度上(尽管并非完全)依赖于视觉检查。因此,这些疾病的诊断和管理特别适合采用远程皮肤病学方法。手机技术的广泛普及和电子信息传输为低收入国家提供了获取医疗保健的新潜力,但针对这些深色皮肤被忽视人群的努力有限,因此相关工具的可用性也有限。在本研究中,我们利用通过科特迪瓦和加纳这两个西非国家的远程皮肤病学系统收集的一系列皮肤图像,并应用深度学习(一种人工智能形式),以查看深度学习模型是否能够区分不同疾病并支持其诊断。皮肤相关被忽视热带病(皮肤NTDs)在这些地区普遍存在,是我们的目标疾病:布鲁里溃疡、麻风病、足菌肿、疥疮和雅司病。预测的准确性取决于输入模型进行训练的图像数量,在训练中使用实验室确诊病例时改进幅度很小。在这一领域使用更多图像并付出更大努力,AI有可能帮助满足医疗服务获取受限地区未得到满足的需求。