舌象特征数据集构建与实时检测。

Tongue feature dataset construction and real-time detection.

机构信息

Graduate Institute of Chinese Medicine, School of Chinese Medicine, College of Chinese Medicine, China Medical University, Taichung, Taiwan, Republic of China.

Center for Artificial Intelligence in Medicine, Chang Gung Memorial Hospital, Taoyuan, Taiwan, Republic of China.

出版信息

PLoS One. 2024 Mar 7;19(3):e0296070. doi: 10.1371/journal.pone.0296070. eCollection 2024.

Abstract

BACKGROUND

Tongue diagnosis in traditional Chinese medicine (TCM) provides clinically important, objective evidence from direct observation of specific features that assist with diagnosis. However, the current interpretation of tongue features requires a significant amount of manpower and time. TCM physicians may have different interpretations of features displayed by the same tongue. An automated interpretation system that interprets tongue features would expedite the interpretation process and yield more consistent results.

MATERIALS AND METHODS

This study applied deep learning visualization to tongue diagnosis. After collecting tongue images and corresponding interpretation reports by TCM physicians in a single teaching hospital, various tongue features such as fissures, tooth marks, and different types of coatings were annotated manually with rectangles. These annotated data and images were used to train a deep learning object detection model. Upon completion of training, the position of each tongue feature was dynamically marked.

RESULTS

A large high-quality manually annotated tongue feature dataset was constructed and analyzed. A detection model was trained with average precision (AP) 47.67%, 58.94%, 71.25% and 59.78% for fissures, tooth marks, thick and yellow coatings, respectively. At over 40 frames per second on a NVIDIA GeForce GTX 1060, the model was capable of detecting tongue features from any viewpoint in real time.

CONCLUSIONS/SIGNIFICANCE: This study constructed a tongue feature dataset and trained a deep learning object detection model to locate tongue features in real time. The model provided interpretability and intuitiveness that are often lacking in general neural network models and implies good feasibility for clinical application.

摘要

背景

中医舌诊通过直接观察特定特征提供重要的临床客观依据,有助于诊断。然而,目前舌象特征的解读需要大量的人力和时间。不同的中医师可能对同一舌象有不同的解读。一个能够解读舌象特征的自动化解读系统可以加快解读过程并产生更一致的结果。

材料与方法

本研究将深度学习可视化应用于舌诊。在一家教学医院收集舌象和相应的中医医生解读报告后,手动用矩形框标注各种舌象特征,如裂纹、齿痕和不同类型的舌苔。这些标注数据和图像用于训练深度学习目标检测模型。训练完成后,动态标记每个舌象特征的位置。

结果

构建并分析了一个大型的高质量手动标注舌象特征数据集。该检测模型在裂纹、齿痕、厚黄苔等方面的平均精度(AP)分别为 47.67%、58.94%、71.25%和 59.78%。在 NVIDIA GeForce GTX 1060 上,该模型能够以每秒 40 帧以上的速度实时检测舌象特征。

结论/意义:本研究构建了一个舌象特征数据集,并训练了一个深度学习目标检测模型,实时定位舌象特征。该模型提供了通常在一般神经网络模型中缺乏的可解释性和直观性,这意味着它具有良好的临床应用可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f8/10919637/2e6ea01b32f3/pone.0296070.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索