Aillis, Inc., Yaesu Central Tower 7F, 2-2-1 Yaesu, Chuo-ku, Tokyo, 104-0028, Japan.
Division of General Internal Medicine and Health Service Research, David Geffen School of Medicine at UCLA, Los Angeles, CA, USA.
Sci Rep. 2024 Aug 2;14(1):17954. doi: 10.1038/s41598-024-68817-6.
The pharynx is one of the few areas in the body where blood vessels and immune tissues can readily be observed from outside the body non-invasively. Although prior studies have found that sex could be identified from retinal images using artificial intelligence, it remains unknown as to whether individuals' sex could also be identified using pharyngeal images. Demographic information and pharyngeal images were collected from patients who visited 64 primary care clinics in Japan for influenza-like symptoms. We trained a deep learning-based classification model to predict reported sex, which incorporated a multiple instance convolutional neural network, on 20,319 images from 51 clinics. Validation was performed using 4869 images from the remaining 13 clinics not used for the training. The performance of the classification model was assessed using the area under the receiver operating characteristic curve. To interpret the model, we proposed a framework that combines a saliency map and organ segmentation map to quantitatively evaluate salient regions. The model achieved the area under the receiver operating characteristic curve of 0.883 (95% CI 0.866-0.900). In subgroup analyses, a substantial improvement in classification performance was observed for individuals aged 20 and older, indicating that sex-specific patterns between women and men may manifest as humans age (e.g., may manifest after puberty). The saliency map suggested the model primarily focused on the posterior pharyngeal wall and the uvula. Our study revealed the potential utility of pharyngeal images by accurately identifying individuals' reported sex using deep learning algorithm.
咽是人体少数几个可以从体外非侵入性地观察到血管和免疫组织的区域之一。尽管先前的研究发现可以使用人工智能从视网膜图像中识别性别,但尚不清楚是否也可以使用咽图像识别个体的性别。从因流感样症状前往日本 64 家初级保健诊所就诊的患者中收集了人口统计学信息和咽图像。我们在 51 家诊所的 20319 张图像上训练了一个基于深度学习的分类模型,以预测报告的性别,该模型结合了多个实例卷积神经网络。使用其余 13 家未用于训练的诊所的 4869 张图像进行验证。使用接收器操作特征曲线下的面积评估分类模型的性能。为了解释模型,我们提出了一个结合显著图和器官分割图的框架,以定量评估显著区域。该模型在接收器操作特征曲线下的面积为 0.883(95%CI 0.866-0.900)。在亚组分析中,观察到 20 岁及以上个体的分类性能有了实质性提高,表明女性和男性之间的性别特异性模式可能随着年龄的增长而显现(例如,可能在青春期后显现)。显著图表明,该模型主要关注咽后壁和悬雍垂。我们的研究通过使用深度学习算法准确识别个体报告的性别,揭示了咽图像的潜在应用价值。