Huang Quankeng, Jiang Wenchao, Li Junhang, Wen Jianxuan, He Ji, Song Wei
School of Computer Science and Technology, Guangdong University of Technology, 100 Outer Ring West Road, Panyu District, Guangzhou, 510006, Guangdong, China.
School of Computer Science and Technology, Guangdong University of Technology, 100 Outer Ring West Road, Panyu District, Guangzhou, 510006, Guangdong, China.
Med Image Anal. 2025 May;102:103515. doi: 10.1016/j.media.2025.103515. Epub 2025 Mar 2.
Ultrasound images and biological indicators, which reveal Hashimoto's thyroiditis (HT) characteristics in thyroid tissue from different perspectives, play crucial roles in HT recognition. Ultrasound images of patients with HT typically present a heterogeneous background with potential decreases in echogenicity. Clinicians are prone to misdiagnosing HT by visually evaluating these characteristics. In addition, patients with HT may exhibit fluctuations in relevant biological indicators, but there are no absolute relationships between a single biological indicator and HT. To address these challenges, we propose HTR-Net, a novel HT recognition network that combines ultrasound images and biological indicators through multi-modality information embedding. Specifically, HTR-Net introduces a global cross-attention module (GCA), which enhances recognition of the heterogeneous background with potential decreases in echogenicity. A distance-aware mismatched augmentation (DMA) strategy is also designed to expand the limited biological indicator data and ensure reasonable values for the augmented biological indicators, thus enhancing the model performance. In order to address the nonabsolute relationship between HT and a single biological indicator, we propose a distance-aware loss (DL) function to constrain feature mapping for effective information extraction from indicators, thereby enhancing the model's capability to detect anomalous sets of biological indicators. To validate the proposed method, we construct a multi-center HT dataset and conduct extensive experiments. The experimental results demonstrate that the proposed HTR-Net achieves state-of-the-art (SOTA) performance.
超声图像和生物学指标从不同角度揭示了桥本甲状腺炎(HT)在甲状腺组织中的特征,在HT的识别中起着关键作用。HT患者的超声图像通常呈现出不均匀的背景,回声可能降低。临床医生通过视觉评估这些特征容易误诊HT。此外,HT患者的相关生物学指标可能会出现波动,但单一生物学指标与HT之间没有绝对的关联。为应对这些挑战,我们提出了HTR-Net,这是一种新型的HT识别网络,通过多模态信息嵌入将超声图像和生物学指标结合起来。具体而言,HTR-Net引入了一个全局交叉注意力模块(GCA),它增强了对回声可能降低的不均匀背景的识别。还设计了一种距离感知失配增强(DMA)策略来扩展有限的生物学指标数据,并确保增强后的生物学指标值合理,从而提高模型性能。为了解决HT与单一生物学指标之间的非绝对关系,我们提出了一种距离感知损失(DL)函数来约束特征映射,以便从指标中有效提取信息,从而增强模型检测异常生物学指标集的能力。为了验证所提出的方法,我们构建了一个多中心HT数据集并进行了广泛的实验。实验结果表明,所提出的HTR-Net实现了当前最优(SOTA)性能。