Liu Peizhong, Zhang Jiansong, Wu Xiuming, Liu Shunlan, Wang Yanli, Feng Longxiang, Diao Yong, Liu Zhonghua, Lyu Guorong, Chen Yongjian
IEEE J Biomed Health Inform. 2024 Mar 27;PP. doi: 10.1109/JBHI.2024.3382604.
Deep learning in ultrasound(US) imaging aims to construct foundational models that accurately reflect the modality's unique characteristics. Nevertheless, the limited datasets and narrow task types have restricted this field in recent years. To address these challenges, we introduce US-MTD120K, a multi-task ultrasound dataset with 120,354 real-world two-dimensional images. This dataset covers three standard plane recognition and two diagnostic tasks in ultrasound imaging, providing a rich basis for model training and evaluation. We detail the data collection, distribution, and labelling processes, ensuring a thorough understanding of the dataset's structure. Furthermore, we conduct extensive benchmark tests on 27 state-of-the-art methods from both supervised and self-supervised learning(SSL) perspectives. In the realm of supervised learning, we analyze the sensitivity of two main feature computation methods to ultrasound images at the representational level, highlighting that models which judiciously constrain global feature computation could potentially serve as a viable analytical approach for US image analysis. In the context of self-supervised learning, we delved into the modelling process of self-supervised learning models for medical images and proposed an improvement strategy, named MoCo-US, a solution that addresses the excessive reliance on pretext task design from the input side. It achieves competitive performance with minimal pretext task design and enhances other SSL methods simply. The dataset and the code will be available at https://github.com/JsongZhang/CDOA-for-UMTD.
超声(US)成像中的深度学习旨在构建能够准确反映该模态独特特征的基础模型。然而,近年来有限的数据集和狭窄的任务类型限制了这一领域的发展。为应对这些挑战,我们引入了US-MTD120K,这是一个包含120354张真实世界二维图像的多任务超声数据集。该数据集涵盖了超声成像中的三个标准平面识别和两个诊断任务,为模型训练和评估提供了丰富的基础。我们详细介绍了数据收集、分布和标注过程,以确保对数据集结构有全面的了解。此外,我们从监督学习和自监督学习(SSL)的角度对27种先进方法进行了广泛的基准测试。在监督学习领域,我们在表征层面分析了两种主要特征计算方法对超声图像的敏感性,强调明智地约束全局特征计算的模型可能是超声图像分析的一种可行分析方法。在自监督学习的背景下,我们深入研究了医学图像自监督学习模型的建模过程,并提出了一种改进策略,名为MoCo-US,该解决方案从输入端解决了对 pretext 任务设计过度依赖的问题。它以最小的 pretext 任务设计实现了有竞争力的性能,并简单地增强了其他SSL方法。数据集和代码将在https://github.com/JsongZhang/CDOA-for-UMTD上提供。