• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SMiT:用于疾病严重程度检测的对称掩模转换器。

SMiT: symmetric mask transformer for disease severity detection.

机构信息

The College of Software, Xinjiang University, Urumqi, 830046, China.

The College of Information Science and Engineering, Xinjiang University, Urumqi, 830046, China.

出版信息

J Cancer Res Clin Oncol. 2023 Nov;149(17):16075-16086. doi: 10.1007/s00432-023-05223-x. Epub 2023 Sep 12.

DOI:10.1007/s00432-023-05223-x
PMID:37698681
Abstract

PURPOSE

The application of deep learning methods to the intelligent diagnosis of diseases has been the focus of intelligent medical research. When dealing with image classification tasks, if the lesion area is small and uneven, the background image involved in the training will affect the ultimate accuracy in determining the extent of the lesion. We did not follow the traditional approach of building an intelligent system to assist physicians in diagnosis from the perspective of CNN models, but instead proposed a pure transformer framework that can be used for diagnostic grading of pathological images.

METHODS

We propose a Symmetric Mask Pre-Training vision Transformer SMiT model for grading pathological cancer images. SMiT performs a symmetrically identical high probability sparsification of the input image token sequence at the first and last encoder layer positions to pre-train visual transformers, and the parameters of the baseline model are fine-tuned after loading the pre-training weights, allowing the model to concentrate more on extracting detailed features in the lesion region, effectively getting rid of the potential feature dependency problem.

RESULTS

SMiT achieved 92.8% classification accuracy on 4500 histopathological images of colorectal cancer processed by Gaussian filter denoising. We validated the effectiveness and generalizability of this study's methodology on the publicly available diabetic retinopathy dataset APTOS2019 from Kaggle and achieved quadratic Cohen Kappa, accuracy and F1-score of 91.9%, 86.91% and 72.85%, respectively, which were 1-2% better than previous studies based on CNN models.

CONCLUSION

SMiT uses a simpler strategy to achieve better results to assist physicians in making accurate clinical decisions, which can be an inspiration for making good use of the visual transformers in the field of medical imaging.

摘要

目的

深度学习方法在疾病智能诊断中的应用一直是智能医学研究的焦点。在处理图像分类任务时,如果病变区域小且不均匀,训练中涉及的背景图像会影响最终确定病变程度的准确性。我们没有从 CNN 模型的角度遵循传统的方法来构建智能系统以协助医生进行诊断,而是提出了一种纯粹的变压器框架,可用于对病理图像进行诊断分级。

方法

我们提出了一种用于分级病理癌症图像的对称掩模预训练视觉 Transformer SMiT 模型。SMiT 在第一个和最后一个编码器层位置对输入图像标记序列执行对称相同的高概率稀疏化,以预训练视觉 Transformer,并且在加载预训练权重后对基线模型的参数进行微调,使模型能够更专注于提取病变区域的详细特征,有效地摆脱潜在的特征依赖问题。

结果

SMiT 在经过高斯滤波降噪处理的 4500 张结直肠癌组织病理学图像上实现了 92.8%的分类准确率。我们在 Kaggle 上提供的公开可用的糖尿病视网膜病变数据集 APTOS2019 上验证了这项研究方法的有效性和通用性,分别达到了 91.9%、86.91%和 72.85%的二次科恩 Kappa、准确率和 F1 得分,比以前基于 CNN 模型的研究高出 1-2%。

结论

SMiT 采用更简单的策略来实现更好的结果,以协助医生做出准确的临床决策,这可为充分利用医学影像领域的视觉 Transformer 提供启示。

相似文献

1
SMiT: symmetric mask transformer for disease severity detection.SMiT:用于疾病严重程度检测的对称掩模转换器。
J Cancer Res Clin Oncol. 2023 Nov;149(17):16075-16086. doi: 10.1007/s00432-023-05223-x. Epub 2023 Sep 12.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
4
Sexual Harassment and Prevention Training性骚扰与预防培训
5
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
6
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
7
Short-Term Memory Impairment短期记忆障碍
8
Optical coherence tomography (OCT) for detection of macular oedema in patients with diabetic retinopathy.光学相干断层扫描(OCT)用于检测糖尿病视网膜病变患者的黄斑水肿。
Cochrane Database Syst Rev. 2015 Jan 7;1(1):CD008081. doi: 10.1002/14651858.CD008081.pub3.
9
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
10
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.

引用本文的文献

1
Research on grading detection methods for diabetic retinopathy based on deep learning.基于深度学习的糖尿病视网膜病变分级检测方法研究
Pak J Med Sci. 2025 Jan;41(1):225-229. doi: 10.12669/pjms.41.1.9171.
2
Discriminative, generative artificial intelligence, and foundation models in retina imaging.视网膜成像中的判别式、生成式人工智能及基础模型。
Taiwan J Ophthalmol. 2024 Nov 28;14(4):473-485. doi: 10.4103/tjo.TJO-D-24-00064. eCollection 2024 Oct-Dec.

本文引用的文献

1
Prediction and risk assessment of sepsis-associated encephalopathy in ICU based on interpretable machine learning.基于可解释机器学习的 ICU 相关性脓毒症脑病预测与风险评估。
Sci Rep. 2022 Dec 31;12(1):22621. doi: 10.1038/s41598-022-27134-6.
2
Transferability and interpretability of the sepsis prediction models in the intensive care unit.在重症监护病房中,脓毒症预测模型的可转移性和可解释性。
BMC Med Inform Decis Mak. 2022 Dec 29;22(1):343. doi: 10.1186/s12911-022-02090-3.
3
A novel X-Ray and γ-Ray combination strategy for potential dose escalation in patients with locally advanced pancreatic cancer.
一种用于局部晚期胰腺癌患者潜在剂量递增的新型X射线与γ射线联合策略。
Med Phys. 2023 Mar;50(3):1855-1864. doi: 10.1002/mp.16142. Epub 2022 Dec 29.
4
Detection of Breast Cancer with Lightweight Deep Neural Networks for Histology Image Classification.基于轻量级深度神经网络的组织学图像分类用于乳腺癌检测。
Crit Rev Biomed Eng. 2022;50(2):1-19. doi: 10.1615/CritRevBiomedEng.2022043417.
5
CIABNet: Category imbalance attention block network for the classification of multi-differentiated types of esophageal cancer.CIABNet:用于多分化类型食管癌分类的类别不平衡注意力块网络。
Med Phys. 2023 Mar;50(3):1507-1527. doi: 10.1002/mp.16067. Epub 2022 Nov 3.
6
HCCANet: histopathological image grading of colorectal cancer using CNN based on multichannel fusion attention mechanism.HCCANet:基于多通道融合注意力机制的 CNN 用于结直肠癌组织学图像分级。
Sci Rep. 2022 Sep 6;12(1):15103. doi: 10.1038/s41598-022-18879-1.
7
Classification of multi-differentiated liver cancer pathological images based on deep learning attention mechanism.基于深度学习注意力机制的多分化肝癌病理图像分类。
BMC Med Inform Decis Mak. 2022 Jul 4;22(1):176. doi: 10.1186/s12911-022-01919-1.
8
Non-melanoma skin cancer diagnosis: a comparison between dermoscopic and smartphone images by unified visual and sonification deep learning algorithms.非黑色素瘤皮肤癌诊断:基于统一视觉和声音化深度学习算法的皮肤镜和智能手机图像比较。
J Cancer Res Clin Oncol. 2022 Sep;148(9):2497-2505. doi: 10.1007/s00432-021-03809-x. Epub 2021 Sep 21.
9
Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.《全球癌症统计数据 2020:全球 185 个国家和地区 36 种癌症的发病率和死亡率估计》。
CA Cancer J Clin. 2021 May;71(3):209-249. doi: 10.3322/caac.21660. Epub 2021 Feb 4.
10
The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions.HAM10000 数据集,一个大型的常见色素性皮肤病变多源皮肤镜图像集合。
Sci Data. 2018 Aug 14;5:180161. doi: 10.1038/sdata.2018.161.