Pachetti Eva, Colantonio Sara
"Alessandro Faedo" Institute of Information Science and Technologies (ISTI), National Research Council of Italy (CNR), 56127 Pisa, Italy.
Department of Information Engineering (DII), University of Pisa, 56122 Pisa, Italy.
Bioengineering (Basel). 2023 Aug 28;10(9):1015. doi: 10.3390/bioengineering10091015.
Vision transformers represent the cutting edge of computer vision and are usually applied to two-dimensional data following a transfer-learning approach. In this work, we propose a trained-from-scratch stacking ensemble of 3D vision transformers to assess prostate cancer aggressiveness from T2-weighted images, helping radiologists diagnose this disease without performing a biopsy. We trained 18 3D vision transformers on T2-weighted axial acquisitions and combined them into two- and three-model stacking ensembles. We defined two metrics for measuring model prediction confidence, trained all the ensemble combinations under five-fold cross-validation, and evaluated their accuracy, prediction confidence, and calibration. In addition, we optimized the 18 base ViTs and compared the best-performing base and ensemble models by re-training them on 100 bootstrap resamples of the training set and evaluating each model on the hold-out test set. We compared the two resulting score distributions by computing the median and the 95% confidence interval and performing a Wilcoxon signed-rank test. The best-performing 3D vision-transformer stacking ensemble achieved state-of-the-art results in terms of area under the receiver operating characteristic curve (0.89 [0.61-1]) and exceeded the area under the precision-recall curve of the base model by 22% (p < 0.001). However, it was less confident in classifying the positive class.
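The abstract does not specify how the stacking ensemble combines its base models, so the following is only a minimal sketch, assuming a logistic-regression meta-learner fed with hypothetical out-of-fold probabilities from two base 3D ViTs and evaluated under the five-fold protocol the paper describes; all variable names and the synthetic data are illustrative, not the authors' implementation.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import StratifiedKFold

# Hypothetical out-of-fold probabilities from two base 3D ViTs on a
# binary aggressiveness task (synthetic placeholders, not the paper's data).
rng = np.random.default_rng(0)
n = 200
y = rng.integers(0, 2, n)                                # aggressiveness labels
p_vit_a = np.clip(0.6 * y + rng.normal(0.2, 0.2, n), 0, 1)
p_vit_b = np.clip(0.5 * y + rng.normal(0.25, 0.2, n), 0, 1)

# Meta-features: the base models' predicted probabilities, stacked per sample.
X_meta = np.column_stack([p_vit_a, p_vit_b])

# Five-fold cross-validation of the meta-learner, mirroring the paper's protocol.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
aucs = []
for train_idx, test_idx in cv.split(X_meta, y):
    meta = LogisticRegression().fit(X_meta[train_idx], y[train_idx])
    fold_probs = meta.predict_proba(X_meta[test_idx])[:, 1]
    aucs.append(roc_auc_score(y[test_idx], fold_probs))

print(f"ensemble AUROC per fold: {np.round(aucs, 3)}, mean: {np.mean(aucs):.3f}")
```

A logistic-regression combiner is a common default for stacking binary classifiers; the paper's actual meta-model and confidence metrics may differ.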
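The statistical comparison between the best base and ensemble models can likewise be sketched in isolation. Assuming 100 paired hold-out AUPRC scores from the bootstrap re-trainings (placeholder values below, not the paper's results), the sketch computes the median, a percentile 95% confidence interval, and the Wilcoxon signed-rank test via scipy.stats.wilcoxon.

```python
import numpy as np
from scipy.stats import wilcoxon

# Hypothetical paired test-set AUPRC scores from 100 bootstrap re-trainings
# of the best base ViT and the best stacking ensemble (synthetic placeholders).
rng = np.random.default_rng(1)
auprc_base = np.clip(rng.normal(0.65, 0.08, 100), 0, 1)
auprc_ensemble = np.clip(auprc_base + rng.normal(0.15, 0.05, 100), 0, 1)

def median_ci(scores, alpha=0.05):
    """Median and percentile 95% confidence interval of a score distribution."""
    lo, hi = np.percentile(scores, [100 * alpha / 2, 100 * (1 - alpha / 2)])
    return np.median(scores), (lo, hi)

for name, scores in [("base", auprc_base), ("ensemble", auprc_ensemble)]:
    m, (lo, hi) = median_ci(scores)
    print(f"{name}: median AUPRC {m:.2f} [{lo:.2f}-{hi:.2f}]")

# Paired non-parametric test on the 100 bootstrap score differences,
# as in the paper's base-vs-ensemble comparison.
stat, p = wilcoxon(auprc_ensemble, auprc_base)
print(f"Wilcoxon signed-rank: W={stat:.1f}, p={p:.2e}")
```

The Wilcoxon signed-rank test is the appropriate paired alternative to a t-test here because bootstrap score distributions are not guaranteed to be normal.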