文献检索，用中文搜 PubMed

Digital pathology poses unique computational challenges, as a standard gigapixel slide may comprise tens of thousands of image tiles. Prior models have often resorted to subsampling a small portion of tiles for each slide, thus missing the important slide-level context. Here we present Prov-GigaPath, a whole-slide pathology foundation model pretrained on 1.3 billion 256 × 256 pathology image tiles in 171,189 whole slides from Providence, a large US health network comprising 28 cancer centres. The slides originated from more than 30,000 patients covering 31 major tissue types. To pretrain Prov-GigaPath, we propose GigaPath, a novel vision transformer architecture for pretraining gigapixel pathology slides. To scale GigaPath for slide-level learning with tens of thousands of image tiles, GigaPath adapts the newly developed LongNet method to digital pathology. To evaluate Prov-GigaPath, we construct a digital pathology benchmark comprising 9 cancer subtyping tasks and 17 pathomics tasks, using both Providence and TCGA data. With large-scale pretraining and ultra-large-context modelling, Prov-GigaPath attains state-of-the-art performance on 25 out of 26 tasks, with significant improvement over the second-best method on 18 tasks. We further demonstrate the potential of Prov-GigaPath on vision-language pretraining for pathology by incorporating the pathology reports. In sum, Prov-GigaPath is an open-weight foundation model that achieves state-of-the-art performance on various digital pathology tasks, demonstrating the importance of real-world data and whole-slide modelling.

数字病理学带来了独特的计算挑战，因为一个标准的千兆像素幻灯片可能包含成千上万张图像块。以前的模型通常会对每张幻灯片的一小部分图像块进行子采样，从而丢失了重要的幻灯片级上下文。在这里，我们提出了 Prov-GigaPath，这是一个在 171189 张来自普罗维登斯的全幻灯片上，用 13 亿个 256×256 病理图像块进行预训练的全幻灯片病理基础模型，普罗维登斯是一个大型美国健康网络，包括 28 个癌症中心。这些幻灯片来自超过 30000 名患者，涵盖 31 种主要组织类型。为了预训练 Prov-GigaPath，我们提出了 GigaPath，这是一种用于预训练千兆像素病理幻灯片的新型视觉转换器架构。为了在具有成千上万张图像块的幻灯片级别上扩展 GigaPath 的学习能力，GigaPath 采用了新开发的 LongNet 方法来适应数字病理学。为了评估 Prov-GigaPath，我们使用普罗维登斯和 TCGA 数据构建了一个包含 9 个癌症亚型任务和 17 个病理组学任务的数字病理学基准。通过大规模预训练和超大上下文建模，Prov-GigaPath 在 26 个任务中的 25 个任务上达到了最先进的性能，在 18 个任务上比第二好的方法有显著的改进。我们进一步通过整合病理报告，展示了 Prov-GigaPath 在病理视觉语言预训练方面的潜力。总之，Prov-GigaPath 是一个开放权重的基础模型，在各种数字病理学任务上都达到了最先进的性能，证明了真实世界数据和全幻灯片建模的重要性。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

基于真实世界数据的全幻灯片数字病理学基础模型。

A whole-slide foundation model for digital pathology from real-world data.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献