Suppr
超能文献

联邦学习为罕见癌症边界检测提供大数据支持。

Federated learning enables big data for rare cancer boundary detection.

机构信息

Center for Biomedical Image Computing and Analytics (CBICA), University of Pennsylvania, Philadelphia, PA, USA.

Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.

出版信息

Nat Commun. 2022 Dec 5;13(1):7346. doi: 10.1038/s41467-022-33407-5.

DOI:10.1038/s41467-022-33407-5

PMID:36470898

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9722782/

Abstract

Although machine learning (ML) has shown promise across disciplines, out-of-sample generalizability is concerning. This is currently addressed by sharing multi-site data, but such centralization is challenging/infeasible to scale due to various limitations. Federated ML (FL) provides an alternative paradigm for accurate and generalizable ML, by only sharing numerical model updates. Here we present the largest FL study to-date, involving data from 71 sites across 6 continents, to generate an automatic tumor boundary detector for the rare disease of glioblastoma, reporting the largest such dataset in the literature (n = 6, 314). We demonstrate a 33% delineation improvement for the surgically targetable tumor, and 23% for the complete tumor extent, over a publicly trained model. We anticipate our study to: 1) enable more healthcare studies informed by large diverse data, ensuring meaningful results for rare diseases and underrepresented populations, 2) facilitate further analyses for glioblastoma by releasing our consensus model, and 3) demonstrate the FL effectiveness at such scale and task-complexity as a paradigm shift for multi-site collaborations, alleviating the need for data-sharing.

摘要

尽管机器学习（ML）在各个学科中都显示出了前景，但样本外泛化能力令人担忧。目前通过共享多站点数据来解决这一问题，但由于各种限制，这种集中化在规模上具有挑战性/不可行。联邦学习（FL）通过仅共享数值模型更新，为准确和可泛化的 ML 提供了另一种范例。在这里，我们展示了迄今为止最大的 FL 研究，涉及来自六大洲 71 个站点的数据，为罕见病胶质母细胞瘤生成自动肿瘤边界探测器，报告了文献中最大的此类数据集（n=6314）。我们证明了对于可手术靶向肿瘤，与公开训练的模型相比，可提高 33%的勾画精度，对于完整肿瘤范围，可提高 23%的勾画精度。我们预计我们的研究将：1）能够通过大数据进行更多的医疗保健研究，确保罕见病和代表性不足的人群获得有意义的结果；2）通过发布我们的共识模型，促进对胶质母细胞瘤的进一步分析；3）展示在这种规模和任务复杂性下的 FL 有效性，作为多站点合作的范式转变，减轻对数据共享的需求。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d322/9722782/53e36538f121/41467_2022_33407_Fig1_HTML.jpg

相似文献

Federated learning enables big data for rare cancer boundary detection.

Nat Commun. 2022 Dec 5;13(1):7346. doi: 10.1038/s41467-022-33407-5.

Federated Learning in Glaucoma: A Comprehensive Review and Future Perspectives.

Ophthalmol Glaucoma. 2025 Jan-Feb;8(1):92-105. doi: 10.1016/j.ogla.2024.08.004. Epub 2024 Aug 29.

Federated learning improves site performance in multicenter deep learning without data sharing.

J Am Med Inform Assoc. 2021 Jun 12;28(6):1259-1264. doi: 10.1093/jamia/ocaa341.

Preserving privacy in big data research: the role of federated learning in spine surgery.

Eur Spine J. 2024 Nov;33(11):4076-4081. doi: 10.1007/s00586-024-08172-2. Epub 2024 Feb 25.

Federated vs Local vs Central Deep Learning of Tooth Segmentation on Panoramic Radiographs.

J Dent. 2023 Aug;135:104556. doi: 10.1016/j.jdent.2023.104556. Epub 2023 May 18.

Methods and Impact for Using Federated Learning to Collaborate on Clinical Research.

Neurosurgery. 2023 Feb 1;92(2):431-438. doi: 10.1227/neu.0000000000002198. Epub 2022 Nov 8.

Privacy-by-Design with Federated Learning will drive future Rare Disease Research.

J Neuromuscul Dis. 2024 Dec 8:22143602241296276. doi: 10.1177/22143602241296276.

Federated learning with differential privacy for breast cancer diagnosis enabling secure data sharing and model integrity.

Sci Rep. 2025 Apr 16;15(1):13061. doi: 10.1038/s41598-025-95858-2.

Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data.

Sci Rep. 2020 Jul 28;10(1):12598. doi: 10.1038/s41598-020-69250-1.

PSA-FL-CDM: A Novel Federated Learning-Based Consensus Model for Post-Stroke Assessment.

Sensors (Basel). 2024 Aug 6;24(16):5095. doi: 10.3390/s24165095.

引用本文的文献

Integrating tumor location into artificial intelligence-based prognostic models in cancer.

World J Clin Oncol. 2025 Aug 24;16(8):109934. doi: 10.5306/wjco.v16.i8.109934.

Deep learning in chromatin organization: from super-resolution microscopy to clinical applications.

Cell Mol Life Sci. 2025 Aug 29;82(1):323. doi: 10.1007/s00018-025-05837-z.

Navigating real-world challenges: A case study on federated learning in computational pathology.

J Pathol Inform. 2025 Jul 23;18:100464. doi: 10.1016/j.jpi.2025.100464. eCollection 2025 Aug.

FedECA: federated external control arms for causal inference with time-to-event data in distributed settings.

Nat Commun. 2025 Aug 13;16(1):7496. doi: 10.1038/s41467-025-62525-z.

A scoping review of the governance of federated learning in healthcare.

NPJ Digit Med. 2025 Jul 10;8(1):427. doi: 10.1038/s41746-025-01836-3.

Towards fair decentralized benchmarking of healthcare AI algorithms with the Federated Tumor Segmentation (FeTS) challenge.

Nat Commun. 2025 Jul 8;16(1):6274. doi: 10.1038/s41467-025-60466-1.

Informatics at the Frontier of Cancer Research.

Cancer Res. 2025 Aug 15;85(16):2967-2986. doi: 10.1158/0008-5472.CAN-24-2829.

Revolutionizing gastroenterology and hepatology with artificial intelligence: From precision diagnosis to equitable healthcare through interdisciplinary practice.

World J Gastroenterol. 2025 Jun 28;31(24):108021. doi: 10.3748/wjg.v31.i24.108021.

Progress and challenges of artificial intelligence in lung cancer clinical translation.

NPJ Precis Oncol. 2025 Jul 1;9(1):210. doi: 10.1038/s41698-025-00986-7.

Federated target trial emulation using distributed observational data for treatment effect estimation.

NPJ Digit Med. 2025 Jul 1;8(1):387. doi: 10.1038/s41746-025-01803-y.

本文引用的文献

Decentralized federated learning through proxy model sharing.

Nat Commun. 2023 May 22;14(1):2899. doi: 10.1038/s41467-023-38569-4.

OpenFL: the open federated learning library.

Phys Med Biol. 2022 Oct 19;67(21):214001. doi: 10.1088/1361-6560/ac97d9.

The federated tumor segmentation (FeTS) tool: an open-source solution to further solid tumor research.

Phys Med Biol. 2022 Oct 12;67(20). doi: 10.1088/1361-6560/ac9449.

TMJOAI: An Artificial Web-Based Intelligence Tool for Early Diagnosis of the Temporomandibular Joint Osteoarthritis.

Clin Image Based Proced Distrib Collab Learn Artif Intell Combat COVID 19 Secur Priv Preserv Mach Learn (2021). 2021 Sep-Oct;12969:78-87. doi: 10.1007/978-3-030-90874-4_8. Epub 2021 Nov 14.

Federated learning for multi-center imaging diagnostics: a simulation study in cardiovascular disease.

Sci Rep. 2022 Mar 3;12(1):3551. doi: 10.1038/s41598-022-07186-4.

Federated learning and differential privacy for medical image analysis.

Sci Rep. 2022 Feb 4;12(1):1953. doi: 10.1038/s41598-022-05539-7.

Federated learning for computational pathology on gigapixel whole slide images.

Med Image Anal. 2022 Feb;76:102298. doi: 10.1016/j.media.2021.102298. Epub 2021 Nov 25.

Congress of Neurological Surgeons systematic review and evidence-based guidelines for the treatment of adults with progressive glioblastoma update: introduction and methods.

J Neurooncol. 2022 Jun;158(2):133-137. doi: 10.1007/s11060-021-03850-3. Epub 2021 Oct 25.

Federated learning for predicting clinical outcomes in patients with COVID-19.

Nat Med. 2021 Oct;27(10):1735-1743. doi: 10.1038/s41591-021-01506-3. Epub 2021 Sep 15.

TorchIO: A Python library for efficient loading, preprocessing, augmentation and patch-based sampling of medical images in deep learning.

Comput Methods Programs Biomed. 2021 Sep;208:106236. doi: 10.1016/j.cmpb.2021.106236. Epub 2021 Jun 17.