• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HarmonyTM:应用于帕金森病分类分布式学习的多中心数据协调

HarmonyTM: multi-center data harmonization applied to distributed learning for Parkinson's disease classification.

作者信息

Souza Raissa, Stanley Emma A M, Gulve Vedant, Moore Jasmine, Kang Chris, Camicioli Richard, Monchi Oury, Ismail Zahinoor, Wilms Matthias, Forkert Nils D

机构信息

University of Calgary, Department of Radiology, Cumming School of Medicine, Calgary, Alberta, Canada.

University of Calgary, Hotchkiss Brain Institute, Calgary, Alberta, Canada.

出版信息

J Med Imaging (Bellingham). 2024 Sep;11(5):054502. doi: 10.1117/1.JMI.11.5.054502. Epub 2024 Sep 20.

DOI:10.1117/1.JMI.11.5.054502
PMID:39308760
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11413651/
Abstract

PURPOSE

Distributed learning is widely used to comply with data-sharing regulations and access diverse datasets for training machine learning (ML) models. The traveling model (TM) is a distributed learning approach that sequentially trains with data from one center at a time, which is especially advantageous when dealing with limited local datasets. However, a critical concern emerges when centers utilize different scanners for data acquisition, which could potentially lead models to exploit these differences as shortcuts. Although data harmonization can mitigate this issue, current methods typically rely on large or paired datasets, which can be impractical to obtain in distributed setups.

APPROACH

We introduced HarmonyTM, a data harmonization method tailored for the TM. HarmonyTM effectively mitigates bias in the model's feature representation while retaining crucial disease-related information, all without requiring extensive datasets. Specifically, we employed adversarial training to "unlearn" bias from the features used in the model for classifying Parkinson's disease (PD). We evaluated HarmonyTM using multi-center three-dimensional (3D) neuroimaging datasets from 83 centers using 23 different scanners.

RESULTS

Our results show that HarmonyTM improved PD classification accuracy from 72% to 76% and reduced (unwanted) scanner classification accuracy from 53% to 30% in the TM setup.

CONCLUSION

HarmonyTM is a method tailored for harmonizing 3D neuroimaging data within the TM approach, aiming to minimize shortcut learning in distributed setups. This prevents the disease classifier from leveraging scanner-specific details to classify patients with or without PD-a key aspect for deploying ML models for clinical applications.

摘要

目的

分布式学习被广泛用于遵守数据共享规定并访问多样化的数据集以训练机器学习(ML)模型。移动模型(TM)是一种分布式学习方法,它一次从一个中心的数据依次进行训练,在处理有限的本地数据集时特别有利。然而,当各个中心使用不同的扫描仪进行数据采集时,就会出现一个关键问题,这可能会导致模型将这些差异作为捷径来利用。尽管数据协调可以缓解这个问题,但当前的方法通常依赖于大型或配对的数据集,在分布式设置中获取这些数据集可能不切实际。

方法

我们引入了HarmonyTM,这是一种专为TM量身定制的数据协调方法。HarmonyTM有效地减轻了模型特征表示中的偏差,同时保留了关键的疾病相关信息,而且无需大量数据集。具体来说,我们采用对抗训练从用于对帕金森病(PD)进行分类的模型所使用的特征中“去除”偏差。我们使用来自83个中心、使用23种不同扫描仪的多中心三维(3D)神经影像数据集对HarmonyTM进行了评估。

结果

我们的结果表明,在TM设置中,HarmonyTM将PD分类准确率从72%提高到了76%,并将(不需要的)扫描仪分类准确率从53%降低到了30%。

结论

HarmonyTM是一种专为在TM方法中协调3D神经影像数据而量身定制的方法,旨在最大限度地减少分布式设置中的捷径学习。这可以防止疾病分类器利用特定于扫描仪的细节来对患有或未患有PD的患者进行分类,这是将ML模型应用于临床的一个关键方面。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d33/11413651/c95005b89c47/JMI-011-054502-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d33/11413651/8d32cb64772a/JMI-011-054502-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d33/11413651/7845807b38fb/JMI-011-054502-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d33/11413651/99ff6ee9e8ec/JMI-011-054502-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d33/11413651/c95005b89c47/JMI-011-054502-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d33/11413651/8d32cb64772a/JMI-011-054502-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d33/11413651/7845807b38fb/JMI-011-054502-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d33/11413651/99ff6ee9e8ec/JMI-011-054502-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d33/11413651/c95005b89c47/JMI-011-054502-g004.jpg

相似文献

1
HarmonyTM: multi-center data harmonization applied to distributed learning for Parkinson's disease classification.HarmonyTM:应用于帕金森病分类分布式学习的多中心数据协调
J Med Imaging (Bellingham). 2024 Sep;11(5):054502. doi: 10.1117/1.JMI.11.5.054502. Epub 2024 Sep 20.
2
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
5
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
8
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
9
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
10
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.

本文引用的文献

1
A multi-center distributed learning approach for Parkinson's disease classification using the traveling model paradigm.一种使用移动模型范式进行帕金森病分类的多中心分布式学习方法。
Front Artif Intell. 2024 Feb 7;7:1301997. doi: 10.3389/frai.2024.1301997. eCollection 2024.
2
Efficacy of MRI data harmonization in the age of machine learning: a multicenter study across 36 datasets.基于机器学习的 MRI 数据调和功效:36 个数据集的多中心研究。
Sci Data. 2024 Jan 23;11(1):115. doi: 10.1038/s41597-023-02421-7.
3
Identifying Biases in a Multicenter MRI Database for Parkinson's Disease Classification: Is the Disease Classifier a Secret Site Classifier?
多中心 MRI 数据库中帕金森病分类的偏倚识别:疾病分类器是否为秘密站点分类器?
IEEE J Biomed Health Inform. 2024 Apr;28(4):2047-2054. doi: 10.1109/JBHI.2024.3352513. Epub 2024 Apr 4.
4
Enhancing domain generalization in the AI-based analysis of chest radiographs with federated learning.基于联邦学习的人工智能分析胸片中的领域泛化增强。
Sci Rep. 2023 Dec 19;13(1):22576. doi: 10.1038/s41598-023-49956-8.
5
Image-encoded biological and non-biological variables may be used as shortcuts in deep learning models trained on multisite neuroimaging data.基于多中心神经影像数据训练的深度学习模型中,可以将图像编码的生物学和非生物学变量用作捷径。
J Am Med Inform Assoc. 2023 Nov 17;30(12):1925-1933. doi: 10.1093/jamia/ocad171.
6
Image harmonization: A review of statistical and deep learning methods for removing batch effects and evaluation metrics for effective harmonization.图像调和:去除批次效应的统计和深度学习方法综述,以及有效调和的评价指标。
Neuroimage. 2023 Jul 1;274:120125. doi: 10.1016/j.neuroimage.2023.120125. Epub 2023 Apr 20.
7
Explainable classification of Parkinson's disease using deep learning trained on a large multi-center database of T1-weighted MRI datasets.利用基于 T1 加权 MRI 数据集的大型多中心数据库训练的深度学习对帕金森病进行可解释分类。
Neuroimage Clin. 2023;38:103405. doi: 10.1016/j.nicl.2023.103405. Epub 2023 Apr 17.
8
Algorithmic encoding of protected characteristics in chest X-ray disease detection models.算法对胸部 X 射线疾病检测模型中受保护特征的编码。
EBioMedicine. 2023 Mar;89:104467. doi: 10.1016/j.ebiom.2023.104467. Epub 2023 Feb 13.
9
An analysis of the effects of limited training data in distributed learning scenarios for brain age prediction.分析在大脑年龄预测的分布式学习场景中,有限训练数据的影响。
J Am Med Inform Assoc. 2022 Dec 13;30(1):112-119. doi: 10.1093/jamia/ocac204.
10
Multi-site harmonization of MRI data uncovers machine-learning discrimination capability in barely separable populations: An example from the ABIDE dataset.多站点磁共振成像数据的协调揭示了在几乎无法分离的人群中机器学习的区分能力:来自 ABIDE 数据集的一个例子。
Neuroimage Clin. 2022;35:103082. doi: 10.1016/j.nicl.2022.103082. Epub 2022 Jun 8.