• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于将任意 DICOM 结构集重新标记为 TG-263 定义标签的机器学习方法。

A Machine Learning method for relabeling arbitrary DICOM structure sets to TG-263 defined labels.

机构信息

Virginia Commonwealth University, Department of Radiation Oncology, Richmond, VA, United States of America; Virginia Commonwealth University, Department of Computer Science, Richmond, VA, United States of America; National Radiation Oncology Program, Department of Veteran Affairs, Richmond, VA, United States of America.

Virginia Commonwealth University, Department of Radiation Oncology, Richmond, VA, United States of America; National Radiation Oncology Program, Department of Veteran Affairs, Richmond, VA, United States of America.

出版信息

J Biomed Inform. 2020 Sep;109:103527. doi: 10.1016/j.jbi.2020.103527. Epub 2020 Aug 8.

DOI:10.1016/j.jbi.2020.103527
PMID:32777484
Abstract

PURPOSE

To present a Machine Learning pipeline for automatically relabeling anatomical structure sets in the Digital Imaging and Communications in Medicine (DICOM) format to a standard nomenclature that will enable data abstraction for research and quality improvement.

METHODS

DICOM structure sets from approximately 1200 lung and prostate cancer patients across 40 treatment centers were used to build predictive models to automate the relabeling of clinically specified structure labels to standardized labels as defined by the American Association of Physics in Medicine's (AAPM) Task Group 263 (TG-263). Volumetric bitmaps were created based on the delineated volumes and were combined with associated bony anatomy data to build feature vectors. Feature reduction was performed with singular value decomposition and the resulting vectors were used for predicting the label of each structure using five different classifier algorithms on the Apache Spark platform with 5-fold cross-validation. Undersampling methods were used to deal with underlying class imbalance that hindered the performance of classifiers. Experiments were performed on both a curated version of the data, which included only annotated structures, and the non-curated data that included all structures from the original treatment plans.

RESULTS

Random Forest provided the highest accuracies with F scores of 98.77 for lung and 95.06 for prostate on the curated data sets. Scores were lower with 95.67 for lung and 90.22 for prostate on the non-curated data sets, highlighting some of the challenges of classifying real clinical data. Including bony anatomy data and pooling information from all structures for the same patient both increased accuracies. In some cases, undersampling with k-Means clustering for class balancing improved classifier accuracy but in all experiments it significantly reduced run time compared to random undersampling.

CONCLUSION

This work shows that structure sets can be relabeled using our approach with accuracies over 95% for many structure types when presented with curated data. Although accuracies dropped when using the full non-curated data sets, some structure types were still correctly labeled over 90% of the time. With similar results obtained on an external test data set, we can infer that the proposed models are likely to work on other clinical data sets.

摘要

目的

提出一种机器学习管道,用于将数字成像和通信在医学(DICOM)格式中的解剖结构集自动重新标记为标准命名法,从而为研究和质量改进实现数据抽象。

方法

使用来自 40 个治疗中心的大约 1200 名肺癌和前列腺癌患者的 DICOM 结构集来构建预测模型,以自动将临床指定的结构标签重新标记为美国医学物理学家协会(AAPM)第 263 任务组(TG-263)定义的标准化标签。基于勾画的体积创建体绘制位图,并将其与相关的骨性解剖数据结合起来构建特征向量。使用奇异值分解进行特征降维,并在 Apache Spark 平台上使用五种不同的分类器算法对降维后的特征向量进行 5 折交叉验证,以预测每个结构的标签。使用欠采样方法处理底层的类不平衡问题,这会影响分类器的性能。实验在经过精心整理的数据(仅包含已注释的结构)和非精心整理的数据(包含原始治疗计划中的所有结构)上进行。

结果

随机森林在经过整理的数据上的肺和前列腺的 F 分数分别为 98.77 和 95.06,提供了最高的准确性。在非整理数据上的分数分别为 95.67 和 90.22,这突出了对真实临床数据进行分类的一些挑战。包含骨性解剖数据并汇集同一患者的所有结构信息都提高了准确性。在某些情况下,使用 k-Means 聚类进行类平衡的欠采样提高了分类器的准确性,但在所有实验中,与随机欠采样相比,它显著减少了运行时间。

结论

这项工作表明,当提供经过整理的数据时,使用我们的方法可以将结构集重新标记为超过 95%的准确率,对于许多结构类型。虽然在使用完整的非整理数据集时准确性有所下降,但某些结构类型仍然有超过 90%的时间被正确标记。在外部测试数据集上获得类似的结果后,我们可以推断出所提出的模型可能适用于其他临床数据集。

相似文献

1
A Machine Learning method for relabeling arbitrary DICOM structure sets to TG-263 defined labels.一种用于将任意 DICOM 结构集重新标记为 TG-263 定义标签的机器学习方法。
J Biomed Inform. 2020 Sep;109:103527. doi: 10.1016/j.jbi.2020.103527. Epub 2020 Aug 8.
2
Evaluation of data discretization methods to derive platform independent isoform expression signatures for multi-class tumor subtyping.评估数据离散化方法以推导用于多类肿瘤亚型分型的平台独立异构体表达特征。
BMC Genomics. 2015;16 Suppl 11(Suppl 11):S3. doi: 10.1186/1471-2164-16-S11-S3. Epub 2015 Nov 10.
3
Next-Generation Radiogenomics Sequencing for Prediction of EGFR and KRAS Mutation Status in NSCLC Patients Using Multimodal Imaging and Machine Learning Algorithms.使用多模态成像和机器学习算法的下一代放射基因组学测序预测非小细胞肺癌患者的EGFR和KRAS突变状态
Mol Imaging Biol. 2020 Aug;22(4):1132-1148. doi: 10.1007/s11307-020-01487-8.
4
GHOST: Adjusting the Decision Threshold to Handle Imbalanced Data in Machine Learning.调整决策阈值以处理机器学习中的不平衡数据。
J Chem Inf Model. 2021 Jun 28;61(6):2623-2640. doi: 10.1021/acs.jcim.1c00160. Epub 2021 Jun 8.
5
A learning-based material decomposition pipeline for multi-energy x-ray imaging.基于学习的多能量 X 射线成像材料分解管道。
Med Phys. 2019 Feb;46(2):689-703. doi: 10.1002/mp.13317. Epub 2018 Dec 24.
6
Comparison of machine learning algorithms to predict clinically significant prostate cancer of the peripheral zone with multiparametric MRI using clinical assessment categories and radiomic features.比较机器学习算法,使用临床评估类别和放射组学特征,预测多参数 MRI 下外周带具有临床意义的前列腺癌。
Eur Radiol. 2020 Dec;30(12):6757-6769. doi: 10.1007/s00330-020-07064-5. Epub 2020 Jul 16.
7
Integrated Natural Language Processing and Machine Learning Models for Standardizing Radiotherapy Structure Names.用于标准化放射治疗结构名称的集成自然语言处理和机器学习模型
Healthcare (Basel). 2020 Apr 30;8(2):120. doi: 10.3390/healthcare8020120.
8
Open RT Structures: A Solution for TG-263 Accessibility.开放型放疗结构:TG-263可及性的一种解决方案
Int J Radiat Oncol Biol Phys. 2024 Mar 1;118(3):859-863. doi: 10.1016/j.ijrobp.2023.09.041. Epub 2023 Sep 29.
9
Machine-learned cluster identification in high-dimensional data.高维数据中的机器学习聚类识别
J Biomed Inform. 2017 Feb;66:95-104. doi: 10.1016/j.jbi.2016.12.011. Epub 2016 Dec 28.
10
Soft Clustering for Enhancing the Diagnosis of Chronic Diseases over Machine Learning Algorithms.基于机器学习算法的软聚类在慢性病诊断中的应用。
J Healthc Eng. 2020 Mar 9;2020:4984967. doi: 10.1155/2020/4984967. eCollection 2020.

引用本文的文献

1
Improving Hospital Length of Stay Prediction through Heterogeneous Data Integration from MIMIC-III Records.通过整合MIMIC-III记录中的异构数据改进住院时间预测
Res Sq. 2025 Aug 26:rs.3.rs-6753896. doi: 10.21203/rs.3.rs-6753896/v1.
2
Harmonizing organ-at-risk structure names using open-source large language models.使用开源大语言模型协调危及器官结构名称
Phys Imaging Radiat Oncol. 2025 Jul 24;35:100813. doi: 10.1016/j.phro.2025.100813. eCollection 2025 Jul.
3
Perceptions on and roadblocks to implementation of standardized nomenclature in radiation oncology: A survey from TG-263U1.
放射肿瘤学中标准化命名法的认知和实施障碍:来自 TG-263U1 的调查。
J Appl Clin Med Phys. 2024 Jun;25(6):e14359. doi: 10.1002/acm2.14359. Epub 2024 Apr 30.
4
Machine Learning Meets Cancer.机器学习与癌症相遇。
Cancers (Basel). 2024 Mar 8;16(6):1100. doi: 10.3390/cancers16061100.
5
Artificial general intelligence for radiation oncology.用于放射肿瘤学的通用人工智能。
Meta Radiol. 2023 Nov;1(3). doi: 10.1016/j.metrad.2023.100045. Epub 2023 Nov 24.
6
Infrastructure tools to support an effective Radiation Oncology Learning Health System.支持有效的放射肿瘤学学习健康系统的基础设施工具。
J Appl Clin Med Phys. 2023 Oct;24(10):e14127. doi: 10.1002/acm2.14127. Epub 2023 Aug 25.
7
Deep learning-based classification of organs at risk and delineation guideline in pelvic cancer radiation therapy.基于深度学习的骨盆癌放射治疗中危险器官分类和勾画指南。
J Appl Clin Med Phys. 2023 Sep;24(9):e14022. doi: 10.1002/acm2.14022. Epub 2023 May 12.
8
Standardising Breast Radiotherapy Structure Naming Conventions: A Machine Learning Approach.标准化乳腺放射治疗结构命名规范:一种机器学习方法。
Cancers (Basel). 2023 Jan 17;15(3):564. doi: 10.3390/cancers15030564.
9
Deep learning-based classification and structure name standardization for organ at risk and target delineations in prostate cancer radiotherapy.基于深度学习的前列腺癌放射治疗中危及器官和靶区勾画的分类和结构名称标准化。
J Appl Clin Med Phys. 2021 Dec;22(12):51-63. doi: 10.1002/acm2.13446. Epub 2021 Oct 8.
10
Automated data abstraction for quality surveillance and outcome assessment in radiation oncology.自动化数据提取在放射肿瘤学中的质量监测和结果评估。
J Appl Clin Med Phys. 2021 Jul;22(7):177-187. doi: 10.1002/acm2.13308. Epub 2021 Jun 8.