Suppr超能文献

MR[Formula: see text]WI 软组织肉瘤数据不平衡机器学习方法研究。

Research on imbalance machine learning methods for MR[Formula: see text]WI soft tissue sarcoma data.

机构信息

College of Computer Science and Technology, Qingdao University, Qingdao, 266071 China.

Department of Radiology, The Affiliated Hospital of Qingdao University, Qingdao, China.

出版信息

BMC Med Imaging. 2022 Aug 26;22(1):149. doi: 10.1186/s12880-022-00876-5.

Abstract

BACKGROUND

Soft tissue sarcoma is a rare and highly heterogeneous tumor in clinical practice. Pathological grading of the soft tissue sarcoma is a key factor in patient prognosis and treatment planning while the clinical data of soft tissue sarcoma are imbalanced. In this paper, we propose an effective solution to find the optimal imbalance machine learning model for predicting the classification of soft tissue sarcoma data.

METHODS

In this paper, a large number of features are first obtained based on [Formula: see text]WI images using the radiomics methods.Then, we explore the methods of feature selection, sampling and classification, get 17 imbalance machine learning models based on the above features and performed extensive experiments to classify imbalanced soft tissue sarcoma data. Meanwhile, we used another dataset splitting method as well, which could improve the classification performance and verify the validity of the models.

RESULTS

The experimental results show that the combination of extremely randomized trees (ERT) classification algorithm using SMOTETomek and the recursive feature elimination technique (RFE) performs best compared to other methods. The accuracy of RFE+STT+ERT is 81.57% , which is close to the accuracy of biopsy, and the accuracy is 95.69% when using another dataset splitting method.

CONCLUSION

Preoperative predicting pathological grade of soft tissue sarcoma in an accurate and noninvasive manner is essential. Our proposed machine learning method (RFE+STT+ERT) can make a positive contribution to solving the imbalanced data classification problem, which can favorably support the development of personalized treatment plans for soft tissue sarcoma patients.

摘要

背景

软组织肉瘤在临床实践中是一种罕见且高度异质的肿瘤。软组织肉瘤的病理分级是患者预后和治疗计划的关键因素,而软组织肉瘤的临床数据是不平衡的。在本文中,我们提出了一种有效的解决方案,以找到预测软组织肉瘤数据分类的最佳不平衡机器学习模型。

方法

在本文中,首先使用放射组学方法从 [Formula: see text]WI 图像中获得大量特征。然后,我们探索了特征选择、采样和分类的方法,基于上述特征得到了 17 个不平衡机器学习模型,并进行了广泛的实验来分类不平衡的软组织肉瘤数据。同时,我们还使用了另一种数据集划分方法,这可以提高分类性能并验证模型的有效性。

结果

实验结果表明,与其他方法相比,极端随机树(ERT)分类算法与 SMOTETomek 结合和递归特征消除技术(RFE)的组合表现最佳。RFE+STT+ERT 的准确率为 81.57%,接近活检的准确率,当使用另一种数据集划分方法时,准确率为 95.69%。

结论

准确、无创地预测软组织肉瘤的术前病理分级至关重要。我们提出的机器学习方法(RFE+STT+ERT)可以为解决不平衡数据分类问题做出积极贡献,从而有利于支持软组织肉瘤患者的个性化治疗计划的制定。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf47/9419389/11e907e31742/12880_2022_876_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验