Department of EEE, Islamic University of Technology, Gazipur, Bangladesh.
Department of CSE, Islamic University of Technology, Gazipur, Bangladesh.
Comput Math Methods Med. 2022 Sep 25;2022:9391136. doi: 10.1155/2022/9391136. eCollection 2022.
Bone marrow transplant (BMT) is an effective surgical treatment for bone marrow-related disorders. However, several associated risk factors can impair long-term survival after BMT. Machine learning (ML) technologies have been proven useful in survival prediction of BMT receivers along with the influences that limit their resilience. In this study, an efficient classification model predicting the survival of children undergoing BMT is presented using a public dataset. Several supervised ML methods were investigated in this regard with an 80-20 train-test split ratio. To ensure prediction with minimal time and resources, only the top 11 out of the 59 dataset features were considered using Chi-square feature selection method. Furthermore, hyperparameter optimization (HPO) using the grid search cross-validation (GSCV) technique was adopted to increase the accuracy of prediction. Four experiments were conducted utilizing a combination of default and optimized hyperparameters on the original and reduced datasets. Our investigation revealed that the top 11 features of HPO had the same prediction accuracy (94.73%) as the entire dataset with default parameters, however, requiring minimal time and resources. Hence, the proposed approach may aid in the development of a computer-aided diagnostic system with satisfactory accuracy and minimal computation time by utilizing medical data records.
骨髓移植(BMT)是治疗骨髓相关疾病的有效手术方法。然而,一些相关的风险因素会影响 BMT 后的长期存活率。机器学习(ML)技术已被证明可用于预测 BMT 接受者的存活率,以及限制其恢复能力的影响因素。在这项研究中,使用公共数据集提出了一种有效的分类模型,用于预测儿童接受 BMT 的存活率。在这方面,研究了几种有监督的 ML 方法,并采用 80-20 的训练-测试分割比例。为了确保以最少的时间和资源进行预测,仅使用卡方特征选择方法从 59 个数据集特征中选择前 11 个特征。此外,采用网格搜索交叉验证(GSCV)技术进行超参数优化(HPO),以提高预测精度。在原始数据集和简化数据集上,利用默认和优化的超参数进行了四次实验。我们的研究表明,HPO 的前 11 个特征与默认参数下的整个数据集具有相同的预测准确性(94.73%),但所需的时间和资源更少。因此,通过利用医疗数据记录,该方法可能有助于开发具有令人满意的准确性和最小计算时间的计算机辅助诊断系统。