Barakat Nahla H, Barakat Sana H, Ahmed Nadia
Faculty of Informatics and Computer Science, The British University in Egypt, Cairo, Egypt.
Department of Pediatrics, Faculty of Medicine, Alexandria University, Alexandria, Egypt.
Healthc Inform Res. 2019 Jul;25(3):173-181. doi: 10.4258/hir.2019.25.3.173. Epub 2019 Jul 31.
The aim of this study is to develop an intelligent diagnostic system utilizing machine learning for data cleansing, then build an intelligent model and obtain new cutoff values for APRI (aspartate aminotransferase-to-platelet ratio) and FIB-4 (fibrosis score) for the prediction and staging of fibrosis in children with chronic hepatitis C (CHC).
Random forest (RF) was utilized in this study for data cleansing; then, prediction and staging of fibrosis, APRI and FIB-4 scores and their areas under the ROC curve (AUC) have been obtained on the cleaned dataset. A cohort of 166 Egyptian children with CHC was studied.
RF, APRI, and FIB-4 achieved high AUCs; where APRI had AUCs of 0.78, 0.816, and 0.77; FIB-4 had AUCs of 0.74, 0.828, and 0.78; and RF had AUCs of 0.903, 0.894, and 0.822, for the prediction of any type of fibrosis, advanced fibrosis, and differentiating between mild and advanced fibrosis, respectively.
Machine learning is a valuable addition to non-invasive methods of liver fibrosis prediction and staging in pediatrics. Furthermore, the obtained cutoff values for APRI and FIB-4 showed good performance and are consistent with some previously obtained cutoff values. There was some agreement between the predictions of RF, APRI and FIB-4 for the prediction and staging of fibrosis.
本研究旨在开发一种利用机器学习进行数据清理的智能诊断系统,然后建立一个智能模型,并获得用于预测和分期慢性丙型肝炎(CHC)儿童纤维化的天冬氨酸转氨酶与血小板比值(APRI)和FIB-4(纤维化评分)的新临界值。
本研究使用随机森林(RF)进行数据清理;然后,在清理后的数据集上获得纤维化的预测和分期、APRI和FIB-4评分及其ROC曲线下面积(AUC)。对166名埃及CHC儿童进行了队列研究。
RF、APRI和FIB-4的AUC值较高;其中,APRI预测任何类型纤维化、进展性纤维化以及区分轻度和进展性纤维化的AUC值分别为0.78、0.816和0.77;FIB-4的AUC值分别为0.74、0.828和0.78;RF的AUC值分别为0.903、0.894和0.822。
机器学习是儿科肝纤维化预测和分期非侵入性方法的重要补充。此外,获得的APRI和FIB-4临界值表现良好,与之前获得的一些临界值一致。RF、APRI和FIB-4在纤维化预测和分期方面的预测结果存在一定一致性。