Center for Research and Innovation, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand.
Department of Clinical Microscopy, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand.
BMC Med Inform Decis Mak. 2019 Nov 7;19(1):212. doi: 10.1186/s12911-019-0929-2.
The hypochromic microcytic anemia (HMA) commonly found in Thailand are iron deficiency anemia (IDA) and thalassemia trait (TT). Accurate discrimination between IDA and TT is an important issue and better methods are urgently needed. Although considerable RBC formulas and indices with various optimal cut-off values have been developed, distinguishing between IDA and TT is still a challenging problem due to the diversity of various anemic populations. To address this problem, it is desirable to develop an improved and automated prediction model for discriminating IDA from TT.
We retrospectively collected laboratory data of HMA found in Thai adults. Five machine learnings, including k-nearest neighbor (k-NN), decision tree, random forest (RF), artificial neural network (ANN) and support vector machine (SVM), were applied to construct a discriminant model. Performance was assessed and compared with thirteen existing discriminant formulas and indices.
The data of 186 patients (146 patients with TT and 40 with IDA) were enrolled. The interpretable rules derived from the RF model were proposed to demonstrate the combination of RBC indices for discriminating IDA from TT. A web-based tool 'ThalPred' was implemented using an SVM model based on seven RBC parameters. ThalPred achieved prediction results with an external accuracy, MCC and AUC of 95.59, 0.87 and 0.98, respectively.
ThalPred and an interpretable rule were provided for distinguishing IDA from TT. For the convenience of health care team experimental scientists, a web-based tool has been established at http://codes.bio/thalpred/ by which users can easily get their desired screening test result without the need to go through the underlying mathematical and computational details.
在泰国常见的低色素小红细胞性贫血(HMA)为缺铁性贫血(IDA)和地中海贫血特征(TT)。准确区分 IDA 和 TT 是一个重要问题,迫切需要更好的方法。尽管已经开发了具有各种最佳截断值的相当多的 RBC 公式和指数,但由于各种贫血人群的多样性,区分 IDA 和 TT 仍然是一个具有挑战性的问题。为了解决这个问题,最好开发一种改进的、自动化的预测模型,用于区分 IDA 和 TT。
我们回顾性收集了泰国成年人 HMA 的实验室数据。五种机器学习方法,包括 k-最近邻(k-NN)、决策树、随机森林(RF)、人工神经网络(ANN)和支持向量机(SVM),用于构建判别模型。评估了性能并与 13 种现有的判别公式和指数进行了比较。
共纳入 186 例患者(146 例 TT,40 例 IDA)的数据。从 RF 模型中提取出可解释的规则,用于展示用于区分 IDA 和 TT 的 RBC 指数组合。使用基于 7 个 RBC 参数的 SVM 模型实现了一个基于网络的工具 'ThalPred'。ThalPred 的外部准确性、MCC 和 AUC 分别为 95.59、0.87 和 0.98。
提供了 ThalPred 和一个可解释的规则,用于区分 IDA 和 TT。为了方便医疗保健团队的实验科学家,我们在 http://codes.bio/thalpred/ 上建立了一个基于网络的工具,用户可以通过该工具轻松获得他们所需的筛选测试结果,而无需了解底层的数学和计算细节。