Jonathan Joan, Sanga Camilius, Mwita Magesa, Mgode Georgies
Centre for Information and Communication Technology, Sokoine University of Agriculture, P.O Box 3218, Chuo Kikuu, Morogoro, Tanzania.
Sokoine National Agricultural Library (SNAL), Sokoine University of Agriculture, P.O.Box 3022, Morogoro, Tanzania.
Online J Public Health Inform. 2021 Sep 8;13(2):e12. doi: 10.5210/ojphi.v13i2.11465. eCollection 2021.
The diagnosis of tuberculosis (TB) disease remains a global challenge, and the need for innovative diagnostic approaches is inevitable. Trained African giant pouched rats are the scent TB detection technology for operational research. The adoption of this technology is beneficial to countries with a high TB burden due to its cost-effectiveness and speed than microscopy. However, rats with some factors perform better. Thus, more insights on factors that may affect performance is important to increase rats' TB detection performance. This paper intends to provide understanding on the factors that influence rats TB detection performance using visual analytics approach. Visual analytics provide insight of data through the combination of computational predictive models and interactive visualizations. Three algorithms such as Decision tree, Random Forest and Naive Bayes were used to predict the factors that influence rats TB detection performance. Hence, our study found that age is the most significant factor, and rats of ages between 3.1 to 6 years portrayed potentiality. The algorithms were validated using the same test data to check their prediction accuracy. The accuracy check showed that the random forest outperforms with an accuracy of 78.82% than the two. However, their accuracies difference is small. The study findings may help rats TB trainers, researchers in rats TB and Information systems, and decision makers to improve detection performance. This study recommends further research that incorporates gender factors and a large sample size.
结核病(TB)的诊断仍然是一项全球性挑战,因此不可避免地需要创新的诊断方法。经过训练的非洲巨囊鼠是用于操作研究的结核病气味检测技术。由于该技术具有成本效益且比显微镜检查速度更快,因此采用该技术对结核病负担较高的国家有益。然而,某些因素会使大鼠的表现更好。因此,深入了解可能影响大鼠表现的因素对于提高大鼠的结核病检测性能非常重要。本文旨在通过视觉分析方法,让人们了解影响大鼠结核病检测性能的因素。视觉分析通过结合计算预测模型和交互式可视化来洞察数据。使用决策树、随机森林和朴素贝叶斯三种算法来预测影响大鼠结核病检测性能的因素。因此,我们的研究发现年龄是最重要的因素,3.1至6岁的大鼠表现出了潜力。使用相同的测试数据对算法进行验证,以检查它们的预测准确性。准确性检查表明,随机森林的表现优于另外两种算法,准确率为78.82%。然而,它们的准确率差异很小。研究结果可能有助于大鼠结核病训练师、大鼠结核病及信息系统研究人员以及决策者提高检测性能。本研究建议进一步开展纳入性别因素和大样本量的研究。