Yang Jingdong, Jiang Biao, Qiu Zehao, Meng Yifei, Zhang Xiaolin, Yu Shaoqing, Dai Fu, Qian Yue
School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai, China.
School of Electronic and Information Engineering, Tongji University, Shanghai, China.
Comput Methods Biomech Biomed Engin. 2024 Apr 11:1-16. doi: 10.1080/10255842.2024.2339461.
Common clinical rhinitis is characterized by different types of cases and class imbalance. Its prediction belongs to multiple output classification. Low recognition rate and poor generalization performance often occur for minority class. Therefore, we propose a novel integrated classification model, ARF-OOBEE, which transforms the multi-output classification to multi-label classification and multi-class classification. The multi-label classifier automatically adjusts the number and depth of integrated forest learners according to the imbalance ratio of single class label in a subset. It can effectively reduce the impact of class imbalance on classification and improve prediction performance of both majority or minority class concurrently. Also, we build a multi-class classification based on out-of-bag Extra-Tree to accomplish finer classification for the predicted labels. In addition, we calculate the feature importance for rhinitis on the grounds of the purity of nodes in decision-making tree inside Random Forest and study the correlation between rhinitis features. We conduct 12 folds cross-validation experiments on 461 cases of clinical rhinitis. The outcomes show that the evaluation indicators of ARF-OOBEE, such as Sensitivity, Specificity, Accuracy, F1-Score, AUC, and G-Mean are 74.9%,86.5%,92.0%,78.3%,95.3%, and 79.9%, respectively. In comparison to the other methods, ARF-OOBEE has better evaluation indicator and is more effective for the early clinical diagnosis of rhinitis.
常见的临床鼻炎具有不同类型的病例和类别不均衡的特点。其预测属于多输出分类。对于少数类,往往存在识别率低和泛化性能差的问题。因此,我们提出了一种新颖的集成分类模型ARF - OOBEE,它将多输出分类转化为多标签分类和多类分类。多标签分类器根据子集中单个类标签的不均衡比率自动调整集成森林学习者的数量和深度。它可以有效减少类别不均衡对分类的影响,并同时提高多数类和少数类的预测性能。此外,我们基于袋外Extra - Tree构建多类分类,以对预测标签进行更精细的分类。另外,我们基于随机森林中决策树节点的纯度计算鼻炎的特征重要性,并研究鼻炎特征之间的相关性。我们对461例临床鼻炎病例进行了12折交叉验证实验。结果表明,ARF - OOBEE的评估指标,如灵敏度、特异性、准确率、F1分数、AUC和G - 均值分别为74.9%、86.5%、92.0%、78.3%、95.3%和79.9%。与其他方法相比,ARF - OOBEE具有更好的评估指标,对鼻炎的早期临床诊断更有效。