IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):4084-4095. doi: 10.1109/TNNLS.2021.3055816. Epub 2022 Aug 3.
Image ordinal estimation is to predict the ordinal label of a given image, which can be categorized as an ordinal regression (OR) problem. Recent methods formulate an OR problem as a series of binary classification problems. Such methods cannot ensure that the global ordinal relationship is preserved since the relationships among different binary classifiers are neglected. We propose a novel OR approach, termed convolutional OR forest (CORF), for image ordinal estimation, which can integrate OR and differentiable decision trees with a convolutional neural network for obtaining precise and stable global ordinal relationships. The advantages of the proposed CORF are twofold. First, instead of learning a series of binary classifiers independently, the proposed method aims at learning an ordinal distribution for OR by optimizing those binary classifiers simultaneously. Second, the differentiable decision trees in the proposed CORF can be trained together with the ordinal distribution in an end-to-end manner. The effectiveness of the proposed CORF is verified on two image ordinal estimation tasks, i.e., facial age estimation and image esthetic assessment, showing significant improvements and better stability over the state-of-the-art OR methods.
图像序次估计是指预测给定图像的序次标签,可以归类为序次回归 (OR) 问题。最近的方法将 OR 问题表示为一系列的二进制分类问题。由于忽略了不同二分类器之间的关系,这些方法不能确保全局序次关系得以保留。我们提出了一种新的 OR 方法,称为卷积 OR 森林 (CORF),用于图像序次估计,它可以将 OR 和可微决策树与卷积神经网络集成,以获得精确和稳定的全局序次关系。所提出的 CORF 的优势有两点。首先,与独立学习一系列二分类器不同,该方法旨在通过同时优化这些二分类器来学习 OR 的序次分布。其次,所提出的 CORF 中的可微决策树可以与序次分布一起端到端地训练。在所提出的 CORF 上,在两个图像序次估计任务,即面部年龄估计和图像美学评估上进行了有效性验证,显示出比最先进的 OR 方法有显著的改进和更好的稳定性。