College of Systems Engineering, National University of Defense Technology, Changsha 410073, China.
Sensors (Basel). 2021 Oct 11;21(20):6747. doi: 10.3390/s21206747.
Hand pose estimation from RGB images has always been a difficult task, owing to the incompleteness of the depth information. Moon et al. improved the accuracy of hand pose estimation by using a new network, InterNet, through their unique design. Still, the network still has potential for improvement. Based on the architecture of MobileNet v3 and MoGA, we redesigned a feature extractor that introduced the latest achievements in the field of computer vision, such as the ACON activation function and the new attention mechanism module, etc. Using these modules effectively with our network, architecture can better extract global features from an RGB image of the hand, leading to a greater performance improvement compared to InterNet and other similar networks.
基于 MobileNet v3 和 MoGA 架构,我们重新设计了一个特征提取器,引入了计算机视觉领域的最新成果,如 ACON 激活函数和新的注意力机制模块等。在我们的网络中有效地使用这些模块,架构可以更好地从手部的 RGB 图像中提取全局特征,与 InterNet 和其他类似网络相比,性能得到了更大的提升。