IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2253-2258. doi: 10.1109/TNNLS.2017.2785233.
In this brief, we develop a deep reinforcement learning method to actively recognize objects by choosing a sequence of actions for an active camera that helps to discriminate between the objects. The method is realized using trust region policy optimization, in which the policy is realized by an extreme learning machine and, therefore, leads to efficient optimization algorithm. The experimental results on the publicly available data set show the advantages of the developed extreme trust region optimization method.
在这篇简短的文章中,我们开发了一种深度强化学习方法,通过为主动相机选择一系列动作来主动识别物体,从而帮助区分物体。该方法使用信任区域策略优化来实现,其中策略由极限学习机实现,因此导致高效的优化算法。在公开可用数据集上的实验结果表明了所开发的极限信任区域优化方法的优势。