Enhancing Multi-Step Action Prediction For Active Object Detection
Fen Fang, Qianli Xu, Nicolas Gauthier, Liyuan Li, Joo-Hwee Lim
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 00:07:01
View planning, i.e., predicting next best views, is a key technical component of active object detection. It has been shown that adopting multi-step action in a reinforcement learning setup can boost the efficiency and accuracy of active object detection. However, existing methods suffer from unstable detection outcome when one adopts a naive strategy to combine the Q-values of multiple branches of action advantages, namely, action range and action type. This is partially caused by the lack of independence of these branches. To tackle this issue, we propose a novel mechanism to disentangle action range from action type. We evaluate our method on two public dataset and compare it with competitive benchmarks. It is shown that our method facilitates substantial gain in view planning efficiency, while enhancing detection accuracy.