ACTIVE PERCEPTION SYSTEM FOR ENHANCED VISUAL SIGNAL RECOVERY USING DEEP REINFORCEMENT LEARNING
Gaurav Chaudhary (Indian Institute of Technology Kanpur, India); Prof Laxmidhar Behera (IIT Kanpur); Tushar Sandhan (Indian Institute of Technology Kanpur)
-
SPS
IEEE Members: $11.00
Non-members: $15.00
Deep neural networks have demonstrated excellent object detection
and segmentation performance from RGB data. However,
these models can only recognize and predict segmentation
masks with great accuracy when RGB data have sufficient
information about the objects of interest. In this paper,
we suggest an intelligent, active perception system that
can adjust its 3D position to improve signal acquisition. The
segmentation of cluttered scene is improved a lot due to this
proposed system, which also enhances grasp pose detection
for the robotic manipulator. The ResNet-50 backbone of the
proposed perception system is initialized using pre-trained
weights to extract a latent state from an RGB image of the
cluttered scene. A Reinforcement Learning (RL) agent uses
these retrieved states to reposition the visual perception system
for enhancement of the underlying computer vision tasks
such as segmentation of the cluttered scene. Our trained RL
agent can anticipate the better position of the visual perception
system, which ensures enhanced signal recovery. The
effectiveness of the proposed approach is tested in a pybullet
simulation environment, as well as validated with contemporary
deep learning based segmentation methods.