Dilated Convolutional Neural Networks For Panoramic Image Saliency Prediction

Feng Dai, Youqiang Zhang, Hongliang Li, Yike Ma, Qiang Zhao

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 12:55

04 May 2020

Saliency prediction is an important way to understand humanâs behavior and has a wide range of applications. Although lots of algorithms have been designed to predict saliency for planar images, there are few works for 360Âº images. In this paper, we propose an encoder-decoder network for panoramic image saliency prediction. Dilated convolutional layers are deployed in the network, which can extract more representative features and improve the accuracy of saliency prediction. To deal with the image distortions in 360Âº images, our network takes cube map format as input and processes six faces of cube map simultaneously. Respecting the saliency distribution of ground truth, we also propose a new data augmentation method to train the network, which is validated to be helpful for performance improvement. Extensive experiments show that our method gives new state-of-the-art results on 360Âº image saliency prediction.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020