Skip to main content

Chroma Intra Prediction with attention-based CNN architectures

Marc Gorriz Blanch, Saverio Blasi, Alan Smeaton, Noel E. O'Connor, Marta Mrak

  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00
    Length: 11:11
26 Oct 2020

Neural networks can be used in video coding to improve chroma intra-prediction. In particular, the usage of fully-connected networks has enabled a better characterisation of the cross-component information, improving the prediction accuracy of related linear models. Nonetheless, such multi-layer perceptron architectures tend to disregard the locations of individual reference samples when predicting the chroma block components. This paper proposes a twofold network architecture based on a novel attention module which takes into account the spatial location of each neighbouring reference sample when computing the prediction. The proposed approach is integrated into the Versatile Video Coding (VVC) prediction pipeline. Experimental results demonstrate compression gains over the latest VVC anchor, as well as over previous chroma intra-prediction methods which are based on neural networks.

Value-Added Bundle(s) Including this Product

More Like This

  • SPS
    Members: $150.00
    IEEE Members: $250.00
    Non-members: $350.00
  • SPS
    Members: $150.00
    IEEE Members: $250.00
    Non-members: $350.00
  • SPS
    Members: $150.00
    IEEE Members: $250.00
    Non-members: $350.00