An Open Dataset For Video Coding For Machines Standardization

Wen Gao, Xiaozhong Xu, Matthew Qin, Shan Liu

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 00:14:00

07 Oct 2022

Unsupervised image segmentation is a challenge task, since a high-quality segmented image should perceive not only local object structures but also certain semantics without any annotations. in this paper, we propose a novel encoder-decoder pixel clustering framework with dual constraints to incorporate local structure and global semantic information for guiding pixel feature learning in a self-supervised manner. On one hand, a Local Structure Constraint (LStC) is constructed based on fine-grained superpixels, which improves the boundary perception of pixel features by keeping intra-superpixel feature consistency and largening inter-superpixel feature distance. On the other hand, a new Global Semantic Constraint (GSeC) is proposed via adapting the mutual information maximization technique to the single-image setting, and it strengthens the global semantic perception of pixel features and thus improves the segmenting integrity of objects. Finally, based on the learned pixel features, a smoothing component is employed to achieve semantically meaningful pixel clustering. The experimental evaluation on BSDS500 and PASCAL Context datasets show the superiority of our method on region and boundary qualities.

Tags:

International Conference on Image Processing

IEEE ICIP 2022

icip

An Open Dataset For Video Coding For Machines Standardization

Wen Gao, Xiaozhong Xu, Matthew Qin, Shan Liu

Value-Added Bundle(s) Including this Product

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

More Like This

Training Strategy For Limited Labeled Data By Learning From Confusion

Encoder Enabled Gan-Based Video Generators

Combining Non-Data-Adaptive Transforms For Oct Image Denoising By Iterative Basis Pursuit

Join the IEEE Signal Processing Society