SMOOTH AND STEPWISE SELF-DISTILLATION FOR OBJECT DETECTION

Jieren Deng, Xin Zhou, Hao Tian, Zhihong Pan, Derek Aguiar

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Poster 09 Oct 2023

Distilling the structured information captured in feature maps has contributed to improved results for object detection tasks, but requires careful selection of baseline architectures and substantial pre-training. Self-distillation addresses these limitations and has recently achieved state-of-the-art performance for object detection despite making several simplifying architectural assumptions. Building on this work, we propose Smooth and Stepwise Self-Distillation (SSSD) for object detection. Our SSSD architecture forms an implicit teacher from object labels and a feature pyramid network backbone to distill label-annotated feature maps using Jensen-Shannon distance, which is smoother than distillation losses used in prior work. We additionally add a distillation coefficient that is adaptively configured based on the learning rate. We extensively benchmark SSSD against a baseline and two state-of-the-art object detector architectures on the COCO dataset by varying the coefficients and backbone and detector networks. We demonstrate that SSSD achieves higher average precision in most experimental settings, is robust to a wide range of coefficients, and benefits from our stepwise distillation procedure.

Tags:

object detection

deep learning

knowledge distillation

smooth

self-distillation

SMOOTH AND STEPWISE SELF-DISTILLATION FOR OBJECT DETECTION

Jieren Deng, Xin Zhou, Hao Tian, Zhihong Pan, Derek Aguiar

More Like This

Signal Processing and Deep Learning for Practical Active Noise Control

Short Course Bundle: ICASSP 2023 COURSE 2: Graph Signal Processing and Geometric Learning: A Foundational Approach (Parts 1-4)

Short Course Bundle: ICASSP 2023 COURSE 1: A Hands-on Approach for Implementing Stochastic Optimization Algorithms from Scratch (Parts 1-4)

Join the IEEE Signal Processing Society