Lance: Efficient Low-Precision Quantized Winograd Convolution For Neural Networks Based On Graphics Processing Units

Guangli Li, Lei Liu, Xueying Wang, Xiaobing Feng, Xiu Ma

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 06:48

04 May 2020

Accelerating deep convolutional neural networks has become an active topic and sparked an interest in academia and industry. In this paper, we propose an efficient low-precision quantized Winograd convolution algorithm, called LANCE, which combines the advantages of fast convolution and quantization techniques. By embedding linear quantization operations into the Winograd-domain, the fast convolution can be performed efficiently under low-precision computation on graphics processing units. We test neural network models with LANCE on representative image classification datasets, including SVHN, CIFAR, and ImageNet. The experimental results show that our 8-bit quantized Winograd convolution improves the performance by up to 2.40x over the full-precision convolution with trivial accuracy loss.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Lance: Efficient Low-Precision Quantized Winograd Convolution For Neural Networks Based On Graphics Processing Units

Guangli Li, Lei Liu, Xueying Wang, Xiaobing Feng, Xiu Ma

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society