SPEECH RECOGNITION USING BIOLOGICALLY-INSPIRED NEURAL NETWORKS

Thomas Bohnstingl, Ayush Garg, Stanisław Woźniak, Evangelos Eleftheriou, Angeliki Pantazi, George Saon

Length: 00:12:32

10 May 2022

Automatic speech recognition (ASR) systems, such as the recurrent neural network transducer (RNN-T), have reached close to human-like performance and are deployed in commercial applications. However, their core operations depart from those of their powerful biological counterpart, the human brain. Meanwhile, current biologically-inspired ASR models lag behind in accuracy and focus primarily on small-scale applications. In this work, we revisit the incorporation of biologically-plausible models into deep learning and enhance their capabilities by taking inspiration from the brain's diverse neural and synaptic dynamics. In particular, we propose novel deep learning units that introduce neural connectivity concepts emulating axo-somatic and axo-axonic synapses, and we integrate them into the RNN-T architecture. We demonstrate for the first time that such a model can yield performance levels competitive with the state of the art. Moreover, our implementation has a significantly reduced computational cost and a lower latency.

Tags:

speech recognition

synapse types

spiking neural networks

spiking neural unit

rnn-t

Value-Added Bundle(s) Including this Product

ICASSP 2022, May 2022 Virtual and In-Person Conference - Presentation Videos Product Bundle
