Skip to main content

Multilingual end-to-end spoken language understanding for ultra-low footprint applications

Markus Mueller (Amazon Alexa); Anastasios Alexandridis (Amazon.com); Zach Trozenski (Amazon Alexa); Joel Whiteman (Amazon Alexa); Grant Strimel (Amazon Alexa); Nathan Susanj (Amazon Alexa); Athanasios Mouchtaris (Amazon Alexa); Siegfried Kunzmann (Amazon Alexa)

  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00
07 Jun 2023

Tiny Signal-to-Interpretation (TinyS2I) has been recently introduced as an ultra low-footprint end-to-end spoken language understanding (SLU) model. This architecture is capable of running in ultra resource constrained environments like voice assistant devices, while at the same time reducing latency. In this work, we propose an extension to TinyS2I and train a multilingual system supporting several languages. Multilingual TinyS2I models show little to no degradation compared to their monolingual counterparts. Increasing the network size in width and depth improves the classification accuracy for mono- and multilingual setups, with the multilingual one improving beyond the monolingual accuracy. This enables users to interact with the device in the language of their choice and dynamically switch between languages without an explicit language setting or accuracy degradation.

More Like This

  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00
  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00
  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00