Multilingual Grapheme-To-Phoneme Conversion With Byte Representation

Mingzhi Yu, Hieu Nguyen, Alex Sokolov, Jack Lepird, Kanthashree Sathyendra, Samridhi Choudhary, Athanasios Mouchtaris, Siegfried Kunzmann

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 13:11

04 May 2020

Grapheme-to-phoneme (G2P) models convert a written word into its corresponding pronunciation and are essential components in automatic-speech-recognition and text-to-speech systems. Recently, the use of neural encoder-decoder architectures has substantially improved G2P accuracy for mono- and multi-lingual cases. However, most multilingual G2P studies focus on sets of languages that share similar graphemes, such as European languages. Multilingual G2P for languages from different writing systems, e.g. European and East Asian, remains an understudied area. In this work, we propose a multilingual G2P model with byte-level input representation to accommodate different grapheme systems, along with an attention-based Transformer architecture. We evaluate the performance of both character-level and byte-level G2P using data from multiple European and East Asian locales. Models using byte representation yield 16.2%â 50.2% relative word error rate improvement over character-based counterparts for mono- and multi-lingual use cases. In addition, byte-level models are 15.0%â20.1% smaller in size. Our results show that byte is an efficient representation for multilingual G2P with languages having large grapheme vocabularies.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Multilingual Grapheme-To-Phoneme Conversion With Byte Representation

Mingzhi Yu, Hieu Nguyen, Alex Sokolov, Jack Lepird, Kanthashree Sathyendra, Samridhi Choudhary, Athanasios Mouchtaris, Siegfried Kunzmann

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society