Converting Written Language To Spoken Language With Neural Machine Translation For Language Modeling
Shintaro Ando, Masayuki Suzuki, Nobuyasu Itoh, Gakuto Kurata, Nobuaki Minematsu
SPS
When building a language model (LM) for spontaneous speech, the ideal situation is to have a large amount of spoken, in-domain training data. Having such abundant data, however, is not realistic. We address this problem by generating texts in spoken language from texts in written language using a neural machine translation (NMT) model. We collected faithful transcripts of fully spontaneous speech and corresponding written versions and used them as a parallel corpus to train the NMT model. We used top-k random sampling, which generates a wider variety of high-quality texts than other decoding methods for NMT. Our experimental results show that the NMT model is capable of converting written texts in a certain domain to spoken texts, and that the converted texts are effective for training LMs.
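The abstract's decoding strategy, top-k random sampling, draws each output token from the k highest-scoring candidates instead of greedily taking the argmax, which is what lets one source sentence yield many distinct spoken-style variants. A minimal sketch of the per-step token choice (the function name and toy logits are illustrative, not from the paper):

```python
import numpy as np

def top_k_sample(logits, k, rng):
    """Sample one token id from the k highest-scoring logits.

    Illustrative sketch of top-k random sampling: restrict the softmax
    to the k best candidates, then draw randomly from that distribution.
    """
    logits = np.asarray(logits, dtype=np.float64)
    # Indices of the k largest logits (order among them does not matter).
    top_idx = np.argpartition(logits, -k)[-k:]
    top_logits = logits[top_idx]
    # Softmax restricted to the top-k candidates (shifted for stability).
    probs = np.exp(top_logits - top_logits.max())
    probs /= probs.sum()
    return int(rng.choice(top_idx, p=probs))

rng = np.random.default_rng(0)
logits = [2.0, -1.0, 0.5, 3.0, -2.0]
samples = [top_k_sample(logits, k=2, rng=rng) for _ in range(100)]
# With k=2, only the two highest-scoring tokens (ids 0 and 3) can appear.
print(sorted(set(samples)))
```

Repeating this draw at every decoding step produces diverse full-sentence translations, whereas beam search or greedy decoding would return (near-)identical outputs for the same input.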