Frame-Based Overlapping Speech Detection Using Convolutional Neural Networks

Midia Yousefi, John H.L. Hansen

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 13:59

04 May 2020

Naturalistic speech recordings usually contain speech signals from multiple speakers. This phenomenon can degrade the performance of speech technologies due to the complexity of tracing and recognizing individual speakers. In this study, we investigate the detection of overlapping speech on segments as short as 25 ms using Convolutional Neural Networks. We evaluate the detection performance using different spectral features, and show that pyknogram features outperforms other commonly used speech features. The proposed system can predict overlapping speech with an accuracy of 84% and Fscore of 88% on a dataset of mixed speech generated based on the GRID dataset.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Value-Added Bundle(s) Including this Product

04 May 2020

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

26 Apr 2024

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

SPS

Members: $150.00
IEEE Members: $250.00
Non-members: $350.00

19 Apr 2024

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

SPS

Members: $150.00
IEEE Members: $250.00
Non-members: $350.00

16 Oct 2022

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

SPS

Members: $150.00
IEEE Members: $250.00
Non-members: $350.00