PAN: Phoneme-Aware Network for Monaural Speech Enhancement
Zhihao Du, Ming Lei, Jiqing Han, Shiliang Zhang
SPS
IEEE Members: $11.00
Non-members: $15.00
Length: 14:17
Current methods for monaural speech enhancement use only acoustic information and ignore the phonetic information of an utterance. In the voice conversion community, significant progress has been made by exploiting phonetic information through phonetic posteriorgrams (PPGs). Inspired by this progress, we propose a phoneme-aware network (PAN) that uses noisy PPGs for speech enhancement. Since PPG prediction and speech enhancement benefit from each other, a PPG predictor is incorporated into the PAN, and an iterative training algorithm is proposed for it. Experimental results show that using the phonetic information improves enhancement performance in terms of speech intelligibility, perceptual quality, and character error rate. To the best of our knowledge, this is the first work to introduce PPGs into speech enhancement.
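To make the idea of a PPG concrete, here is a minimal sketch in plain Python. A PPG assigns each frame a posterior distribution over phoneme classes, so every row sums to 1. The abstract does not say how the PAN combines PPGs with acoustic features; the frame-wise concatenation in `condition_on_ppg`, the function names, the toy sizes, and the random inputs are all assumptions made for illustration, not the authors' architecture.

```python
import math
import random

def softmax(logits):
    """Numerically stable softmax over one frame of phoneme logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def posteriorgram(frame_logits):
    """Turn per-frame phoneme logits (T x P) into a PPG (T x P); each row sums to 1."""
    return [softmax(frame) for frame in frame_logits]

def condition_on_ppg(noisy_features, ppg):
    """Hypothetical conditioning step: append each frame's phoneme
    posteriors to its acoustic features, giving T x (F + P)."""
    if len(noisy_features) != len(ppg):
        raise ValueError("features and PPG must have the same number of frames")
    return [feat + post for feat, post in zip(noisy_features, ppg)]

# Toy sizes chosen only for illustration: 4 frames, 6 acoustic bins, 3 phonemes.
T, F, P = 4, 6, 3
rng = random.Random(0)
noisy_features = [[rng.random() for _ in range(F)] for _ in range(T)]
# Stand-in for the output of a PPG predictor run on noisy speech.
frame_logits = [[rng.gauss(0.0, 1.0) for _ in range(P)] for _ in range(T)]

ppg = posteriorgram(frame_logits)
enhancer_input = condition_on_ppg(noisy_features, ppg)
print(len(enhancer_input), len(enhancer_input[0]))  # 4 9
```

In a real system the logits would come from a trained phoneme classifier and the combined features would feed an enhancement network; both are omitted here.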