Grid-Based Decimation for Wavelet Transforms with Stably Invertible Implementation

Nicki Holighaus (Acoustics Research Institute, Austrian Academy of Sciences); Günther Koliander (Acoustics Research Institute, Austrian Academy of Sciences); Clara Hollomey (Acoustics Research Institute, Austrian Academy of Sciences); Friedrich Pillichshammer (Institute of Financial Mathematics and Applied Number Theory, Johannes Kepler University Linz)

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

09 Jun 2023

The constant center frequency to bandwidth ratio (Q-factor) of wavelet transforms provides a very natural representation for audio data. However, invertible wavelet transforms have either required non-uniform decimation—leading to irregular data structures that are cumbersome to work with—or require excessively high oversampling with unacceptable computational overhead. Here, we present a novel decimation strategy for wavelet transforms that leads to stable representations with oversampling rates close to one and uniform decimation. Specifically, we show that finite implementations of the resulting representation are energy-preserving in the sense of frame theory. The obtained wavelet coefficients can be stored in a time-frequency matrix with a natural interpretation of columns as time frames and rows as frequency channels. This matrix structure immediately grants access to a large number of algorithms that are successfully used in time-frequency audio processing, but could not previously be used jointly with wavelet transforms. We demonstrate the application of our method in processing based on nonnegative matrix factorization, in onset detection, and in phaseless reconstruction.

Tags:

Speech production, perception and psychoacoustics

Grid-Based Decimation for Wavelet Transforms with Stably Invertible Implementation

Value-Added Bundle(s) Including this Product

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

Drone-vs-Bird Detection Grand Challenge at ICASSP23

Epilepsy Detection Grand Challenge

Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality

Join the IEEE Signal Processing Society