Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model

Conference: Speech Communication - 15th ITG Conference
09/20/2023 - 09/22/2023 at Aachen

doi:10.30420/456164022

Proceedings: ITG-Fb. 312: Speech Communication

Pages: 5Language: englishTyp: PDF

Authors:
Lemercier, Jean-Marie; Gerkmann, Timo (Universität Hamburg, Germany)
Thiemann, Joachim; Koning, Raphael (Advanced Bionics, Hannover, Germany)

Abstract:
In this paper we present a method for single-channel wind noise reduction using our previously proposed diffusionbased stochastic regeneration model combining predictive and generative modelling. We introduce a non-additive speech in noise model to account for the non-linear deformation of the membrane caused by the wind flow and possible clipping. We show that our stochastic regeneration model outperforms other neural-network-based wind noise reduction methods as well as purely predictive and generative models, on a dataset using simulated and realrecorded wind noise. We further show that the proposed method generalizes well by testing on an unseen dataset with real-recorded wind noise. Audio samples, data generation scripts and code for the proposed methods can be found online.