Plosive Enhancement Using Phase Linearization and Smoothing

Conference: Speech Communication - 14th ITG Conference
09/29/2021 - 10/01/2021 at online

Proceedings: ITG-Fb. 298: Speech Communication

Pages: 5Language: englishTyp: PDF

Personal VDE Members are entitled to a 10% discount on this title

Authors:
Peer, Tal; Ziegert, Klaus-Johan; Gerkmann, Timo (Signal Processing (SP), Universität Hamburg, Germany)

Abstract:
Despite their small share in overall signal energy, plosives have been previously shown to be important for speech perception. We propose a simple, yet effective, model-based phase-aware speech enhancement approach specifically targeted at plosives. Starting from a model of the plosive burst as a unit impulse, we introduce three phase enhancement schemes: simple replacement of the noisy phase with a linear function, linear regression, as well as smoothing by local polynomial regression. To improve the outcome and compensate for model mismatch we also propose an SNR-based weighting. All schemes are evaluated under both oracle and realistic conditions, showing a consistent improvement in instrumentally predicted speech quality and, to a lesser degree, speech intelligibility. When only frames containing plosives are considered, a segmental SNR improvement of 2 dB to 6 dB can be observed, depending on the input SNR.