Deep Complex Convolutional Recurrent Network for Multi-Channel Speech Enhancement and Dereverberation

Abstract

This paper proposes a neural network based system for multi-channel speech enhancement and dereverberation. Speech recorded indoors by a far field microphone, is invariably degraded by noise and reflections. Recent single channel enhancement systems have improved denoising performance, but do not reduce reverberation, which also reduces speech quality and intelligibility. To address this, we propose a deep complex convolution recurrent network (DCCRN) based multi-channel system, with integrated minimum power distortionless response (MPDR) beamformer and weighted prediction error (WPE) preprocessing.

PESQ and STOI performance is evaluated on a test set of room impulse responses and noise samples recorded by the same setup. The proposed system shows a statistically significant improvement over competitive systems.

Read the publication

Language

English

Author(s)

Femke B. Gelderblom
Tor Andre Myrvoll

Affiliation

SINTEF Digital / Sustainable Communication Technologies
Norwegian University of Science and Technology

Year

2021

Publisher

IEEE (Institute of Electrical and Electronics Engineers)

Book

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)

ISBN

9781728163383

DOI

https://doi.org/10.1109/mlsp52302.2021.9596086

Read fulltext

https://hdl.handle.net/11250/2987939

View this publication at Norwegian Research Information Repository

Contact us

Our services

Career

Sustainability

Management and board

Institutes

Other units

About us

Follow us