Synthetic Data For Dnn-Based Doa Estimation of Indoor Speech

Sammendrag

This paper investigates the use of different room impulse response (RIR) simulation methods for synthesizing training data for deep neural network-based direction of arrival (DOA) estimation of speech in reverberant rooms.

Different sets of synthetic RIRs are obtained using the image source method (ISM) and more advanced methods including diffuse reflections and/or source directivity. Multi-layer perceptron (MLP) deep neural network (DNN) models are trained on generalized cross correlation (GCC) features extracted for each set. Finally, models are tested on features obtained from measured RIRs.

This study shows the importance of training with RIRs from directive sources, as resultant DOA models achieved up to 51% error reduction compared to the steered response power with phase transform (SRP-PHAT) baseline (significant with p<<.01), while models trained with RIRs from omnidirectional sources did worse than the baseline. The performance difference was specifically present when estimating the azimuth of speakers not facing the array directly.

Les publikasjonen

Kategori

Vitenskapelig Kapittel/Artikkel/Konferanseartikkel

Oppdragsgiver

Research Council of Norway (RCN) / 256753

Språk

Engelsk

Forfatter(e)

Femke B. Gelderblom
Yi Liu
Johannes Kvam
Tor Andre Myrvoll

Institusjon(er)

Norges teknisk-naturvitenskapelige universitet
SINTEF Digital / Sustainable Communication Technologies
SINTEF Digital / Smart Sensors and Microsystems

År

2021

Forlag

IEEE (Institute of Electrical and Electronics Engineers)

Bok

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Hefte nr.

2021

ISBN

978-1-7281-7606-2

Side(r)

4390 - 4394

DOI

https://doi.org/10.1109/icassp39728.2021.9414415

Les fulltekst

https://hdl.handle.net/11250/2824872

Vis denne publikasjonen hos Cristin

Kontakt oss

Tjenester

Rapporter og publikasjoner

Forskningssenter og samarbeid

Karriere

Bærekraft

Institutter

Andre enheter

Ledelse og organisering

Om oss

Følg oss