Subjective Intelligibility of Deep Neural Network-Based Speech Enhancement

Abstract

Recent literature indicates increasing interest in deep neural networks for use in speech enhancement systems. Currently, these systems are mostly evaluated through objective measures of speech quality and/or intelligibility. Subjective intelligibility evaluations of these systems have so far not been reported. In this paper we report the results of a speech recognition test with 15 participants, where the participants were asked to pick out words in background noise before and after enhancement using a common deep neural network approach. We found that, although the objective measure STOI predicts that intelligibility should improve or at the very least stay the same, the speech recognition threshold, which is a measure of intelligibility, deteriorated by 4 dB. These results indicate that STOI is not a good predictor for the subjective intelligibility of deep neural network-based speech enhancement systems. We also found that the postprocessing technique of global variance normalisation does not significantly affect subjective intelligibility.

Category

Academic article

Language

English

Author(s)

Institution(s)

SINTEF Digital / Sustainable Communication Technologies

Year

2017

Published in

Interspeech (USB)

ISSN

2308-457X

Publisher

International Speech Communication Association

Volume

2017-August

Page(s)

1968 - 1972

External resources

DOI

https://doi.org/10.21437/interspeech.2017-1041

Read full text

https://hdl.handle.net/11250/2451833
