Effects of noise suppression and envelope dynamic range compression on the intelligibility of vocoded sentences for a tonal language

Chen, Fei and Zheng, Dingchang and Tsao, Yu (2017) Effects of noise suppression and envelope dynamic range compression on the intelligibility of vocoded sentences for a tonal language. Journal of the Acoustical Society of America, 142 (3). pp. 1157-1166. ISSN 1520-8524

[img]
Preview
Text
Accepted Version
Available under the following license: Creative Commons Attribution Non-commercial No Derivatives.

Download (395kB) | Preview
Official URL: https://doi.org/10.1121/1.5000164

Abstract

Vocoder simulation studies have suggested that the carrier signal type employed affects the intelligibility of vocoded speech. The present work further assessed how carrier signal type interacts with additional signal processing, namely, single-channel noise suppression and envelope dynamic range compression, in determining the intelligibility of vocoder simulations. In Experiment 1, Mandarin sentences that had been corrupted by speech spectrum-shaped noise (SSN) or two-talker babble (2TB) were processed by one of four single-channel noise-suppression algorithms before undergoing tone-vocoded (TV) or noise-vocoded (NV) processing. In Experiment 2, dynamic ranges of multiband envelope waveforms were compressed by scaling of the mean-removed envelope waveforms with a compression factor before undergoing TV or NV processing. TV Mandarin sentences yielded higher intelligibility scores with normal-hearing (NH) listeners than did noise-vocoded sentences. The intelligibility advantage of noise-suppressed vocoded speech depended on the masker type (SSN vs 2TB). NV speech was more negatively influenced by envelope dynamic range compression than was TV speech. These findings suggest that an interactional effect exists between the carrier signal type employed in the vocoding process and envelope distortion caused by signal processing.

Item Type: Journal Article
Additional Information: Copyright 2017 Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America. The following article appeared in The Journal of the Acoustical Society of America, 142 (3). pp. 1157-1166. ISSN 0001-4966 and may be found at https://doi.org/10.1121/1.5000164.
Keywords: Vocoded sentences, Speech perception, Wave mechanics
Faculty: Faculty of Medical Science
Depositing User: Professor D Zheng
Date Deposited: 28 Sep 2017 13:46
Last Modified: 07 Mar 2019 11:57
URI: http://arro.anglia.ac.uk/id/eprint/702214

Actions (login required)

Edit Item Edit Item