Auditory feature for monaural speech segregation

Yi Jiang; RunshenG Liu; Yuanyuan Zu

doi:10.2991/icieac-14.2014.16

<Previous Article In Volume

Next Article In Volume>

Auditory feature for monaural speech segregation

Authors

Yi Jiang, RunshenG Liu, Yuanyuan Zu

Corresponding Author

Yi Jiang

Available Online March 2014.

DOI: 10.2991/icieac-14.2014.16 How to use a DOI?
Keywords: gammatone frequency cepstral coefficients (GFCC); monaural speech segregation; binary classification; time-frequency(T-F) unit
Abstract: Monaural speech segregation has been a very challenging problem for speech signal processing. The implication of the ideal binary masks to an auditory mixture has been shown to yield substantial improvements in signal-to-noise-ratio (SNR) and intelligibility. In this paper, we use the time-frequency (T-F) unit level gammatone frequency cepstral coefficients (GFCC) auditory feature to estimate the ideal binary mask for monaural speech segregation. The paper reports the successful attempt to use GFCC as the segregation cue with deep neural networks (DNNs) classifier. Results show that robust performance can be achieved across noisy and reverberant conditions.
Copyright: © 2014, the Authors. Published by Atlantis Press.
Open Access: This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

<Previous Article In Volume

Next Article In Volume>

Volume Title: Proceedings of the 2nd International Conference on Information, Electronics and Computer
Series: Advances in Intelligent Systems Research
Publication Date: March 2014
ISBN: 978-90-78677-99-4
ISSN: 1951-6851
DOI: 10.2991/icieac-14.2014.16 How to use a DOI?
Open Access: This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

ris enw bib

TY  - CONF
AU  - Yi Jiang
AU  - RunshenG Liu
AU  - Yuanyuan Zu
PY  - 2014/03
DA  - 2014/03
TI  - Auditory feature for monaural speech segregation
BT  - Proceedings of the 2nd International Conference on Information, Electronics and Computer
PB  - Atlantis Press
SP  - 69
EP  - 72
SN  - 1951-6851
UR  - https://doi.org/10.2991/icieac-14.2014.16
DO  - 10.2991/icieac-14.2014.16
ID  - Jiang2014/03
ER  -

download .riscopy to clipboard