Histogram Equalization of the Speech Representation for Robust Speech Recognition (2001)
| Citations: | 12 - 2 self |
BibTeX
@MISC{Torre01histogramequalization,
author = {Angel de la Torre and Antonio M. Peinado and Jose C. Segura and Jose L. Perez and Carmen Bentez and Antonio J. Rubio},
title = {Histogram Equalization of the Speech Representation for Robust Speech Recognition},
year = {2001}
}
OpenURL
Abstract
The noise degrades the performance of Automatic Speech Recognition systems mainly due to the mismatch between the training and recognition conditions it introduces. The noise causes a distortion of the feature space which usually presents a non-linear behavior. In order to reduce this mismatch, the methods proposed for robust speech recognition try to compensate the noise effect either by obtaining an estimation of the clean speech or by adapting the recognizer acoustic models for a proper modeling of the noisy speech. In this paper we propose a method to compensate the noise effect over the speech representation. This method is based on the histogram equalization technique frequently applied for Digital Image Processing, which has been adapted to the speech representation. For each component of the feature vectors representing the speech signal, the histogram is estimated and the transformation which converts it into a reference histogram is calculated. Such transformations tend to compensate the distortion the noise produces over the different components of the feature vector and improve the performance of the recognition systems under noise conditions. We describe how the histogram equalization method can be adapted to robust speech recognition and present some recognition experiments to evaluate the proposed method.







