NIST RT'05S evaluation : Pre-processing
- in NIST 2005 Spring Rich Transcription Evaluation Workshop, 2005
Abstract
This paper presents different pre-processing techniques, coupled with three speaker diarization systems in the framework of the NIST 2005 Spring Rich Transcription campaign (RT'05S).
SPEAKER SEGMENTATION AND CLUSTERING FOR SIMULTANEOUSLY PRESENTED SPEECH
Abstract
This paper proposes a new scheme for unsupervised segmentation and clustering of speech in cases where multiple speakers are presented simultaneously at different SNRs. The new elements in our work are the development of a new feature for segmenting and clustering simultaneously presented speech, a procedure for identifying a candidate set of possible speaker-change points, and the use of pair-wise cross-segment distance distributions to cluster segments by speaker. The proposed system is evaluated in terms of the F measure. It is compared to a baseline system that uses MFCCs as acoustic features, the Bayesian Information Criterion (BIC) for detecting speaker-change points, and the Kullback-Leibler distance for clustering the segments. Experimental results indicate that the new system consistently provides better performance than the baseline system at very small computational cost.
Index Terms: speech segmentation, speaker clustering, feature extraction
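The two baseline components named in the abstract, BIC-based change detection and Kullback-Leibler clustering distance, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single scalar feature per frame modeled by one Gaussian per segment (real systems use multi-dimensional MFCC vectors with full covariances), and the function names, the penalty weight `lam`, and the synthetic data are all hypothetical.

```python
import math
import random


def mean_var(xs):
    """Maximum-likelihood mean and variance of a list of scalar features."""
    n = len(xs)
    m = sum(xs) / n
    v = sum((x - m) ** 2 for x in xs) / n
    return m, max(v, 1e-12)  # floor the variance to keep log() finite


def delta_bic(xs, t, lam=1.0):
    """Delta-BIC for splitting xs at index t (1-D Gaussian assumption).

    Positive values favor modeling xs[:t] and xs[t:] with two Gaussians,
    i.e. they suggest a speaker change at t.
    """
    n, n1, n2 = len(xs), t, len(xs) - t
    _, v = mean_var(xs)
    _, v1 = mean_var(xs[:t])
    _, v2 = mean_var(xs[t:])
    gain = 0.5 * (n * math.log(v) - n1 * math.log(v1) - n2 * math.log(v2))
    # Penalty 0.5 * (d + d(d+1)/2) * log(n) with d = 1 reduces to log(n).
    penalty = lam * math.log(n)
    return gain - penalty


def symmetric_kl(a, b):
    """Symmetric KL distance between 1-D Gaussians fitted to segments a and b."""
    m1, v1 = mean_var(a)
    m2, v2 = mean_var(b)
    d2 = (m1 - m2) ** 2
    kl_ab = 0.5 * (math.log(v2 / v1) + (v1 + d2) / v2 - 1.0)
    kl_ba = 0.5 * (math.log(v1 / v2) + (v2 + d2) / v1 - 1.0)
    return kl_ab + kl_ba


# Synthetic stand-in for two speakers: same variance, different means.
rng = random.Random(0)
spk_a = [rng.gauss(0.0, 1.0) for _ in range(200)]
spk_b = [rng.gauss(3.0, 1.0) for _ in range(200)]
stream = spk_a + spk_b

# Scan candidate change points, keeping a margin so both sides have data.
best_t = max(range(20, len(stream) - 20), key=lambda t: delta_bic(stream, t))
```

In this sketch, `best_t` is the candidate with the largest Delta-BIC, and `symmetric_kl` gives a distance that a clustering step could use to decide whether two segments belong to the same speaker. The scalar-feature simplification keeps the example dependency-free; extending it to MFCC vectors would replace the variances with covariance determinants.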

