Segmentation of Telephone Speech Based on Speech and Non-Speech Models, SPECOM
Authors: Michael Heck, Christian Mohr, Sebastian Stüker, Markus Müller, Kevin Kilgour, Jonas Gehring, Quoc Bao Nguyen, Van Huy Nguyen, and Alex Waibel
Publisher: Lecture Notes in Computer Science, Volume 8113, ISSN 0302-9743, Czech Republic, Sept -
In this paper we investigate the automatic segmentation of recorded telephone conversations based on models for speech and non-speech to find sentence-like chunks for use in speech recognition systems. Presented are two different approaches, based on Gaussian Mixture Models (GMMs) and Support Vector Machines (SVMs), respectively. The proposed methods provide segmentations that allow for competitive speech recognition performance in terms of word error rate (WER) compared to manual segmentation.