Unsupervised Training of an HMM-based Speech Recognizer for Topic Classification – Herb Gish (BBN Technologies)
View Seminar Video
We address the problem of performing topic classification of speech when no transcriptions from the speech corpus of interest are available. The approach we take is one of incremental learning about the speech corpus starting with adaptive segmentation of the speech, leading to the generation of discovered acoustic units and a segmental recognizer for these units, and finally to an initial tokenization of the speech for the training of a HMM speech recognizer. The recognizer trained is BBN’s Byblos system. We discuss the performance of this system and also consider the case when a small amount of transcribed data is available.
Dr. Herbert Gish received a Ph.D. in Applied Mathematics from Harvard University in 1967. He is a Principal Scientist at BBN Technologies in Cambridge, Massachusetts in the Speech and Language Processing Department. His most recent work deals with information extraction from speech and text with a focus on problems that have very limited amounts of training data available.