CLSP
WORKSHOP '96

Optimistic & Pessimistic


This page describes an experiment to learn bound for the rescoring of our HTK lattices usiong acoustic probabilities that were either perfect representations of Viterbi-aligned phonetic targets or else probabilities that had no relation to the acoustics, conducted by Mark Ordowski and Nelson Morgan.

OPTIMISTIC or PESSIMISTIC

These two tests provide an upper(perfect acoustic probs) and lower(irrelevant acoustic probs) bound of how the MLP effects the decoding stage of the LVCSR system.

The optimistic test was an alignment of the 240 dev-test male speakers. The MLP output probabilities were manipulated so that based on the alignment a probability of 1 was set for the correct phone and the other phone probabilities were set to zero. These probabilities were then used to rescore the HTK lattice. The result was a 33.3% WER (Speaker Stats).While we would expect there to be a 10% word error rate due to the limitations of teh lattice topology, this result may reflect the limitations of using the language scores from HTK.

The pessimistic test was essentially the opposite of the optimistic test. The MLP output was set for every frame to be the prior phone probabilities. The result of this test was a 73.5% WER (Speaker Stats). This shows the large degree to which all of our results are affected by the acoustic decoding done to create the lattices in the first place, and suggests caution in interpreting results in the vicinity of 70%. Essentially, 73.5% should be considered the baseline for our rescoring experiments.


Last modified on October 16, 1996
Christophe Ris <ris@cspjhu.ece.jhu.edu >