| CLSP RT-02 System | ||||||||||||||||||||||||||||||||||||||||||||||||
|
1 Discriminative Speaker adaptive Training
Until recently SAT techniques have been based on maximum likelihood (ML) parameter estimation framework. During MLE training, model parameters are adjusted to increase the likelihood of the word strings corresponding to the training utterances without taking account of the probability of other possible word strings. MMIE training was proposed in as an alternative to MLE and maximises the mutual information between the training word sequences and the observation sequences. The MMIE criterion increases the probability of the model sequence corresponding to the training data given the training data. Discriminative optimization criteria can be more effective in reducing the word error rate than maximum likelihood estimation and hence are of interest. Recent work by McDonough provides the formulae for reestimating the linear transforms using MLLR and the models parameters using MMIE. In this work both the linear transforms and the model parameters are reestimated under MMIE criteria. The SAT training routine used is as follow: 1. Start with the speaker independent model set. 2. Estimate a speaker dependent transform for each speaker using the MMIE framework. 3. Estimate the new model set (SI) given the current speaker-dependent transform. 4. Goto step 2. Below we have the re-estimation formulas
2. Performance of ML-SAT D-SAT Systems
|
||||||||||||||||||||||||||||||||||||||||||||||||
| The
Center for Language and Speech Processing
The Johns Hopkins University 3400 North Charles Street, Barton Hall Baltimore, MD 21218 |
|||||
| Telephone: (410) 516-4237 | Fax: (410) 516-5050 | E-mail: clsp@clsp.jhu.edu | |||