This project is concerned with speech and speaker variation as it affects signal processing and acoustic modeling:
Signal processing will involve non-linear speaker and channel adaptation, finding a common low-dimensional mapping of the training data based on the J-RASTA signal processing approach. We will also experiment with and exploit the multi-band recognition paradigm.
Variability as a function of global and local speaking rate will be incorporated into the acoustic model. We will exploit the discriminant HMM technology developed by Bourlard and Morgan, using transition probabilities that depend on the acoustics (and, in this case, on speaking rate).
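To make the idea of acoustics- and rate-dependent transition probabilities concrete, here is a minimal sketch. It assumes a single linear layer followed by a softmax for each previous state, standing in for whatever estimator the project would actually train; the function name `transition_probs`, the `weights` and `biases` layout, and all numeric values are hypothetical and chosen only for illustration.

```python
import math

def softmax(scores):
    """Normalize raw scores into a probability distribution."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def transition_probs(prev_state, acoustics, rate, weights, biases):
    """Return P(next state | prev_state, acoustics, rate).

    Hypothetical layout: weights[prev_state][next_state] is a list with one
    weight per acoustic feature plus one final weight for the speaking rate.
    """
    features = list(acoustics) + [rate]
    scores = []
    for w_row, b in zip(weights[prev_state], biases[prev_state]):
        scores.append(sum(w * f for w, f in zip(w_row, features)) + b)
    return softmax(scores)

# Toy two-state model with two acoustic features (made-up numbers).
weights = [
    [[0.5, -0.2, -1.0], [-0.5, 0.2, 1.0]],  # transitions out of state 0
    [[0.0, 0.0, -0.5], [0.0, 0.0, 0.5]],    # transitions out of state 1
]
biases = [[1.0, 0.0], [0.0, 0.0]]

frame = [0.3, 0.1]  # one frame of acoustic features
slow = transition_probs(0, frame, 0.5, weights, biases)
fast = transition_probs(0, frame, 2.0, weights, biases)
print("slow speech:", slow)
print("fast speech:", fast)
```

With these illustrative weights, a higher speaking rate shifts probability mass away from the self-loop in state 0 and toward the move to state 1, which is the kind of rate sensitivity the acoustic model is meant to capture.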
Team Members | |
---|---|
Senior Members | |
Hynek Hermansky | CLSP |
Herve Bourlard | Faculte Polytechnique de Mons (BE) / ICSI |
Jordan Cohen | IDA |
Nelson Morgan | ICSI |
Christophe Ris | Faculte Polytechnique de Mons (BE) |
Graduate Students | |
Mark Odrawski | CLSP |
Nikki Mirghafori | ICSI |
Sangita Tibrewala | Oregon Graduate Institute |