CLSP
WORKSHOP '96

Hidden Speaking Mode Group

The goal of this project is to use the results of conversational discourse analysis (e.g., prosody) to model pronunciation variation. More specifically, the project investigates the use of a hidden speaking mode to represent systematic variations that are correlated with the word sequence (predictable from syntactic structure and thus suitable for inclusion in the language model). Possible systematic factors affecting pronunciation include local factors, such as coarticulation effects across word boundaries; linguistic factors, such as phrasing and focus (which are cued by prosodic markers that affect the acoustic realization of a word); speaking rate (which may be associated with discourse structure); and global factors, such as dialect.
People
Mari Ostendorf - BU (leader) (mo)
Michiel Bacchiani - BU (bacchian)
Bill Byrne - JHU (byrne)
Michael Finke - CMU
Asela Gunawardana - JHU (zilla)
Ken Ross - BBN (knross)
Sam Roweis - Caltech (roweis)
Liz Shriberg - SRI (liz)
David Talkin - Entropic (talkin)
Alex Waibel - CMU
Barbara Wheatley - DoD
Torsten Zeppenfeld - CMU (tor)
Data and resources used by the Hidden Mode group are available at the CLSP FTP site.

Final Report (postscript)