WORKSHOP
'96
Hidden Speaking Mode Group
The goal of this project is to utilize the results of conversational
discourse analysis (e.g., prosody) to model pronunciation variations,
and, more specifically, to investigate the use of a hidden speaking mode
to represent systematic variations that are correlated with the word
sequence (predictable from syntactic structure and thus includable into
the language model). Possible systematic factors affecting pronunciation
differences are local factors, such as coarticulation effects across word
boundaries, linguistic factors, such as phrasing and focus (which are
cued by prosodic markers that affect the acoustic realization of a word),
speaking rate (which may be associated with discourse structure), and
global factors, such as dialect.
- People
Mari Ostendorf - BU (leader) (mo)
Michiel Bacchiani - BU (bacchian)
Bill Byrne - JHU (byrne)
Michael Finke - CMU
Asela Gunawardna - JHU (zilla)
Ken Ross - BBN (knross)
Sam Roweis - Caltech (roweis)
Liz Shriberg - SRI (liz)
David Talkin - Entropic (talkin)
Alex Waibel - CMU
Barbara Wheatley - DoD
Torsten Zeppenfeld - CMU (tor)
Data and resources used by the Hidden Mode group
is available at the
CLSP ftp site
Final Report (postscript)