Modeling Systematic Variations in pronunciation Modeling via a Language-Dependent Hidden Speaking Mode

Research Group of the 1996 Summer Workshop

The goal of this project is to utilize the results of conversational discourse analysis (e.g., prosody) to model pronunciation variations, and, more specifically, to investigate the use of a hidden speaking mode to represent systematic variations that are correlated with the word sequence (predictable from syntactic structure and thus includable into the language model). Possible systematic factors affecting pronunciation differences are local factors, such as coarticulation effects across word boundaries, linguistic factors, such as phrasing and focus (which are cued by prosodic markers that affect the acoustic realization of a word), speaking rate (which may be associated with discourse structure), and global factors, such as dialect.

Team Members
Senior Members
Bill Bryne	JHU
Mari Ostendorf	BU
Ken Ross	BBN
Liz Shriberg	SRI
David Talkin	Entropic
Alex Waibel	CMU
Barbara Wheatley	DoD
Graduate Students
Asela Gunarwardana	CLSP
Michiel Bacchiani	BU
Michael Finke	CMU
Sam Roweis	Caltech
Torsten Zeppenfeld	CMU

Modeling Systematic Variations in pronunciation Modeling via a Language-Dependent Hidden Speaking Mode

Upcoming Seminars

Center for Language and Speech Processing