CLSP Homepage : Workshop Homepage
Workshop 2005
Workshop 2005 Saturday, July 4, 2009

Parsing and Spoken Structural Event Detection


Even though speech-recognition accuracy has improved significantly over the past 10 years, these systems do not currently generate/model structural information (meta-data) such as sentence boundaries (e.g., periods) or the form of a disfluency (e.g., in .I want [to go] * {I mean} meet with Fred., .to go. is an edit, which is signaled by an interruption point indicated as *, as well as an edit term .I mean..). Automatic detection of these phenomena would simultaneously improve parsing accuracy and provide a mechanism for cleaning up transcriptions for the downstream text processing modules. Similarly, constraints imposed by text processing systems such as parsers can be used to assign certain types of meta-data for correct identification of disfluencies.

The goal of this workshop is to investigate the enrichment of speech recognition output using parsing constraints and the improvement of parsing accuracy due to speech recognition enrichment. We will investigate the following questions: (1) How does the incorporation of syntactic knowledge affect sentence boundary and disfluency detection accuracy? (2) How does the availability of more accurate sentence boundaries and disfluency annotation affect parsing accuracy? This workshop project is interdisciplinary bringing together researchers from the speech recognition and natural language processing communities. The undergraduates on this project will be exposed to research that spans these two important areas, and will gain experience on approaches to interfacing between technologies in these two areas.
Click here for technical details
Team Members:
Mary Harper Team Leader Purdue University harper at ecn dot purdue dot edu
Bonnie Dorr Senior Researcher University of Maryland bonnie at umiacs dot umd dot edu
John Hale Senior Researcher Michigan State University jthale at msu dot edu
Brian Roark Senior Researcher Oregon Health and Sciences University roark at cslu dot ogi dot edu
Izhak Shafran Senior Researcher Johns Hopkins University zakshafran at jhu dot edu
Yang Liu Graduate Student ICSI yangl at ICSI dot Berkeley dot edu
Matt Lease Graduate Student Brown University mlease at cs dot brown dot edu
Matt Snover Graduate Student University of Maryland snover at cs dot umd dot edu
Lisa Yung Graduate Student Johns Hopkins University lisayung at uiuc dot edu
Anna Krasnyanskaya Undergraduate Student UCLA ak858 at ucla dot edu
Robin Stewart Undergraduate Student Williams 06rss_2 at williams dot edu
 
Technical Contact:
Mary Harper
School of Electrical and Computer Engineering
Purdue University
Administrative Contact:
2005 Summer Workshop
Center for Language and Speech Processing
Johns Hopkins University

The Center for Language and Speech Processing
The Johns Hopkins University
3400 North Charles Street, Barton Hall
Baltimore, MD 21218
*Telephone: (410) 516-4237 *Fax: (410) 516-5050 *E-mail: clsp@clsp.jhu.edu