Parsing and Spoken Structural Event Detection

Research Group of the 2005 Summer Workshop

Even though speech-recognition accuracy has improved significantly over the past 10 years, these systems do not currently generate/model structural information (meta-data) such as sentence boundaries (e.g., periods) or the form of a disfluency (e.g., in .I want [to go] * {I mean} meet with Fred., .to go. is an edit, which is signaled by an interruption point indicated as *, as well as an edit term .I mean..). Automatic detection of these phenomena would simultaneously improve parsing accuracy and provide a mechanism for cleaning up transcriptions for the downstream text processing modules. Similarly, constraints imposed by text processing systems such as parsers can be used to assign certain types of meta-data for correct identification of disfluencies.

The goal of this workshop is to investigate the enrichment of speech recognition output using parsing constraints and the improvement of parsing accuracy due to speech recognition enrichment. We will investigate the following questions: (1) How does the incorporation of syntactic knowledge affect sentence boundary and disfluency detection accuracy? (2) How does the availability of more accurate sentence boundaries and disfluency annotation affect parsing accuracy? This workshop project is interdisciplinary bringing together researchers from the speech recognition and natural language processing communities. The undergraduates on this project will be exposed to research that spans these two important areas, and will gain experience on approaches to interfacing between technologies in these two areas.

Opening Day Presentation
Closing Day Presentation
Final Report
SParseval Tool

Team Members
Senior Members
Bonnie Dorr	University of Maryland
John Hale	Michigan State University
Mary Harper	Purdue University
Brian Roark	Oregon Health and Sciences University
Izhak Shafran	Johns Hopkins University
Graduate Students
Matt Lease	Brown University
Yang Liu	ICSI
Matt Snover	University of Maryland
Lisa Yung	Johns Hopkins University
Undergraduate Students
Anna Krasnyanskaya	UCLA
Robin Stewart	Williams

Parsing and Spoken Structural Event Detection

Upcoming Seminars

Center for Language and Speech Processing