Automatic Extraction of Subcategorization from Corpora

Computer Laboratory
University of Cambridge
Cambridge, England
March 12, 1996

Abstract
--------
I will describe a novel technique for constructing a
subcategorization dictionary from textual corpora. Each dictionary
entry encodes the relative frequency of occurrence of a comprehensive
set of subcategorization classes for English. An initial experiment
demonstrates that the technique achieves accuracy comparable to
previous approaches, which are all limited to a highly restricted set
of subcategorization classes.
 
**************************************************************************

Seminar Schedule