CLSP Homepage : Workshop Homepage
Workshop 2002
July 31, 2002 Guest Lecture Friday, August 29, 2008


Jump To:

The WS02 Guest Lectures are open to the public, and a video recording of the lecture will be archived in the CLSP library. Talks are held every Wednesday and on selected other dates throughout the course of the workshop. All lectures will be given on the Homewood Campus of Johns Hopkins University.

Click here for directions to JHU.



Guest Lecture Information
Facing the Curse of Dimensionality in Statistical Language Modeling using Distributed Representations: Yoshua Bengio - 07/31/2002


slides from Yoshua Bengio's lecture (.pdf format)
  • Location: Shaffer Hall, Room 100

  • Time: 10:30 am - 12:00 noon

  • Abstract:

    From the point of view of statistical machine learning, a central challenge in the problem of statistical language modeling is the combinatorial explosion of combinations of words in sentences, which is related to the curse of dimensionality in non-parametric modeling of high-dimensional data. In terms of probabilistic modeling, this translates into the question of distributing probability mass from the observed training sentences in such a way as to assign as high a probability as possible to sentences not yet seen. We first try to identify the main ways to thus generalize that have been used in statistical language models. We then describe an approach based on a distributed representation of words (to avoid the curse of dimensionality), using an artificial neural network, which learns a smooth notion of similarity between words in order to smooth the empirical distribution. This approach is similar to maximum entropy models, but with the features also learned (by penalized maximum likelihood), and has yielded very good results, but it is very computationally expensive. To address that issue we introduce sampling methods similar to Hinton's contrastive divergence, in order to approximate the gradient of the distribution's partition function. Finally, we describe current research to take advantage of WordNet in order to begin incorporating the notions of polysemy, ontology, and grammar into this approach.

     

  • Supplmental Materials:



The Center for Language and Speech Processing
The Johns Hopkins University
3400 North Charles Street, Barton Hall
Baltimore, MD 21218
*Telephone: (410) 516-4237 *Fax: (410) 516-5050 *E-mail: clsp@clsp.jhu.edu