Nathan Schneider (Georgetown University) “The Ins and Outs of Preposition Semantics: Challenges of Coverage and Cross-Linguistic Adequacy”

February 21, 2017 @ 12:00 pm – 1:15 pm
Hackerman Hall B17
3400 N Charles St
Baltimore, MD 21218
Center for Language and Speech Processing


In most accounts of semantics, prepositions fly under the radar. I will argue that they should instead be put front and center given their crucial status as linkers of meaning—whether for spatial and temporal relations, for predicate-driven roles, or in special constructions. To that end, we have sought to characterize semantic functions expressed by prepositions in English, and similar markers in other languages. One central challenge is coverage: in order to comprehensively annotate prepositions in a corpus, we have found it necessary to develop and document a rich hierarchy of semantic functions (Srikumar & Roth 2013; Schneider et al. 2015, 2016). Another central challenge is cross-linguistic adequacy: we see that certain functions are expressed with adpositions in some languages but not others, and languages differ in how functions are grouped under each adposition (making them especially difficult for second language learners and machine translation systems). We have released an English corpus comprehensively annotated with preposition functions (, and are currently annotating parallel text in Korean, Hindi, and Hebrew to investigate cross-linguistic similarities and differences with respect to adposition/case semantics.

This is joint work with Vivek Srikumar, Jena Hwang, Archna Bhatia, Na-Rae Han, Meredith Green, Abhijit Suresh, Kathryn Conger, Tim O’Gorman, and Martha Palmer.


Nathan Schneider is an annotation schemer and computational modeler for natural language. As Assistant Professor of Linguistics and Computer Science at Georgetown University, he looks for synergies between practical language technologies and the scientific study of language. He specializes in broad-coverage semantic analysis: designing linguistic meaning representations, annotating them in corpora, and automating them with statistical natural language processing techniques. A central focus in this research is the nexus between grammar and lexicon as manifested in multiword expressions and adpositions/case markers. He has inhabited UC Berkeley (BA in Computer Science and Linguistics), Carnegie Mellon University (Ph.D. in Language Technologies), and the University of Edinburgh (postdoc). Now a Hoya, he continues to play with data and algorithms for linguistic meaning.

Center for Language and Speech Processing