Ferhan Ture (Comcast Applied AI Research Group) “A Family of Neural Models for Voice Query Understanding on an Entertainment Platform”

When:
September 13, 2019 @ 12:00 pm – 1:15 pm
2019-09-13T12:00:00-04:00
2019-09-13T13:15:00-04:00
Where:
Hackerman Hall B17
3400 N. Charles Street
Baltimore
MD
Cost:
Free

Abstract

We tackle the challenge of understanding voice queries posed against the Comcast Xfinity X1 entertainment platform, where consumers direct speech input at their “voice remotes”. Such queries range from specific program navigation (i.e., watch a movie) to requests with vague intents and even queries that have nothing to do with watching TV. We present successively richer neural network architectures to tackle this challenge based on two key insights: The first is that session context can be exploited to disambiguate queries and recover from ASR errors, which we operationalize with hierarchical recurrent neural networks. The second insight is that query understanding requires evidence integration across multiple related tasks, which we identify as program prediction, intent classification, and query tagging. We present a novel multi-task neural architecture that jointly learns to accomplish all three tasks. Our initial model, already deployed in production, serves millions of queries daily with an improved user experience.

Biography

Ferhan Ture leads the Natural Language Processing team at the Comcast Applied AI Research group, where he blends latest advancements in the field into a suite of voice-activated Xfinity products used by millions every day. Prior to that, he was a scientist at BBN Technologies, developing novel algorithms for multilingual question answering as part of the DARPA BOLT program. From 2008 to 2013, he worked on search and language translation problems under the supervision of Jimmy Lin, as part of his doctorate studies at the Department of Computer Science at University of Maryland.

 

Johns Hopkins University

Johns Hopkins University, Whiting School of Engineering

Center for Language and Speech Processing
Hackerman 226
3400 North Charles Street, Baltimore, MD 21218-2680

Center for Language and Speech Processing