Events

Amir Hussein “Towards End-to-End Conversational Speech Translation”

February 20, 2024
When: February 5, 2024 @ 12:00 pm – 1:15 pm
Where: Hackerman Hall B17, 3400 N. Charles Street, Baltimore, MD 21218

Abstract: Over the past three decades, the fields of automatic speech recognition (ASR) and machine translation (MT) have witnessed remarkable advancements, leading to exciting research directions such as speech-to-text translation (ST). This talk will delve[…]

Lara J. Martin (University of Maryland) “Neurosymbolic AI or: How I Learned to Stop Worrying and Love the Large Language Model”

February 12, 2024
When: February 16, 2024 @ 12:00 pm – 1:15 pm
Where: Hackerman Hall B17, 3400 N. Charles Street, Baltimore, MD 21218

Abstract: Large language models like ChatGPT have shown extraordinary abilities for writing. While impressive at first glance, large language models aren’t perfect and often make mistakes humans would not make. The main architecture behind ChatGPT[…]

Abeer Alwan (UCLA) “Dealing with Limited Speech Data and Variability: Three case studies”

January 11, 2024
When: February 2, 2024 @ 12:00 pm – 1:15 pm
Where: Hackerman Hall B17, 3400 N. Charles Street, Baltimore, MD 21218

Abstract: Our research focuses on improving speech processing algorithms, such as automatic speech recognition (ASR), speaker identification, and depression detection, under challenging conditions such as limited data (for example, children’s or clinical speech), mismatched conditions[…]

Michael I Mandel (Meta) “Speech and Audio Processing in Non-Invasive Brain-Computer Interfaces at Meta”

January 11, 2024
When: January 29, 2024 @ 12:00 pm – 1:15 pm
Where: Hackerman Hall B17, 3400 N. Charles Street, Baltimore, MD 21218

Abstract: Non-invasive neural interfaces have the potential to transform human-computer interaction by providing users with low-friction, information-rich, always-available inputs. Reality Labs at Meta is developing such an interface for the control of[…]

Karen Livescu (Toyota Technological Institute at Chicago) “What Do Pre-Trained Speech Representation Models Know? Layer-Wise Analysis and Benchmarking”

November 6, 2023
When: December 1, 2023 @ 12:00 pm – 1:15 pm
Where: Hackerman Hall B17, 3400 N. Charles Street, Baltimore, MD 21218

Abstract: Pre-trained speech representation models have become ubiquitous in speech processing over the past few years. They have both improved the state of the art and made it feasible to learn task-specific models with very[…]

Carlos Busso (University of Texas at Dallas) “Multimodal Machine Learning for Human-Centric Tasks”

November 6, 2023
When: November 17, 2023 @ 12:00 pm – 1:15 pm
Where: Hackerman Hall B17, 3400 N. Charles Street, Baltimore, MD 21218

Abstract: The almost unlimited multimedia content available on video-sharing websites has opened new challenges and opportunities for building robust multimodal solutions. This seminar will describe our novel multimodal architectures that (1) are robust to missing[…]

Center for Language and Speech Processing