Events

Karen Livescu (Toyota Technological Institute at Chicago) “What Do Pre-Trained Speech Representation Models Know? Layer-Wise Analysis and Benchmarking”

November 6, 2023
When: December 1, 2023 @ 12:00 pm – 1:15 pm
Where: Hackerman Hall B17, 3400 N. Charles Street, Baltimore, MD 21218

Abstract: Pre-trained speech representation models have become ubiquitous in speech processing over the past few years. They have both improved the state of the art and made it feasible to learn task-specific models with very[…]

Carlos Busso (University of Texas at Dallas) “Multimodal Machine Learning for Human-Centric Tasks”

November 6, 2023
When: November 17, 2023 @ 12:00 pm – 1:15 pm
Where: Hackerman Hall B17, 3400 N. Charles Street, Baltimore, MD 21218

Abstract: The almost unlimited multimedia content available on video-sharing websites has opened new challenges and opportunities for building robust multimodal solutions. This seminar will describe our novel multimodal architectures that (1) are robust to missing[…]

Florian Metze (CMU) “Masked Autoencoders that Listen”

November 6, 2023
When: November 10, 2023 @ 12:00 pm – 1:15 pm
Where: Hackerman Hall B17, 3400 N. Charles Street, Baltimore, MD 21218

Abstract: In this talk, I will present a simple extension of image-based Masked Autoencoders (MAE) to self-supervised representation learning from audio spectrograms. Following the Transformer encoder-decoder design in MAE, our Audio-MAE first encodes audio spectrogram[…]

Student Seminar – Neha Verma “Exploring Geometric Representational Disparities Between Multilingual and Bilingual Translation Models”

November 6, 2023
When: November 6, 2023 @ 12:00 pm – 1:15 pm
Where: Hackerman Hall B17, 3400 N. Charles Street, Baltimore, MD 21218

Abstract: Multilingual machine translation has proven immensely useful for both parameter efficiency and overall performance for many language pairs via complete parameter sharing. However, some language pairs in multilingual models can see worse performance than[…]

Center for Language and Speech Processing