Events

Neha Verma (JHU) – “Merging Feed-Forward Sublayers for Compressed Transformers”

October 25, 2024

When: October 28, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract With the rise and ubiquity of larger deep learning models, the need for high-quality compression techniques has been growing in order to deploy these models widely. The sheer parameter count of some models makes[…]

Leo Du (JHU) “Discrete Gradient-based Sampling with applications to Language Models”

October 21, 2024

When: October 21, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract Gradient-based sampling algorithms are a cornerstone of modern Bayesian computation, widely used in applications ranging from probabilistic programming to diffusion models. While these methods perform exceptionally well in continuous domains, extending them to discrete[…]

Jack Zhang (JHU) “Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements”

October 9, 2024

When: October 14, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract The current paradigm for safety alignment of large language models (LLMs) follows a one-size-fits-all approach: the model refuses to interact with any content deemed unsafe by the model provider. This approach lacks flexibility in[…]

Roger Grosse (University of Toronto) “Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo”

October 4, 2024

When: October 11, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract Numerous capability and safety techniques of Large Language Models (LLMs), including RLHF, automated red-teaming, prompt engineering, and infilling, can be cast as sampling from an unnormalized target distribution defined by a given reward or[…]

Sebastian Nehrdich (Berkeley) “MITRA: Beyond Just Machine Translation for Premodern Asian Low Resource Languages”

September 30, 2024

When: October 25, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract Recent years saw the rise of multilingual language models that achieve high levels of performance for a large number of tasks, with some of them handling hundreds of languages at once. Premodern languages are[…]

Nate Robinson (JHU) “NLP for Related Languages”

September 30, 2024

When: October 7, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract In the age of data- and capital-driven machine learning, the gap between technological advancements for high- and low-resource language varieties keeps growing, leaving many with the greatest need for language technologies without access to[…]

Helin Wang (JHU) “Solo Audio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer”

September 23, 2024

When: September 27, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract SoloAudio is an innovative diffusion-based generative model designed for target sound extraction (TSE). It trains latent diffusion models on audio, replacing the previous U-Net backbone with a skip-connected Transformer that operates on latent features.[…]

Neha Verma (JHU) – “Merging Feed-Forward Sublayers for Compressed Transformers”

Leo Du (JHU) “Discrete Gradient-based Sampling with applications to Language Models”

Jack Zhang (JHU) “Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements”

Roger Grosse (University of Toronto) “Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo”

Sebastian Nehrdich (Berkeley) “MITRA: Beyond Just Machine Translation for Premodern Asian Low Resource Languages”

Nate Robinson (JHU) “NLP for Related Languages”

Helin Wang (JHU) “Solo Audio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer”

Subscribe

Upcoming Seminars

Center for Language and Speech Processing