Events

Jack Zhang (JHU) “Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements”

October 9, 2024

When: October 14, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract The current paradigm for safety alignment of large language models (LLMs) follows a one-size-fits-all approach: the model refuses to interact with any content deemed unsafe by the model provider. This approach lacks flexibility in[…]

Roger Grosse (University of Toronto) “Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo”

October 4, 2024

When: October 11, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract Numerous capability and safety techniques of Large Language Models (LLMs), including RLHF, automated red-teaming, prompt engineering, and infilling, can be cast as sampling from an unnormalized target distribution defined by a given reward or[…]

Sebastian Nehrdich (Berkeley) “MITRA: Beyond Just Machine Translation for Premodern Asian Low Resource Languages”

September 30, 2024

When: October 25, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract Recent years saw the rise of multilingual language models that achieve high levels of performance for a large number of tasks, with some of them handling hundreds of languages at once. Premodern languages are[…]

Nate Robinson (JHU) “NLP for Related Languages”

September 30, 2024

When: October 7, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract In the age of data- and capital-driven machine learning, the gap between technological advancements for high- and low-resource language varieties keeps growing, leaving many with the greatest need for language technologies without access to[…]

Helin Wang (JHU) “Solo Audio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer”

September 23, 2024

When: September 27, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract SoloAudio is an innovative diffusion-based generative model designed for target sound extraction (TSE). It trains latent diffusion models on audio, replacing the previous U-Net backbone with a skip-connected Transformer that operates on latent features.[…]

Orion Weller (JHU) – “Using Language Models for Instruction-Based Information Retrieval”

September 18, 2024

When: September 23, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract Just as instructions have enabled complex user tasks and zero-shot optimization via prompts, so can instructions in retrieval enable complex information needs with zero-shot adaptation. In this talk, we describe efforts to build a[…]

Tatsu Hashimoto (Stanford University) “Improving LLM generalization by selecting and synthesizing data”

September 13, 2024

When: October 4, 2024 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST, Baltimore, MD 21218

Abstract Language model pretraining has been a remarkably strong recipe for cross-task and cross-domain generalization in NLP. However, these gains have come at the expense of control: we rarely control the training data for language[…]

Jack Zhang (JHU) “Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements”

Roger Grosse (University of Toronto) “Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo”

Sebastian Nehrdich (Berkeley) “MITRA: Beyond Just Machine Translation for Premodern Asian Low Resource Languages”

Nate Robinson (JHU) “NLP for Related Languages”

Helin Wang (JHU) “Solo Audio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer”

Orion Weller (JHU) – “Using Language Models for Instruction-Based Information Retrieval”

Tatsu Hashimoto (Stanford University) “Improving LLM generalization by selecting and synthesizing data”

Subscribe

Upcoming Seminars

Center for Language and Speech Processing