| Sun | Mon | Tue | Wed | Thu | Fri | Sat |
|---|---|---|---|---|---|---|
|
The Alignment Waltz: Jointly Training Agents to Collaborate for Safety – Jack Zhang (JHU)
12:00 pm
The Alignment Waltz: Jointly Training Agents to Collaborate for Safety – Jack Zhang (JHU)
@ Hackerman Hall B17
Oct 27 @ 12:00 pm β 1:30 pm
Abstract Harnessing the power of LLMs requires a delicate dance between being helpful and harmless. This creates a fundamental tension between two competing challenges: vulnerability to adversarial attacks that elicit unsafe content, and a tendency[...]
|