Seminars

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety – Jack Zhang (JHU) @ Hackerman Hall B17
Oct 27 @ 12:00 pm – 1:30 pm
Abstract: Harnessing the power of LLMs requires a delicate dance between being helpful and harmless. This creates a fundamental tension between two competing challenges: vulnerability to adversarial attacks that elicit unsafe content, and a tendency[...]

Center for Language and Speech Processing