Upcoming Seminars
Fri, Sep 19, 12:00 pm
Supervising Models that are Smar...
@ Hackerman Hall B17
Sep 19 @ 12:00 pm – 1:15 pm
Abstract: Advanced AI systems are being deployed for more and more complex tasks. To ensure reliable human oversight over AIs, we need supervision protocols that remain effective despite the increase in task complexity and model[...]
Fri, Sep 26, 12:00 pm
Auditing Memorization, Dissectin...
@ Hackerman Hall B17
Sep 26 @ 12:00 pm – 1:15 pm
Abstract: The widespread adoption of large language models (LLMs) places a responsibility on the AI research community to rigorously study and understand them. In this talk, I will describe my group's research on analyzing LLMs'[...]