Auditing Memorization, Dissecting Mechanisms, and Evaluating Behavior of Large Language Models – Robin Jia (USC)
Abstract: The widespread adoption of large language models (LLMs) places a responsibility on the AI research community to rigorously study and understand them. In this talk, I will describe my group’s research on analyzing LLMs’[…]