Making LLMs Reason Better, Faster, and Longer – Monday, April 27, 2026 – Mirella Lapata (U of Edinburgh)
Abstract Recent “reasoning models” improve LLM performance by generating chains of thought before producing an answer, with the reasoning process itself optimized through reinforcement learning. This paradigm has delivered impressive results on math and coding[…]