Topic 1: On the Failure of Latent State Persistence in Large Language Models – Jay Huang – Friday, April 3, 2026

When:
April 3, 2026 @ 12:00 pm – 1:15 pm
2026-04-03T12:00:00-04:00
2026-04-03T13:15:00-04:00
Where:
Hackerman Hall B17
Cost:
Free

Abstract

While Large Language Models (LLMs) excel in reasoning, whether they can sustain persistent latent states remains under-explored. The capacity to maintain and manipulate unexpressed, internal representations-analogous to human working memory-is a cornerstone of complex reasoning. In this paper, we formalize and quantify the “Latent State Persistence” (LSP) gap through three novel experiments. First, we utilize a Number Guessing Game, demonstrating that across independent queries, LLMs fail to allocate probability mass to a singular hidden choice, violating a fundamental probabilistic principle. Second, we employ a Yes-No Game to show that as the number of questions increases, LLMs suffer from “concept drift,” leading to inevitable self-contradictions due to the lack of LSP. Finally, inspired by Mathematical Mentalism, we task models with tracking transformations on hidden variables, revealing a failure in variable binding and state evolution when the initial state is not explicitly present in the context. Collectively, these findings suggest that LLMs function as reactive post-hoc solvers rather than proactive planners with LSP. Our work provides a framework for evaluating the fidelity of internal representations and highlights a fundamental architectural divergence between autoregressive transformers and human-like cognition.

Bio

Jen-Tse (Jay) Huang is a postdoctoral researcher at the Center for Language and Speech Processing (CLSP) at Johns Hopkins University, working with Mark Dredze. He received his Ph.D. in Computer Science and Engineering from the Chinese University of Hong Kong and his B.Sc. from Peking University. His research explores the alignment between human and AI, leveraging psychological, cognitive, and behavioral sciences. His work has been published in top-tier AI venues, with one oral presentation at ICLR 2024. He actively serves as an area chair for conferences including NeurIPS and ACL Rolling Review.

Also Available by Zoomhttps://wse.zoom.us/j/96735183473

Center for Language and Speech Processing