Pinned
Delighted to share our #neurips2023 paper w @grockious @hmd_palangi et al
Evaluating Cognitive Maps & Planning in LLMs with CogEval
We test planning in 8 LLMs.
Failures like hallucinating invalid paths/falling in loops don't support emergent planning.
1/n
arxiv.org/abs/2309.15129



























