Dissecting the Ullman Variations with a SCALPEL: Why do LLMs fail at
  Trivial Alterations to the False Belief Task?

Dissecting the Ullman Variations with a SCALPEL: Why do LLMs fail at Trivial Alterations to the False Belief Task?

Papers citing "Dissecting the Ullman Variations with a SCALPEL: Why do LLMs fail at Trivial Alterations to the False Belief Task?"