u/Effective_Rip2500

Have any of you tried getting an LLM to play Sokoban? I'm curious: given the current state of LLMs, can they actually complete a level without using external tools?

I gave a model the level below. It looks like a dead-simple logic puzzle, but for some reason the model consistently fails to solve it.

#######
#P    #
# # # #
# B B #
#.  . #
#     #
#######

Legend:

  • "P": The player-controlled character.
  • "B": The boxes.
  • ".": The goal squares for the boxes.
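
For comparison with what the model is being asked to do in its head, here is a rough Python sketch of a brute-force breadth-first search over (player position, box positions) states for this level. The names `parse` and `solve` and the U/D/L/R move encoding are illustrative choices, and it assumes "#" is a wall and a blank is open floor, which the legend does not spell out.

```python
from collections import deque

LEVEL = """\
#######
#P    #
# # # #
# B B #
#.  . #
#     #
#######"""

# Move letter -> (row delta, column delta). Illustrative encoding.
MOVES = {"U": (-1, 0), "D": (1, 0), "L": (0, -1), "R": (0, 1)}


def parse(level):
    """Turn the ASCII grid into sets of coordinates."""
    walls, boxes, goals, player = set(), set(), set(), None
    for r, row in enumerate(level.splitlines()):
        for c, ch in enumerate(row):
            if ch == "#":      # assumption: '#' is a wall
                walls.add((r, c))
            elif ch == "B":
                boxes.add((r, c))
            elif ch == ".":
                goals.add((r, c))
            elif ch == "P":
                player = (r, c)
    return walls, frozenset(boxes), frozenset(goals), player


def solve(level):
    """Return a string of moves that puts every box on a goal, or None."""
    walls, boxes, goals, player = parse(level)
    start = (player, boxes)
    queue = deque([(start, "")])
    seen = {start}
    while queue:
        (pos, bxs), path = queue.popleft()
        if bxs == goals:
            return path
        for name, (dr, dc) in MOVES.items():
            nxt = (pos[0] + dr, pos[1] + dc)
            if nxt in walls:
                continue
            new_bxs = bxs
            if nxt in bxs:
                # Walking into a box pushes it one square further.
                beyond = (nxt[0] + dr, nxt[1] + dc)
                if beyond in walls or beyond in bxs:
                    continue  # blocked push
                new_bxs = (bxs - {nxt}) | {beyond}
            state = (nxt, new_bxs)
            if state not in seen:
                seen.add(state)
                queue.append((state, path + name))
    return None


if __name__ == "__main__":
    print(solve(LEVEL))
```

Note that both boxes start directly below a wall, so neither can be pushed straight down from where it sits; each has to be shoved sideways first. That is the kind of constraint the search handles mechanically, and it may be part of what trips the model up.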

Why do you think such a powerful reasoning engine struggles with such basic spatial logic? Is it a "tokenization" issue or a lack of an internal world model?

u/Effective_Rip2500 — 7 days ago