Transformers can easily be trained / designed to handle grids, it's just that off the shelf standard LLMs haven't been particularly, (although they would have seen some)
Are there some well-known examples of success in it?
Are there some well-known examples of success in it?