Tried exactly the same model. And unfortunately the reasoning is just useless. Built it is still not able to tell how many r's in strawberry.
That's a tokenizer issue though?
That's a tokenizer issue though?