isn't 5 tok/s like 100wpm? Pretty standard typing speed.
You also would need to compare token generation not with the actual output, but with the thoughts and deleted and edited parts.