I think for now it's better to convert the tokens into code (or library code) and work from that, so the results are deterministic, rather than relying on Claude to get it right every time.