It gets it wrong because the current "AI" for filling out forms is extremely weak and brittle compared to the general language models we have now.
Do you have an example form field that a general language model could fill out better than a human + highly focussed deterministic algorithm?
Language models seem pretty weak and brittle in my interactions with them too.