A benchmark comparing conventional UUIDs and AIDs across models, measuring hallucination rate and token usage, would be cool!