The non-hallucination rate in AA-omniscience is SOTA, better than Opus 4.7, Gemini 3.1 Pro and GPT5.5! Congrats to the team
wonder at which level there's a capability state transition? 5%? 1%?
Truly incredible! Very impressed by their progress. I wonder how much of their own chips did they use for training.
referencing this:
https://artificialanalysis.ai/evaluations/omniscience?models...
(had to add it to the chart, wasn't displayed by default. is it the lowest rate in the datasetor no?)