The full dataset is here - | alt Hacker News

alt Hacker News

codelion • 01/21/2025 • 0 replies • view on HN

The full dataset is here - https://huggingface.co/datasets/AI-MO/aimo-validation-aime you can use the eval script I have in optillm to benchmark on it - https://github.com/codelion/optillm/blob/main/scripts/eval_a...