logoalt Hacker News

denysvitalilast Saturday at 10:54 AM2 repliesview on HN

According to https://github.com/IQuestLab/IQuest-Coder-V1/issues/14#issue... the result is still good after fixing the cheating problem. 76.2% (from 81.4%) which still beats Opus 4.5 (74.4%)!!


Replies

ipythonlast Saturday at 1:14 PM

Unfortunately they seem to have neglected to update their front page readme with this information, continuing to mislead people: https://github.com/IQuestLab/IQuest-Coder-V1

show 1 reply
alexpop80last Monday at 12:08 PM

What do you mean? Opus 4.5 and GPT 5.2 broke the 80% mark and no other models yet seem to be passing this important milestone.