Hacker News

MVissers 10/11/2024

Which model? The field moves so fast it’s hard to validate statements like this without that info.

o1-preview?


Replies

hintymad 10/11/2024

GPT-4o. I tried only a few samples on o1-preview, and the results were bad. That sample was too small to have any statistical significance, though.
