Note that this uses a harness so it doesn't qualify for the official ARC-AGI-3 leaderboard Ac...

lairv • today at 1:32 AM • 5 replies • view on HN

Note that this uses a harness so it doesn't qualify for the official ARC-AGI-3 leaderboard

According to the authors the harness isn't ARC-AGI specific though https://x.com/agenticasdk/status/2037335806264971461

Replies

It is 100% ARC-AGI-3 specific though, just read through the prompts https://github.com/symbolica-ai/ARC-AGI-3-Agents/blob/symbol...

➕ show 3 replies

krackers • today at 3:14 AM

> this uses a harness

This seems like an arbitrary restriction. Tool-use requires a harness, and their whitepaper never defines exactly what counts as valid.

➕ show 2 replies

osti • today at 3:32 AM

Doesn't the chat version of chatgpt or gemini also have interleaved tool calls, so do those also count as with harnesses?

➕ show 1 reply

mmaunder • today at 4:34 AM

We're calling agents harnesses now?

➕ show 3 replies

falcor84 • today at 1:52 AM

I for one think that harness development is perhaps the most interesting part at the moment and would love to have an alternative leaderboard with harnesses.

➕ show 2 replies

alt Hacker News

Replies