See https://arcprize.org/arc-agi-pub.
This allows for arbitrary API usage unlike the ARC-AGI prize.
I will resolve based on when the date when the method is first reported even if it is not submitted until substantially later (due to e.g. no public API).
I will apply a restriction roughly in line with the $10,000 runtime cost contraint, though it is unclear how this should resolve with respect to private models with no known API price.
I will exclude solutions that seem heavily overfit based on the semi-private evaluation set.
(Cut off data is resolved based on midnight PT.)
@RyanGreenblatt I think this resolves NO? o3 did pass 85%, but it was well above the cost constraints.
wow, 83% on public with high efficiency. So close (and I feel the mana punishment for trading too fast before reading the fine print)
If 85% is reached on the main ARC-AGI prize track and there isn't a clear reason to think this wouldn't transfer to ARC-AGI-Pub, I would also resolve this market to true even if no one submits to ARC-AGI-Pub.