ARC-AGI-3 public set result
Agentica claims to solve all public ARC-AGI-3 tasks
Agentica published a claim of solving all public ARC-AGI-3 tasks, adding to the week's theme of benchmark saturation. The panel discussed it alongside METR and ARC-AGI-2 results as part of weighing signal versus noise in headline benchmark leaps.