Benchmarks & Evals
ARC-AGI-2 SOTA result
Confluence Labs exits stealth with 97.9% SOTA on ARC-AGI-2
Confluence Labs emerged from stealth with a 97.9% state-of-the-art result on the ARC-AGI-2 benchmark, publishing code on GitHub. The panel read it as a major signal that ARC-AGI-2 is near saturation, part of a broader pattern of benchmarks getting solved faster than expected.
97.9% ARC-AGI-2