M
METR
Benchmarks & Evals
Time Horizon Benchmark
METR Time Horizon goes vertical: Opus 4.6 hits ~14.5-hour tasks
METR's updated Time Horizon benchmark shows Claude Opus 4.6 completing tasks equivalent to roughly 14.5 hours of expert human work, with the autonomy doubling time now cited at 49 days. The panel treated this as the week's strongest evidence that agent capability growth has entered a visibly faster phase.
14.5h METR Time Horizon49 days Autonomy Doubling Time