Benchmarks & EvalsOpen weights
ZeroBench
ZeroBench: the 'impossible' benchmark where all top VLMs score zero
A new benchmark called ZeroBench launched, claiming to be the impossible benchmark for vision-language models: all current top-of-the-line VLMs score zero on it. Tasks include visually demanding puzzles like reading a question written in the shape of a star hidden among scattered letters, highlighting how far VLMs still are from true visual understanding.