Bhavesh Kumar is a researcher at Nous Research and co-creator of Husky Hold'em Bench, an agent benchmark that evaluates LLMs by having them design and implement poker bots that compete in a round-robin tournament. His work assesses strategic reasoning and software engineering skill together.
Bhavesh Kumar appeared as a guest on ThursdAI, the weekly AI news podcast hosted by Alex Volkov.