Google has reclaimed the top spot in a valued AI benchmark table, knocking OpenAI into second place.
On the respected Chatbot Arena leaderboard, the Alphabet-owned company has assumed the lead with the introduction of its Gemini-Exp-1206 experimental model. Previously, Sam Altman’s company was in pole position with its ChatGPT-4o offering, just shading Gemini-Exp-114 which was released on November 15.
Those competing LLMs were effectively matched, with Google appearing to close the gap on its nearest competitor.
Chatbot Arena reported the new Google Gemini version showed significant improvement across important categories including mathematics, creative writing, and visuals, with a 40-point improvement on previous offerings. Despite this, Tech Crunch has outlined how the current AI benchmarking approach could vastly oversimplify model evaluation.
What a way to celebrate one year of incredible Gemini progress — #1