Google DeepMind's latest version, Gemini Exp 1114, has achieved significant success on the Chatbot Arena, rising to the top of the overall leaderboard with over 6,000 community votes and performing excellently in multiple areas:
First, we need to understand what LLM Arena is. LLM Arena (or Chatbot Arena) is a platform for evaluating LLMs, primarily aimed at promoting community-driven LLM performance assessments. It is one of the most prestigious evaluation platforms.
From the overall leaderboard, Google's new model Gemini (Exp 1114) scored a remarkable increase of 40+, achieving a score of 1344, while the latest version of ChatGPT 4.0 scored 1340. This seems to be the first time a model from Google has achieved such results.
Gemini-Exp-1114 is tied for first place in the math arena, performing on par with o1:
Currently, Gemini-Exp-1114 can be experienced in conversation at Google AI Studio.
The Terminator is coming.