| Screenshot as of January 28, 2026 | |
Type of site | Artificial intelligence |
|---|---|
| Country of origin | United States |
| Founders |
|
| URL | arena |
| Registration | Optional |
| Launched | April 24, 2023 |
Arena [1] (formerly LMArena and Chatbot Arena) is a public, web-based platform that evaluates large language models (LLMs) through anonymous, crowd-sourced pairwise comparisons. Users enter prompts for two anonymous models to respond to and vote on the model that gave the better response, after which the models' identities are revealed. Users can also choose models to test themselves. [2] [3]
Arena is popular within the artificial intelligence industry, with major companies supplying their large language models, such as OpenAI's GPT-4o and o1, Google DeepMind's Gemini, [4] and Anthropic's Claude, [5] and using their subsequent rankings to promote them.
The website has been used for preview releases of upcoming models. Notably, Chinese company DeepSeek tested its prototype models in the Arena months before its R1 model gained attention in Western media. [6] Other notable pre-release models include OpenAI's GPT-5 under the codename "summit" and Google DeepMind's Gemini 2.5 Flash Image (an image-generation and editing model) under the codename "Nano Banana". [7] [8]
Arena's evaluation methodology for large language models has been examined in academic analyses, which have identified specific limitations and suggested areas for improvement. The platform is an active contributor of the AI research ecosystem and has since implemented methodological updates in coordination with ongoing research through its policy updates. [9] [10]
Chatbot Arena was released on April 24, 2023. During the first week, Vicuna (vicuna-13b), an LLM fine-tuned from LLaMA by LMSYS was ranked at #1, with an ELO of 1169, followed by Koala (koala-13b), a dialogue model by BAIR at #2 with an ELO of 1082, and Oasst Pythia (oasst-pythia-12b), an LLM by LAION at #3 with an ELO of 1065. [11] In the second week, GPT-4, Claude-v1, and GPT-3.5 were added to the arena alongside RWKV-4-Raven-14B. [12] The website had collected over 130,000 votes by December 2023. [13]
In July of that year, Chatbot Arena publicly released two datasets, one consisting of 33 thousand crowd-sourced conversations and the other consisting of three thousand expert conversations. [14]
In June 2024, Chatbot Arena introduced image support. [15]
In September 2024, Chatbot Arena moved to its own dedicated domain name, lmarena.ai (or LMArena), [16] thus separating from LMSys. [17]
In April 2025, LMArena introduced the Search Arena, which benchmarks search-enabled LLMs. [18] That same month, LMArena incorporated as an independent company, [19] and launched a new UI beta at beta.lmarena.ai, [20] which collected over 40,000 votes. [21] That May, LMArena raised $100 million in a seed funding round, valuing the company at $600 million. [22] Participants in the seed funding round included Andreessen Horowitz, UC Investments, Lightspeed Venture Partners, Felicis Ventures, and Kleiner Perkins. [22] In June, they moved the new UI out of beta. [21]
On January 6, 2026, LMArena announced the closing of a $150 million Series A funding round, bringing the company’s post-money valuation to approximately $1.7 billion. The round was led by Felicis and UC Investments (University of California), with participation from Andreessen Horowitz, The House Fund, LDVP, Kleiner Perkins, Lightspeed Venture Partners, and Laude Ventures. LMArena stated that the funding would be used to scale its AI evaluation platform, expand technical and research teams, and support product development following rapid community growth and adoption. [23]
On January 21, 2026, LMArena introduced Video Arena. [24] Registered users are rate limited to 3 generations per 24 hours, and videos are typically 5-8 seconds long. [25] Registration is required to generate videos. [24] [25]
On January 28, 2026, LMArena rebranded to "Arena". [1]
In February 2026, Arena released a model router called Max, which was trained on over 5 million votes. Max was previously launched in Battle mode under the codenames theta-hat and arcstride. [26]