| Screenshot as of February 20, 2025, using the Gradio library | |
Type of site | Artificial intelligence |
|---|---|
| Country of origin | United States |
| Owner | LMSYS Org |
| Founders |
|
| URL | lmarena |
| Registration | Optional |
| Launched | May 3, 2023 |
LMArena (formerly Chatbot Arena) is a public, web-based platform that evaluates large language models (LLMs) through anonymous, crowd-sourced pairwise comparisons. Users enter prompts for two anonymous models to respond to and vote on the model that gave the better response, after which the models' identities are revealed. Users can also choose models to test themselves. [1] [2]
LMArena is popular within the artificial intelligence industry, with major companies supplying their large language models, such as OpenAI's GPT-4o and o1, Google DeepMind's Gemini, [3] and Anthropic's Claude, [4] and using their subsequent rankings to promote them.
The website has even been used for preview releases of upcoming models. Notably, Chinese company DeepSeek tested its prototype models in the LMArena months before its R1 model gained attention in Western media. [5] Other notable pre-release models include OpenAI's GPT-5 under the codename "summit" and Google DeepMind's Gemini 2.5 Flash Image (an image-generation and editing model) under the codename "Nano Banana". [6] [7]
LMArena’s evaluation methodology for large language models has been examined in academic analyses, which have identified specific limitations and suggested areas for improvement. The platform is an active contributor of the AI research ecosystem and has since implemented methodological updates in coordination with ongoing research through its policy updates. [8] [9]
In January 2026, LMArena announced the closing of a $150 million Series A funding round, bringing the company’s post-money valuation to approximately $1.7 billion. The round was led by Felicis and UC Investments (University of California), with participation from Andreessen Horowitz, The House Fund, LDVP, Kleiner Perkins, Lightspeed Venture Partners, and Laude Ventures. LMArena stated that the funding would be used to scale its AI evaluation platform, expand technical and research teams, and support product development following rapid community growth and adoption. [10]