[OC] LLMs ranked on frontend development and UI generation from 30K+ people

July 20, 2025

3 Comments

adviceguru25 on July 20, 2025 8:04 pm

[Design Arena](https://www.designarena.ai/) is a crowdsourced benchmark where users give large language models a prompt and then compare generations (e.g., websites, games, and images) from several models chosen at random. So far, the voting platform has amassed 30K+ unique users.

The leaderboard above is determined by win rate (the percentage of comparisons in which a user picked model X's generation over the other generation). The Elo rating is an approximation based on win rate that adjusts for the number of battles a model has participated in.
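For readers unfamiliar with the two metrics, here is a minimal sketch of how a win rate and an Elo-style rating could be computed from pairwise votes. It uses the textbook Elo update, which is an assumption and not necessarily the formula Design Arena uses; the function names, the `k` and `start` values, and the model labels are hypothetical.

```python
from collections import defaultdict


def win_rates(battles):
    """Fraction of battles each model won; battles are (winner, loser) pairs."""
    wins = defaultdict(int)
    total = defaultdict(int)
    for winner, loser in battles:
        wins[winner] += 1
        total[winner] += 1
        total[loser] += 1
    return {model: wins[model] / total[model] for model in total}


def elo_ratings(battles, k=32.0, start=1000.0):
    """Textbook Elo update applied in order (assumed, not the site's method)."""
    ratings = defaultdict(lambda: start)
    for winner, loser in battles:
        # Probability the eventual winner was expected to win, given current ratings.
        expected = 1.0 / (1.0 + 10 ** ((ratings[loser] - ratings[winner]) / 400.0))
        delta = k * (1.0 - expected)
        ratings[winner] += delta
        ratings[loser] -= delta
    return dict(ratings)


# Hypothetical votes: model "A" beats "B" twice, then "B" beats "A" once.
battles = [("A", "B"), ("A", "B"), ("B", "A")]
rates = win_rates(battles)
ratings = elo_ratings(battles)
```

Unlike a raw win rate, the Elo update weighs each result by how surprising it was, so an upset against a higher-rated model moves the ratings more than an expected win does.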

We’re always trying to improve the benchmark, so let us know if you have feedback!

UchuYagi on July 20, 2025 8:41 pm

Probably anecdotal, but from my experience in the last ~1yr of heavy usage:

New Code and Refactoring:
1. Claude Sonnet 4
2. Gemini 2.5 Pro
3. o4

Debugging:
1. o4
2. Gemini 2.5 Pro
3. Claude Sonnet 4

This is on massive corporate React and Vue codebases with a few additional libraries.

nut-sack on July 21, 2025 2:38 am

I'm amazed at how many people are using DeepSeek even though it has been shown to communicate back with .cn hosts.