llmreference
checked today

1,713 models · 132 providers · 231 labs

Pick the right model in 30 seconds.

Benchmark-backed, price-aware, refreshed daily.

I'm building for [task] in [use case] at [budget] and need [constraint]

Shortlist · coding / ≤ $5 / 200K + tools

Three picks, ranked by quality per dollar

DEFAULT PICK · 5d

DeepSeek · 1M context · released Apr 24

$/1M out: $0.280 · $0.140 in

Q/$ (quality per $): B · #34 of 98

SWE-bench Pro: 52.6% · 98 priced candidates

Best Q/$ at this budget. SWE-bench Pro 52.6 on DeepSeek Platform drives the rank.

ALT PICK · 30d

DeepSeek · 160K context · released Jan 1

$/1M out: $0.378 · $0.252 in

Q/$ (quality per $): B · #35 of 98

SWE-bench Verified: 70% · 98 priced candidates

Alternate pick — competitive SWE-bench Verified at 70.0 with OpenRouter pricing.

BUDGET PICK · 30d

DeepSeek · 164K context · released Apr 10

$/1M out: $0.410 · $0.270 in

Q/$ (quality per $): B · #39 of 98

SWE-bench Verified: 68.4% · 98 priced candidates

Budget pick — same task fit, cheapest output on OpenRouter.
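
The page does not say how "quality per dollar" is computed. As a rough sketch only, assuming it is the benchmark score divided by the output price per 1M tokens (an assumption, not the site's documented formula), the three picks above can be ranked like this. The `Candidate` class and `rank_by_quality_per_dollar` function are hypothetical names used for illustration, and the scores come from different benchmarks (SWE-bench Pro and SWE-bench Verified), so the comparison is indicative at best.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """One shortlist card; field names are illustrative, not the site's schema."""
    label: str
    benchmark: str
    score_pct: float        # benchmark score in percent
    usd_per_1m_out: float   # output price in USD per 1M tokens

    @property
    def quality_per_dollar(self) -> float:
        # ASSUMPTION: quality per dollar = score / output price.
        # The real ranking may weight input price, context, or other signals.
        return self.score_pct / self.usd_per_1m_out


def rank_by_quality_per_dollar(candidates):
    """Return candidates sorted from highest to lowest quality per dollar."""
    return sorted(candidates, key=lambda c: c.quality_per_dollar, reverse=True)


# Figures copied from the three cards above.
SHORTLIST = [
    Candidate("BUDGET PICK", "SWE-bench Verified", 68.4, 0.410),
    Candidate("DEFAULT PICK", "SWE-bench Pro", 52.6, 0.280),
    Candidate("ALT PICK", "SWE-bench Verified", 70.0, 0.378),
]

for c in rank_by_quality_per_dollar(SHORTLIST):
    print(f"{c.label}: {c.quality_per_dollar:.1f} points per $ ({c.benchmark})")
```

Under this assumed formula the order comes out as DEFAULT PICK (187.9), ALT PICK (185.2), BUDGET PICK (166.8), which matches the order of the cards.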

Pulse · this week

What changed in the model market

99 new models

Composer 2.5 · Sora 2 · GPT-5.3

78 price cuts

Verified provider price reductions

13 benchmark refreshes

646 scores tracked across major suites

cheapest output · $/1M

$0 · CoBuddy

Cheat sheet

Most-asked comparisons
