LLM Reference
checked today

1,714 models · 132 providers · 231 labs

Pick the right model in 30 seconds.

Benchmark-backed, price-aware, refreshed daily.

I'm building for [task] in [use case] at [budget] and need [constraint]

Shortlist · coding / ≤ $5 / 200K + tools

Three picks, ranked by quality per dollar

DEFAULT PICK · 1d

DeepSeek · 1M context · released Apr 24

Price per 1M tokens: $0.224 out · $0.112 in

Quality per $ (Q/$): B · #30 of 104

SWE-bench Pro: 52.6% · 104 priced candidates

Best Q/$ at this budget. A SWE-bench Pro score of 52.6% at OpenRouter pricing drives the rank.

ALT PICK · today

DeepSeek · released May 28

Price per 1M tokens: $0.300 out · $0.100 in

Quality per $ (Q/$): B · #33 of 104

SWE-bench Verified: 57.6% · 104 priced candidates

Alternate pick: a competitive SWE-bench Verified score of 57.6% at Novita AI pricing.

BUDGET PICK · 1d

DeepSeek · 160K context · released Dec 1

Price per 1M tokens: $0.378 out · $0.252 in

Quality per $ (Q/$): B · #34 of 104

SWE-bench Verified: 70% · 104 priced candidates

Budget pick: same task fit, cheapest output on OpenRouter.
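The page does not say how "quality per dollar" is computed. As a rough sketch only, the snippet below assumes Q/$ is the benchmark score divided by the output price per 1M tokens, and applies that to the three cards above. The `Pick` class and `quality_per_dollar` function are hypothetical names introduced for illustration. The picks are also scored on different benchmarks (SWE-bench Pro vs. SWE-bench Verified), so the resulting numbers are not strictly comparable.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pick:
    """One shortlist card (hypothetical structure, not the site's schema)."""
    label: str             # pick label as shown on the card
    benchmark: str         # benchmark the card reports
    score_pct: float       # benchmark score, in percent
    usd_per_1m_out: float  # output price per 1M tokens, in USD


def quality_per_dollar(pick: Pick) -> float:
    # ASSUMPTION: Q/$ = benchmark points per dollar of 1M output tokens.
    # The real ranking formula is not given on the page.
    return pick.score_pct / pick.usd_per_1m_out


# Figures copied from the three cards above.
picks = [
    Pick("BUDGET PICK", "SWE-bench Verified", 70.0, 0.378),
    Pick("DEFAULT PICK", "SWE-bench Pro", 52.6, 0.224),
    Pick("ALT PICK", "SWE-bench Verified", 57.6, 0.300),
]

# Highest assumed Q/$ first.
ranked = sorted(picks, key=quality_per_dollar, reverse=True)

for p in ranked:
    print(f"{p.label:<12} {quality_per_dollar(p):7.1f}  ({p.benchmark})")
```

Under this assumed formula the order comes out as default, alt, budget, which agrees with the #30, #33, #34 ranks shown on the cards. That agreement is suggestive, not proof that the site uses this formula.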

Pulse · this week

What changed in the model market

1,076 new models

Qwen3.7-Max · Gemini 3.5 Flash · Composer 2.5

96 price cuts

Verified provider price reductions

8 benchmark refreshes

713 scores tracked across major suites

Top-lab output price per 1M tokens: $0.260 · Hunyuan HY3 Preview

Cheat sheet

Most-asked comparisons
