LLM Reference

GPT-4o Models by OpenAI

OpenAIProprietaryProprietaryHighlight
11 models2024–2025Up to 128k ctxFrom $0.15/1M input

Details

ResearcherOpenAI
LicenseProprietary
Commercial useCommercial use with conditions
Models11
Released2024–2025
Max context128k

Capabilities

Vision6 of 11 models
Multimodal3 of 11 models
Function Calling1 of 11 models
Tool Use1 of 11 models
Structured Outputs8 of 11 models
Code Execution6 of 11 models

About

GPT-4o is OpenAI's most advanced model to date. This multimodal model handles both text and image inputs while generating text outputs. Matching the intelligence of GPT-4 Turbo, it is remarkably more efficient, delivering text at twice the speed and at half the cost. Additionally, GPT-4o exhibits the highest vision performance and excels in non-English languages compared to previous OpenAI models.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

8 in view3 retired

Use when the workload needs 128k context and structured outputs.

2025-02128k contextstructured outputs

Use when the workload needs 128k context and structured outputs.

2025-02128k contextstructured outputs

Use when the workload needs 128k context, code execution, and multimodal inputs.

2024-11128k contextcode executionmultimodal inputs

Use when the workload needs 128k context and structured outputs.

2024-11128k contextstructured outputs

Use when the workload needs 128k context.

2024-10128k context

Use when the workload needs 128k context, structured outputs, and prompt caching.

2024-07128k contextstructured outputsprompt caching
ChatGPT-4oCurrent

Use when the workload needs 128k context, code execution, and multimodal inputs.

2024-05128k contextcode executionmultimodal inputs
GPT-4oCurrent

Use when the workload needs 128k context, tool use, and function calling.

2024-05128k contexttool usefunction calling

Release Timeline

6 release groups
2025-02
2 current
GPT-4o Search Preview
128k contextstructured outputs
Current
GPT-4o-mini Search Preview
128k contextstructured outputs
Current
2024-11
2 current
GPT-4o (11-20)
128k contextcode executionmultimodal inputs
Current
GPT-4o (2024-11-20)
128k contextstructured outputs
Current
2024-10
1 current
GPT-4o Audio
128k context
Current
2024-08
1 retired
GPT-4o (08-06)
128k contextstructured outputscode execution
Replaced
2024-07
1 current · 1 retired
GPT-4o Mini (07-18)
128k contextstructured outputscode execution
Replaced
GPT-4o-mini
128k contextstructured outputsprompt caching
Current
2024-05
2 current · 1 retired
ChatGPT-4o
128k contextcode executionmultimodal inputs
Current
GPT-4o
128k contexttool usefunction calling
Current
GPT-4o (05-13)
128k contextstructured outputscode execution
Replaced

Replaced By

Keep for legacy integrations; evaluate GPT-4o before new work.

Keep for legacy integrations; evaluate GPT-4o-mini before new work.

Keep for legacy integrations; evaluate GPT-4o before new work.

Specifications(11 models)

GPT-4o model specifications comparison
ModelReleasedContextParametersVisionMultimodalFn CallingTool UseStructured OutputsCode Exec
GPT-4o-mini Search Preview2025-02128kNoNoNoNoYesNo
GPT-4o Search Preview2025-02128kNoNoNoNoYesNo
GPT-4o (11-20)2024-11128k1.76T (8x222B MoE)*YesNoNoNoNoYes
GPT-4o (2024-11-20)2024-11128kNoNoNoNoYesNo
GPT-4o Audio2024-10128kNoNoNoNoNoNo
GPT-4o-mini2024-07128kNoNoNoNoYesNo
ChatGPT-4o2024-05128kYesNoNoNoNoYes
GPT-4o2024-05128kYesYesYesYesYesYes

Pricing

GPT-4o model pricing by provider
ModelProviderInput / 1MOutput / 1MType
GPT-4o-miniOpenAI API$0.15$0.6Serverless
GPT-4o-miniOpenRouter$0.15$0.6Serverless
GPT-4o-mini Search PreviewOpenRouter$0.15$0.6Serverless
GPT-4o-miniVercel AI Gateway$0.15$0.6Serverless
GPT-4o-mini Search PreviewVercel AI Gateway$0.15$0.6Serverless
GPT-4oReplicate API$2.5$10Serverless
GPT-4o AudioOpenRouter$2.5$10Serverless
GPT-4o Search PreviewOpenRouter$2.5$10Serverless
GPT-4o (2024-11-20)OpenRouter$2.5$10Serverless
GPT-4oOpenRouter$2.5$10Serverless
GPT-4oOpenAI API$2.5$10Serverless
GPT-4o (2024-11-20)OpenAI API$2.5$10Serverless
GPT-4oVercel AI Gateway$2.5$10Serverless

Frequently Asked Questions

What is GPT-4o used for?
GPT-4o is used for vision and multimodal work, agent workflows and tool use, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does GPT-4o compare to GPT Realtime 2?
GPT-4o by OpenAI is strongest where you need vision and multimodal work, while GPT Realtime 2 by OpenAI is the closest related family to check for translation. GPT-4o has 11 listed variants and reaches up to 128k context, while GPT Realtime 2 reaches up to 131k context, so compare the specs and pricing tables before choosing a production model.
Which GPT-4o model should I use?
For the lowest listed input price, start with GPT-4o Mini (07-18) through OpenAI API at $0.15/1M input tokens. For the most capable/latest local choice, evaluate GPT-4o with 128k context and tool use, function calling, structured outputs, and multimodal inputs.