losclouds

GPT-4o mini

Low-cost OpenAI family for high-volume chat, classification, and lightweight tool tasks.

Creator: OpenAI
Lifecycle: Active
Context: 128.0K
Max output: 16.4K
Released: Jul 18, 2024
Status: up
Input: $0.15 / 1M tokens
Output: $0.60 / 1M tokens
Cached read: $0.07 / 1M tokens
Cached write: $0.15 / 1M tokens
Batch discount: 50%
Source: OpenAI API pricing
Verified: Apr 1, 2026 (High)
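
The per-token rates listed above can be turned into a rough per-request estimate. The following is a minimal sketch, not an official billing formula: the function name `estimate_cost_usd` and its parameters are hypothetical, and it assumes that cached tokens are a subset of input tokens billed at the cached-read rate, and that the batch discount applies uniformly to the whole request. Cached-write charges are left out for simplicity.

```python
# Rates copied from the listing above, in USD per 1M tokens.
INPUT_PER_M = 0.15
OUTPUT_PER_M = 0.60
CACHED_READ_PER_M = 0.07
BATCH_DISCOUNT = 0.50  # listed as 50%


def estimate_cost_usd(input_tokens, output_tokens, cached_input_tokens=0, batch=False):
    """Estimate the cost of one request in USD.

    Assumptions (not stated on the page): cached_input_tokens is the part of
    input_tokens served from the prompt cache, and the batch discount is
    applied to the full request total.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if not 0 <= cached_input_tokens <= input_tokens:
        raise ValueError("cached_input_tokens must be between 0 and input_tokens")

    uncached_tokens = input_tokens - cached_input_tokens
    cost = (
        uncached_tokens * INPUT_PER_M
        + cached_input_tokens * CACHED_READ_PER_M
        + output_tokens * OUTPUT_PER_M
    ) / 1_000_000

    if batch:
        cost *= 1 - BATCH_DISCOUNT
    return cost


if __name__ == "__main__":
    # One million input tokens plus one million output tokens, no caching.
    print(f"${estimate_cost_usd(1_000_000, 1_000_000):.4f}")
```

Under these assumptions, one million input tokens plus one million output tokens comes to $0.15 + $0.60 = $0.75, or half that when sent through the batch path.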

Capabilities

Modalities: text, image (input); text (output)
Capabilities: webSearch, imageInput, batchSupport, promptCaching, functionCalling, structuredOutputs
Strengths: Lowest-cost OpenAI text family; good throughput for agent fan-out; useful multimodal input support
Tradeoffs: Less robust reasoning than GPT-5.4; smaller output budget
Benchmark Coverage

Benchmark | Version | Score | Date | Source | Notes
MMLU | 2025-rolling | 82.4 accuracy | Dec 12, 2025 | OpenAI model guide | Use for directional comparison only.

Release History

Release | Alias | Lifecycle | Release Date | Deprecation | Shutdown | Summary
GPT-4o mini | gpt-4o-mini | Active | Jul 18, 2024 | - | - | Cost-efficient family released for high-volume production traffic.

Host Coverage

Host | Type | Context | Pricing Note | Differences
OpenAI API | first-party | 128.0K | Reference pricing and full tool support. | Prompt caching; Batch API
Azure OpenAI | cloud | 128.0K | Enterprise hosting with separate regional rollouts. | Azure quota management

Migration Guidance

Default cost-down migration target for GPT-4o or GPT-5.4 traffic that does not need top-end reasoning.

Replacement models: gpt-5-4

Breaking changes: Reasoning depth and output budget are reduced versus flagship families.

Change Events

Date | Type | Title | Description | Source
Jul 18, 2024 | family_added | Family launched | GPT-4o mini launched as OpenAI's low-cost production family. | OpenAI changelog

Other models from OpenAI

GPT-4.1, GPT-4.1 mini, GPT-4.1 nano, GPT-4o, GPT-5.4, GPT-5.4 mini, GPT-5.4 nano