GPT-5

OpenAI

Context: 128K tokens
Max output: 16K tokens
License: proprietary
Released: 2026-02-14
Modalities: text, vision, audio
Pricing: input $5.00/1M tokens · output $20.00/1M tokens · effective 2026-02-14
Benchmarks
MMLU: 87.5%
GPQA Diamond: 68.0%
SWE-bench: 59.5%
LiveCodeBench: 52.1%
AIME 2025: 41.9%
ARC-AGI 2: 16.4%
Changelog
  1. 2026-03-01
    Vision + audio modalities added
    modalities: [text] → [text, vision, audio]
  2. 2026-02-14
    GA launch
    model.released: null → 2026-02-14

GPT-5

OpenAI's 2026 flagship multimodal model and the successor to GPT-4o. It offers a 128K-token context window, accepts text, vision, and audio input, and is priced at $5.00 per 1M input tokens and $20.00 per 1M output tokens, which sits slightly above Claude Sonnet 4 on input and is comparable on output.
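The per-token pricing can be turned into a rough per-request cost estimate. The sketch below uses only the two prices listed on this card. The helper name `estimate_cost_usd` and the example token counts are hypothetical, and it assumes flat per-token billing with no caching, batch, or per-modality adjustments.

```python
# Prices copied from this card (USD per 1M tokens, effective 2026-02-14).
INPUT_USD_PER_MTOK = 5.00
OUTPUT_USD_PER_MTOK = 20.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper).

    Assumes flat per-token billing; real invoices may differ.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MTOK
             + output_tokens * OUTPUT_USD_PER_MTOK)
    return total / 1_000_000


# Example: 10,000 input tokens and 2,000 output tokens.
print(f"${estimate_cost_usd(10_000, 2_000):.4f}")  # prints $0.0900
```

Because output tokens cost four times as much as input tokens at these rates, long generations tend to dominate the bill even when prompts are large.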

Notable features

  • Multimodal: text, vision, and audio in a single endpoint (the changelog records vision and audio as added on 2026-03-01, after the 2026-02-14 GA launch).
  • 16K output cap: double the 8K output ceiling of most contemporaries, which is useful for long-form structured generation.
  • ARC-AGI 2 score: 16.4%, making it the first major model to score meaningfully on the upgraded ARC-AGI benchmark, where most models score in the low single digits.
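As a rough illustration of how the context window and output cap interact, the sketch below clamps a requested output length to what fits. It rests on assumptions not stated on this card: "128K" and "16K" are read as 128,000 and 16,000 tokens, and prompt and output tokens are assumed to share one context window. The helper name `clamp_output_budget` is hypothetical.

```python
# Limits from this card; the exact token values are an assumption.
CONTEXT_WINDOW_TOKENS = 128_000  # "128K" read as 128,000
MAX_OUTPUT_TOKENS = 16_000       # "16K" read as 16,000


def clamp_output_budget(prompt_tokens: int, requested_output: int) -> int:
    """Return the largest output budget that fits (hypothetical helper).

    Assumes prompt and output tokens share the same context window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW_TOKENS - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt does not fit in the context window")
    # The budget is bounded by the request, the output cap, and the space left.
    return min(requested_output, MAX_OUTPUT_TOKENS, remaining)


print(clamp_output_budget(1_000, 20_000))    # capped by the 16K output limit
print(clamp_output_budget(120_000, 16_000))  # capped by the remaining context
```

Under these assumptions, the output cap only becomes the binding limit while the prompt stays below 112,000 tokens; beyond that, the remaining context is the tighter constraint.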

Connections

  • [[OpenAI]] — the provider
  • [[MultimodalModels]] — primary category
  • [[ARC-AGI 2]] — novel benchmark