GPT-5¶

OpenAI's 2026 flagship multimodal model. Successor to GPT-4o. Shipped with a 128K context window, text + vision + audio input modalities, and a pricing structure that sits slightly above Claude Sonnet 4 on input but comparable on output.

Notable features¶

Multimodal from day one — text, vision, and audio in a single endpoint.
16K output cap — double the 8K output ceiling of most contemporaries; useful for long-form structured generation.
ARC-AGI 2 score — 16.4%, the first major model to score meaningfully on the upgraded ARC-AGI benchmark (most models are in the low single digits).

Connections¶

[[OpenAI]] — the provider
[[MultimodalModels]] — primary category
[[ARC-AGI 2]] — novel benchmark

`⌘K` / `Ctrl+K`	Open command palette
`/`	Focus search
`g h`	Go to home
`g p`	Go to projects
`g s`	Go to sessions
`j` / `k`	Next / prev row (tables)
`?`	Show this help
`Esc`	Close dialogs