Skip to content
OpenAI introduces GPT-5-Codex: faster, more reliable coding assistant with advanced code reviews
Source: openai.com

OpenAI introduces GPT-5-Codex: faster, more reliable coding assistant with advanced code reviews

Sources: https://openai.com/index/introducing-upgrades-to-codex, OpenAI

TL;DR

  • GPT‑5‑Codex is a version of GPT‑5 optimized for agentic coding in Codex, designed for quick interactive sessions and long, complex tasks.
  • It includes enhanced code review capabilities that can identify critical bugs before they ship.
  • Codex now works everywhere you develop: terminal, IDE, web, GitHub, and the ChatGPT iOS app, with cloud and local task support via Codex CLI and the IDE extension.
  • Performance and workflow improvements include faster task completion through caching, automatic environment setup, image sharing in the CLI, and improved tool use for external integrations.
  • Security is prioritized: Codex runs in a sandbox by default, with configurable privacy and network controls across cloud and local environments.

Context and background

Codex has evolved to become a more effective coding collaborator across multiple environments. Earlier steps included the launch of Codex CLI in April and Codex web in May, followed by unifying Codex into a single product experience connected by a user’s ChatGPT account two weeks ago. This unification enables seamless movement of work between local environments and the cloud without losing context, and expands Codex’s reach to terminals, IDEs, the web, GitHub, and even the ChatGPT iOS app. Codex is included with ChatGPT Plus, Pro, Business, Edu, and Enterprise plans. OpenAI

What’s new

OpenAI introduces GPT‑5‑Codex, a version of GPT‑5 further optimized for agentic software engineering in Codex. It is trained on complex, real‑world engineering tasks, including building full projects from scratch, adding features and tests, debugging, performing large‑scale refactors, and conducting code reviews. It is more steerable and adheres better to AGENTS.md instructions, producing higher‑quality code with less need for long setup prompts.

  • It can power cloud tasks and code review by default and can be used for local tasks via Codex CLI and the IDE extension.
  • It supports long-running independent work (tests show independence for more than 7 hours on large tasks) and remains snappy on small requests or during interactive chats.
  • GPT‑5‑Codex has been validated for code reviews by navigating codebases, reasoning about dependencies, and running code and tests to validate behavior.
  • In testing, it demonstrates improved accuracy in review comments and reduces the likelihood of incorrect or unimportant feedback.
  • It supports visual input in the cloud (images or screenshots) and can display progress screenshots of its work.
  • The model is purpose-built for Codex CLI, the Codex IDE extension, the Codex cloud environment, and GitHub, and it also supports versatile tool use. Unlike GPT‑5, it is recommended specifically for agentic coding tasks in Codex or Codex‑like environments.
FeatureDetails
AvailabilityDefault for cloud tasks and code review; usable for local tasks via Codex CLI and IDE extension
Training emphasisReal-world software engineering tasks, including full projects, features, tests, debugging, and large refactors
StrengthsFaster on short prompts, capable of long independent execution, improved code reviews, better alignment with task goals
SecuritySandboxed by default; network access configurable; commands require permission; cloud network restrictions
Tools and environmentsCodex CLI, Codex IDE extension, cloud environment, GitHub integration, supports images in CLI and progress tracking
  • SWE-bench Verified: the model now reports on all 500 SWE-bench tasks and includes refactoring tasks across Python, Go, and OCaml. An example PR in Gitea demonstrates large-scale changes.
  • It adapts its thinking time based on task complexity and balances interactive pairing with persistent, long-running execution. During testing, the model handled large refactors and maintained momentum over hours. OpenAI

Why it matters (impact for developers/enterprises)

  • A faster, more reliable teammate reduces turnaround time for both quick questions and long-running engineering tasks.
  • Advanced code reviews can catch critical flaws earlier, reducing reviewer load and increasing confidence before shipping.
  • The unified Codex experience, connected through a ChatGPT account, enables seamless context retention when moving work between the cloud and local environments. This helps teams maintain continuity across tools and locations.
  • Cross‑environment support means developers can work in the terminal, IDEs, web, GitHub, and mobile, aligning with modern developer workflows.
  • Security controls, sandboxing, and configurable network access help protect code and data while enabling productive automation in trusted environments. OpenAI

Technical details or Implementation

  • Codex CLI has been rebuilt around agentic coding workflows to make the agent more capable and reliable. It now supports attaching and sharing images (screenshots, wireframes, diagrams) in the CLI to build shared context and clarify design decisions. A to-do list tracks progress on complex tasks, and web search and MCP enable connecting to external systems, with more accurate tool use overall. The terminal UI now presents tool calls and diffs more clearly, and approval modes are simplified to read-only, auto, and full access levels. Conversation state can be compacted to make long sessions easier to manage.
  • The IDE extension brings the Codex agent into VS Code, Cursor, and other forks, enabling you to preview local changes, edit code with Codex, and move work between the cloud and your local environment without leaving the editor. You can create cloud tasks, monitor in-progress work, and review completed tasks from within the IDE, including opening cloud tasks in the editor to maintain context. Quickstart guides explain how to maximize the IDE extension. OpenAI
  • In the cloud, Codex can inspect images you provide as input, look at its own progress visually, and attach screenshots of the result to tasks and PRs. Cloud infrastructure improvements include caching containers to dramatically reduce median completion time for new tasks and follow-ups by 90%. Codex can automatically set up its own environment by scanning common setup scripts and running them, and can fetch dependencies at runtime when configured with internet access. Security settings are customizable to match risk tolerance; cloud network access can be restricted to trusted domains. In the CLI and IDE extensions, developers can approve commands to run with full access or permit web search and MCP connections, expanding capabilities while balancing risk. OpenAI
  • Code review capabilities: Codex reviews PRs as they move from draft to ready, posts analysis on the PR, and can implement suggested edits if asked with a command like “@codex review.” The team uses Codex for a majority of PR reviews at OpenAI and highlights its role in accelerating shipping with greater confidence. Codex emphasizes protecting code and data from exfiltration, running in sandboxed environments by default, and asking for permission before potentially dangerous actions. OpenAI

Key takeaways

  • GPT‑5‑Codex is tailored for agentic coding in Codex, delivering faster responses and robust long‑form task handling.
  • It strengthens code review, navigation, and testing workflows to improve software quality and reduce manual review effort.
  • Codex now offers a unified, cross‑environment experience connected by a ChatGPT account, enabling seamless movement between local and cloud work.
  • Security and containment features, plus configurable network controls, help protect code and data while enabling automation.
  • The Codex CLI and IDE extension team up to bring the cloud agent into your favorite development environments, with richer collaboration features like image sharing and to‑do tracking.

FAQ

  • Q: What is GPT‑5‑Codex and how is it specialized for Codex? A: It is a version of GPT‑5 optimized for agentic software engineering tasks in Codex, trained on real-world coding work, and focused on interactive sessions as well as independent long-running tasks. OpenAI
  • Q: Where can Codex be used with this upgrade? A: In cloud tasks and code reviews by default, with local task support via Codex CLI and the IDE extension, and across terminal, IDE, web, GitHub, and the ChatGPT iOS app. OpenAI
  • Q: How does code review work with GPT‑5‑Codex? A: It navigates codebases, reasons about dependencies, runs code and tests to validate behavior, and can post analysis on PRs and implement edits when asked. OpenAI
  • Q: What about security and sandboxing? A: Codex runs in a sandboxed environment by default with network access disabled; it can ask for permission before dangerous actions and can be restricted or expanded via configurable settings. OpenAI
  • Q: What performance improvements are highlighted? A: Caching of containers reduces median task completion time by about 90%, and Codex can adapt its thinking time to task complexity while maintaining responsiveness on simpler requests. OpenAI

References

More news