Editorial · ai in production

Coding agents are a production security problem now

OpenAI published its internal Codex controls — here's the sandbox, approval, and telemetry stack your platform team needs before any agent touches a repo.

May 11, 2026· 5 min read· Domani AI

OpenAI quietly published its internal security posture for running Codex in production — not a marketing brief, but an operator-grade document covering sandboxing, network egress, approval gates, and agent-native telemetry. For CTOs fielding pressure to roll out coding agents this quarter, this is the first credible reference architecture from a team that actually runs one at scale. The strategic gap it exposes isn't whether to adopt coding agents — it's whether your platform team has a containment story before your engineering org opens the door.

What changed in how OpenAI runs Codex

OpenAI's Running Codex Safely post describes the control layer the company operates internally when Codex agents work inside real codebases. The architecture rests on four pillars: isolated sandbox environments per agent task, explicit network egress restrictions that block arbitrary outbound calls, human-in-the-loop approval gates for actions above a defined blast-radius threshold, and structured telemetry that captures agent decisions — not just outputs — for audit.

The sandboxing model is task-scoped, not session-scoped. Each coding task gets its own ephemeral environment with no persistent state carried across runs. Network policies default to deny, with allowlists maintained per project rather than per agent. Approval gates are tiered: low-risk operations (read, lint, test) run autonomously; write operations to main branches or secrets-adjacent files require a human sign-off before execution. Telemetry is designed to log the agent's reasoning trace alongside its actions, making post-incident review tractable rather than a guessing game.

What makes this document unusual is provenance. Most coding agent guidance comes from vendors selling the agent. This describes the controls an organization chose for itself when the agent is working on its own production systems — a materially different accountability structure.

Why this changes the math on agent ownership for your stack

Most engineering orgs evaluating coding agents in May 2026 are treating this as a developer-tooling procurement decision — which IDE plugin, which model, which pricing tier. That framing misses where the risk actually lands. A coding agent with repo write access, access to environment variables, and the ability to open pull requests is a privileged actor in your CI/CD chain. The threat model isn't the agent going rogue; it's the agent doing exactly what it's told, by a prompt that has been manipulated, a misconfigured allowlist, or an approval gate that nobody reviewed because the queue moved too fast.

The part most outlets miss: telemetry for coding agents requires a different schema than application telemetry. You cannot reconstruct what an agent decided to do from standard logs that only capture what it executed. Without reasoning-trace capture — the agent's intermediate steps, tool calls, and discarded alternatives — your incident response team is reading tea leaves. OpenAI's posture bakes this in from the start. Most platform teams we talk to have none of it.

The second gap is approval gate design. A gate that fires on every commit defeats the productivity case. A gate that only fires on deploys misses the class of risk that lives in dependency changes, secret references, and test file modifications that lower coverage thresholds without triggering obvious flags. The OpenAI approach tiers by blast radius — a concept your platform team needs to operationalize for your specific repo topology before agents go live, not after the first incident.

Talk to Domani AI about building this →

What your platform team should do this week

This week's task is a readiness audit, not a rollout decision. Before any coding agent gets repo access — even in a staging environment — your platform team needs answers to three questions that branch differently depending on your organization's repo sensitivity and headcount.

Run this decision tree:

Do any repos the agent would touch contain secrets, credentials, or environment configs? If yes: network egress controls and secrets-scanning in the sandbox are non-negotiable preconditions, not phase-two items.
Is your CI/CD pipeline triggered automatically on PR creation? If yes: you need approval gates before PR creation, not just before merge. Automated pipelines collapse the window between agent action and production effect.
Can you reconstruct what the agent decided — not just what it did — from your current logging setup? If no: instrument reasoning-trace capture before you go live. Your security team will need it the first time something unexpected lands in production.

For teams under 150 engineers: start with a single low-sensitivity repo, deny-by-default network policy, and mandatory human review on all writes. Expand access scope only after you have 4 weeks of telemetry you can actually read. For teams above 150: the approval gate queue will become a bottleneck faster than you expect — design the tiering logic now, not when engineers start bypassing the gate because it blocks their sprint.

What this costs — and what skipping it costs

Building a sandbox-plus-telemetry control layer for coding agents is a 3–6 week platform investment for a team of two senior engineers, assuming you're not starting from scratch on ephemeral environment infrastructure. If you're already running task-scoped CI environments, the incremental work is closer to 2 weeks. The approval gate logic — especially the blast-radius tiering — is where the design time goes, not the tooling.

The cost of skipping it is harder to price in advance and very easy to price after the fact. A single incident where an agent commits a dependency with a known CVE, exposes a secrets reference in a PR, or degrades test coverage across a critical service will cost more in remediation and trust repair than the platform build. More practically: if your engineering org is already feeling pressure to ship coding agent access, the path of least resistance is to open it without controls and patch later. That path has a history, and it doesn't end well. The OpenAI posture exists because they chose to do this work before incidents forced it. That's the choice on the table this week.

Talk to Domani AI about building this →

Source: https://openai.com/index/running-codex-safely

Have a similar build in mind? → Start the conversation

Start the conversation →

← Back to Insights