Advanced Codex Tips

You have mastered the basics of each Codex surface. You know how worktrees work, your config is tuned, and your AGENTS.md is structured. Now you want to push further: running Codex as an MCP server consumed by other agents, pointing it at local models through Ollama, piping structured output into your build system, and tracking every API call through OpenTelemetry. These are the techniques that turn Codex from a coding assistant into a development infrastructure component.

What You’ll Walk Away With

Multi-surface orchestration patterns that combine App, CLI, and Cloud
Custom model provider configurations for proxies, local models, and Azure
Programmatic integration via the @openai/codex-sdk and the codex exec --json event stream
Observability setup with OpenTelemetry for tracking Codex usage
Advanced sandbox tuning for security-sensitive environments

Multi-Surface Orchestration

The Surface Handoff Pattern

Each surface has a sweet spot. Chain them for maximum effect:

Start in the CLI for quick diagnosis:

codex "What's causing the TypeScript errors in src/services/?"

Move to the App for parallel implementation:
- Open a Worktree thread for the fix
- Open another Worktree thread for tests
- Review diffs visually in the App’s diff pane

Submit to Cloud for verification:

codex cloud exec --env staging --attempts 2 \
  "Run the full integration test suite against these changes"

Back to the CLI for the final commit:

codex exec --full-auto "Create a PR with a summary of all changes"

Cross-Surface Context

The App and CLI share config, AGENTS.md, and skills — but not thread history. To pass context between surfaces:

Use the App’s integrated terminal to run CLI commands
Copy relevant summaries from App threads into CLI prompts
Use codex resume to reopen a previous CLI session (it picks up locally-stored CLI transcripts under ~/.codex/sessions, not App threads)

When the IDE Extension and App are synced, they share thread visibility and auto-context. This is the smoothest cross-surface flow.

Start in the CLI:

codex "Analyze the performance regression introduced in the last 5 commits. Identify the suspect commit and explain what changed."

Then move to the App with a Worktree thread:

“Based on commit abc1234 being the cause of the performance regression, implement a fix that restores the previous query behavior while keeping the new feature. Run benchmarks to verify the fix.”

Custom Model Providers

Route Through a Proxy

model = "gpt-5.5"
model_provider = "proxy"

[model_providers.proxy]
name = "OpenAI via internal proxy"
base_url = "http://proxy.internal.company.com"
env_key = "OPENAI_API_KEY"

Use Ollama for Local Models

The [model_providers.ollama] block holds only the connection details. The provider Codex uses when you pass --oss is selected by oss_provider, which is a top-level key (not nested inside the table):

[model_providers.ollama]
name = "Ollama"
base_url = "http://localhost:11434/v1"

# top-level key, NOT inside the [model_providers.ollama] table above
oss_provider = "ollama"  # or "lmstudio"

Then run: codex --oss "Explain this function". With no provider passed, --oss falls back to whatever oss_provider points at.

Azure OpenAI

[model_providers.azure]
name = "Azure"
base_url = "https://YOUR_PROJECT.openai.azure.com/openai"
env_key = "AZURE_OPENAI_API_KEY"
query_params = { api-version = "2025-04-01-preview" }
wire_api = "responses"

Provider-Specific Tuning

[model_providers.openai]
request_max_retries = 4
stream_max_retries = 10
stream_idle_timeout_ms = 300000

Increase stream_idle_timeout_ms if you see timeout errors on long-running tasks. Increase retry counts for unreliable network conditions.

Quick Endpoint Override

If you just need to point the built-in OpenAI provider at a different endpoint (e.g., for data residency):

# replace with your actual regional/proxy endpoint
export OPENAI_BASE_URL="https://your-region.api.openai.com/v1"
codex

No config changes needed. OPENAI_BASE_URL overrides the built-in OpenAI provider’s endpoint for the current shell.

Model Reasoning and Output Control

Adjust Reasoning Effort

model_reasoning_effort = "high"    # For complex architectural decisions
# or
model_reasoning_effort = "low"     # For simple, fast tasks

Options: minimal, low, medium, high, xhigh (model-dependent).

Control Verbosity

model_verbosity = "low"      # Shorter responses, less explanation
model_reasoning_summary = "concise"  # Brief reasoning summaries

For CI logs, suppress reasoning entirely:

hide_agent_reasoning = true

Context Window Tuning

model_context_window = 128000
model_auto_compact_token_limit = 100000  # Compact earlier to leave headroom

Set model_auto_compact_token_limit lower than the context window to trigger compaction before the window fills completely.

Codex as an MCP Server

Run Codex itself as an MCP server so other agents can consume it:

codex mcp-server

This runs Codex over stdio, allowing another tool or agent to connect and use Codex as a tool. Useful for building multi-agent systems where a coordinator dispatches tasks to Codex.

Programmatic Integration (SDK and JSON Stream)

When you want Codex inside your own tooling — a custom CLI, a CI gate, a chat bot — there are two integration surfaces.

The TypeScript SDK

Install @openai/codex-sdk and drive threads from code. The SDK runs the same agent as the CLI, so your AGENTS.md, skills, and config all apply:

import { Codex } from '@openai/codex-sdk';

const codex = new Codex();
const thread = codex.startThread();

const result = await thread.run('Diagnose and fix the failing CI tests, then summarize what changed.');
console.log(result);

// Continue the same thread, or resume a prior one by ID with codex.resumeThread(id)
await thread.run('Now add a regression test for the bug you fixed.');

Piping the JSON Event Stream

For language-agnostic automation, add --json to codex exec to get newline-delimited JSON events (one per state change). Pipe it into jq or any parser to wire Codex into a build step:

Each line carries a top-level .type (thread.started, turn.started, turn.completed, turn.failed, item.started, item.completed, error); the kind of work an item represents (agent_message, command_execution, file_change, mcp_tool_call, …) lives at .item.type. To watch completed command runs and the model’s messages:

codex exec --json --full-auto \
  "Bump every dependency to its latest minor and run the test suite" \
  | jq -c 'select(.type == "item.completed") | select(.item.type == "command_execution" or .item.type == "agent_message") | .item'

Pair --json with --output-last-message out.txt in CI to capture both the machine-readable event stream and a final natural-language summary in one run.

Sandbox Tuning

Workspace Write with Network

sandbox_mode = "workspace-write"

[sandbox_workspace_write]
network_access = true
writable_roots = ["/Users/me/.pyenv/shims", "/tmp"]
exclude_tmpdir_env_var = false
exclude_slash_tmp = false

Grant Write to Additional Directories

Use --add-dir instead of broadening the sandbox:

codex --cd apps/frontend --add-dir ../backend --add-dir ../shared \
  "Coordinate API changes between frontend and backend"

This grants scoped write access without opening danger-full-access.

Test Sandbox Behavior

Use the codex sandbox command to test what a command can do under your current settings:

# Use `macos` on macOS, `linux` on Linux (aliases: seatbelt / landlock)
codex sandbox macos -- ls /etc
codex sandbox macos -- cat /etc/passwd
codex sandbox macos --full-auto -- npm test

The platform subcommand is required; everything after -- is the command to run. This executes it under the same sandbox Codex uses internally, so you can verify policies before the agent encounters them.

Observability with OpenTelemetry

Enable OTel Export

[otel]
environment = "production"
log_user_prompt = false  # Don't export raw prompts

[otel.exporter.otlp-http]
endpoint = "https://otel-collector.internal.company.com/v1/logs"
protocol = "binary"
headers = { "x-otlp-api-key" = "${OTLP_TOKEN}" }

What Gets Exported

Codex emits structured log events for:

codex.conversation_starts — Model, settings, sandbox policy
codex.api_request — Status, duration, error details
codex.tool_decision — Approved/denied, by config vs user
codex.tool_result — Duration, success, output snippet

Disable Anonymous Metrics

Codex sends anonymous usage data by default. Disable it:

[analytics]
enabled = false

This is separate from OTel export — analytics goes to OpenAI, OTel goes to your infrastructure.

Advanced Notification Patterns

Custom Notification Script

notify = ["python3", "/path/to/notify.py"]

The script receives a JSON argument with event details:

#!/usr/bin/env python3
import json, subprocess, sys

def main():
    notification = json.loads(sys.argv[1])
    if notification.get("type") != "agent-turn-complete":
        return 0
    title = f"Codex: {notification.get('last-assistant-message', 'Done!')}"
    subprocess.run([
        "terminal-notifier",
        "-title", title,
        "-message", " ".join(notification.get("input-messages", [])),
        "-group", "codex-" + notification.get("thread-id", ""),
    ])
    return 0

if __name__ == "__main__":
    sys.exit(main())

TUI Notification Filtering

[tui]
notifications = ["agent-turn-complete", "approval-requested"]
notification_method = "osc9"  # Desktop notifications via OSC 9 escape sequence

Feature Flags Worth Knowing

Flag	Status	What It Does
`shell_snapshot`	Beta	Snapshots shell environment for faster repeated commands
`unified_exec`	Beta	Uses PTY-backed exec for better terminal handling
`remote_compaction`	Experimental	Offloads context compaction to the server
`request_rule`	Stable	Smart approval suggestions based on command patterns

Enable with:

codex features enable shell_snapshot
codex features enable unified_exec

Prompt Editor for Long Instructions

For complex, multi-paragraph prompts, press Ctrl + G in the TUI to open your configured editor. Set the editor:

export VISUAL=code  # Or vim, nvim, nano, etc.

Write the full prompt in your editor, save and close, and Codex sends it. This is far more ergonomic than typing long instructions in the composer.

When This Breaks

Custom provider authentication fails: Verify the env_key environment variable is set and exported. Use codex login status to check auth.
OTel events not appearing: Check that exporter is set to otlp-http or otlp-grpc, not none. Verify the endpoint is reachable from your machine.
Sandbox too restrictive for your workflow: Use codex sandbox to test specific commands. Add writable_roots for directories the agent needs.
MCP server mode disconnects: Codex exits when the downstream client closes the connection. Ensure your client maintains the stdio pipe.
Feature flags disappear after restart: Feature flags are persisted to config.toml but profile-scoped flags only apply when that profile is active.

What’s Next

Team Collaboration — Share these advanced configurations across your team
Setup and Configuration — Foundation config that supports these techniques
AGENTS.md Optimization — Layer instructions on top of advanced config