Security Operations Automation

Your scanner just flagged 412 “critical” findings across 50 container images, the on-call channel is on fire, and your SOC 2 auditor wants the access-review evidence by Friday. The findings are real, but most are noise — and you have no fast way to tell which three actually matter. This is where an AI agent wired to your real security tooling earns its keep: not by inventing magic, but by reading scanner output, correlating it with your code, and drafting the evidence you’d otherwise assemble by hand.

What you’ll walk away with

A pattern for piping real scanner output (Trivy, your cloud security tool) into Cursor, Claude Code, or Codex so the agent triages instead of you.
Verified MCP setups for AWS and Azure security data, plus the one MCP-hygiene scanner you should actually run.
Copy-paste prompts for vulnerability triage, a security-gated CI pipeline, and SOC 2 / GDPR evidence drafting.
A Kubernetes hardening prompt that uses current APIs (Pod Security Admission, not the removed PodSecurityPolicy).
A clear-eyed “when this breaks” list so you don’t ship false confidence.

The workflow: agent on top of real tools

The mistake most “AI SecOps” writeups make is asking the model to be the scanner. It isn’t. The durable pattern is: a deterministic tool produces machine-readable output, and the agent reasons over that output — prioritizing by exploitability, mapping a finding to the exact line in your repo, and drafting the fix or the evidence.

# Trivy is a real CLI. Scan an image to JSON, then hand the JSON to your agent.
trivy image --severity HIGH,CRITICAL --format json -o scan.json myorg/api:latest

That scan.json is the input the agent is genuinely good at. Everything below builds on this shape: run the real tool, then prompt the agent over its output.

MCP servers for security context

A few security data sources are worth wiring in as MCP servers so the agent can query live cloud posture instead of working from a stale export. The server (command, args, env) is the same everywhere — only the config file and its format differ: Cursor and Claude Code (.mcp.json) use the JSON mcpServers shape shown below, while Codex (~/.codex/config.toml) expresses the same fields as a TOML [mcp_servers.<name>] table.

AWS security posture — the aws-security-mcp package (real, on npm) exposes Security Hub, GuardDuty, and IAM findings:

{
  "mcpServers": {
    "aws-security": {
      "command": "npx",
      "args": ["-y", "aws-security-mcp"],
      "env": {
        "AWS_REGION": "us-east-1"
      }
    }
  }
}

The same server in Codex’s ~/.codex/config.toml — identical fields, TOML syntax:

[mcp_servers.aws-security]
command = "npx"
args = ["-y", "aws-security-mcp"]

[mcp_servers.aws-security.env]
AWS_REGION = "us-east-1"

Do not hardcode AWS_ACCESS_KEY_ID / AWS_SECRET_ACCESS_KEY as literals in a config you might commit. Let the server pick up credentials from the standard AWS chain — an assumed IAM role, SSO, or ~/.aws/credentials — the same way the AWS CLI does.

Azure security posture — Microsoft’s first-party @azure/mcp server covers Defender for Cloud and resource queries:

{
  "mcpServers": {
    "azure": {
      "command": "npx",
      "args": ["-y", "@azure/mcp@latest", "server", "start"],
      "env": {
        "AZURE_SUBSCRIPTION_ID": "your-subscription-id",
        "AZURE_TENANT_ID": "your-tenant-id"
      }
    }
  }
}

Scan your MCP setup itself

If you’re adding MCP servers to a security workflow, audit them. snyk-agent-scan (run via uvx) statically inspects your installed MCP servers for prompt injection, tool poisoning, and rug-pull attacks — the supply-chain risks specific to MCP. This is Invariant Labs’ former mcp-scan, renamed after Snyk acquired Invariant Labs in June 2025; the old mcp-scan package on PyPI is now just a redirect shim that installs and forwards to snyk-agent-scan, so use the canonical name:

# Scan all installed MCP server configs for malicious tool descriptions
uvx snyk-agent-scan@latest scan

# Inspect the exact tool descriptions your model is being fed
uvx snyk-agent-scan@latest inspect

This is a real, current command — and a genuinely good habit once you depend on third-party MCP servers for security data.

Vulnerability triage: from 412 findings to 3 that matter

Run the scanner, then let the agent do the prioritization a human would otherwise grind through. The agent reads the JSON, cross-references your code to see whether a vulnerable path is actually reachable, and ranks by real exploitability rather than raw CVSS.

Open the repo, drop scan.json into the Composer context, and run the triage prompt. With the @aws-security MCP server connected, add @aws-security to let it check whether an affected service is actually internet-exposed.

From the repo root, pass the scanner output as context. Claude Code can run grep/rg itself to confirm reachability, which makes its prioritization concrete rather than guessed.

claude "Read scan.json and the codebase, then triage the HIGH/CRITICAL findings"

Copy-paste prompt for a one-shot triage report (headless, e.g. in CI):

claude -p "Read scan.json (Trivy JSON). For each HIGH/CRITICAL: name the CVE and package, use Grep to check if the vulnerable path is reachable in our source, and assign P0/P1/P2 with a one-line justification. Output a markdown table sorted by priority." --output-format json > triage.json

Run inside the Codex TUI so you can review each command before it runs. Codex defaults to a sandbox; keep approvals on for anything that touches credentials or cloud APIs.

codex "Read scan.json and triage the HIGH/CRITICAL findings against our source"

What good output looks like — a real, reachable finding rather than a fabricated one:

P0 — CVE-2026-42945 (nginx, ngx_http_rewrite_module heap overflow, CVSS 9.2). Affected: nginx:1.30.0 base image in 3 services. Reachable: yes — all three use rewrite directives. Internet-exposed: yes (edge ingress). Fix: rebuild on nginx:1.31.0 (or 1.30.1), which ships the patch. Patch within 24h.

That CVE is real (introduced in 2008, disclosed 2026, fixed in nginx 1.31.0 / 1.30.1) — use verifiable findings in security work; never let the agent present an invented CVE number as fact.

Security-gated CI pipeline

The highest-leverage move is to put the scanner in front of deploys so a critical vulnerability blocks the merge automatically. Trivy ships a maintained GitHub Action; the agent’s job is to wire it into your pipeline and set the gate correctly.

A correct gate looks roughly like this — note it scans the built image and fails the build, rather than just reporting:

- name: Scan image with Trivy
  uses: aquasecurity/trivy-action@master
  with:
    image-ref: myorg/api:${{ github.sha }}
    severity: HIGH,CRITICAL
    ignore-unfixed: true
    exit-code: '1'        # fail the build on a fixable HIGH/CRITICAL
    format: sarif
    output: trivy.sarif

The pattern generalizes: build-time scanning in CI, plus continuous registry scanning for images already deployed, plus an admission policy at the cluster (next section) so a vulnerable image can’t be scheduled even if it slips past CI.

Container and Kubernetes hardening

Real-world container security workflow

The challenge: your team deploys 50+ images weekly. Manual review is a bottleneck; unscanned images are a risk. The fix is layered, automated gates — and an admission policy that uses current Kubernetes APIs.

Use the agent to:

Wire Trivy into CI as a blocking gate (prompt above).
Enable continuous registry scanning for already-deployed images.
Enforce Pod Security Standards at the cluster via Pod Security Admission (the built-in successor to the removed PodSecurityPolicy), or a policy engine like Kyverno / OPA Gatekeeper for richer rules.
Audit live workloads against the CIS Kubernetes Benchmark.

PodSecurityPolicy was removed in Kubernetes 1.25 — any guide still telling you to “implement pod security policies” is out of date. The current built-in mechanism is Pod Security Admission enforcing the three Pod Security Standards (privileged / baseline / restricted).

Compliance is mostly evidence assembly, and that is exactly the kind of structured-collation work an agent is good at — provided you point it at real artifacts (access logs, IaC, config exports) rather than asking it to assert compliance from nothing.

Drop your access-review export and IaC into context, connect @aws-security for live config, and ask for an evidence package mapped to specific controls.

Claude Code can read the repo, run git log for change-management evidence, and assemble the package in one pass.

Use Codex with read-only sandboxing for an evidence pass — it should read and report, not modify infrastructure.

codex --sandbox read-only "Assemble SOC 2 CC6.1 access-control evidence from access-review.csv and infra/"

The discipline that makes this trustworthy: every “satisfied” claim must cite a real artifact. An evidence package the agent fabricated will fail the moment an auditor asks to see the underlying log — so prompt explicitly for citations and gap-flagging, as above.

When this breaks

False-positive storms. Scanners flag hundreds of “criticals” that are unreachable transitive deps or unfixable base-image noise. Always pass ignore-unfixed where appropriate and make the agent confirm reachability before it calls something a P0 — otherwise you’ve automated alert fatigue.
Scanner / cloud auth failures. If the agent’s triage looks suspiciously empty, the MCP server or CLI probably failed to authenticate (expired SSO, wrong region, missing role) and returned nothing. Verify the underlying tool runs standalone (aws sts get-caller-identity, trivy image …) before trusting the agent’s summary.
Admission-controller lockouts. Switching Pod Security Admission straight to enforce: restricted, or a too-broad Kyverno policy, can block your own deploys and even cluster system pods. Stage with warn/audit, exempt kube-system, and test on one namespace first.
Fabricated CVEs or evidence. The single biggest risk: an agent confidently citing a CVE number, CVSS score, or compliance artifact that doesn’t exist. Never accept a finding you can’t trace to scanner output or a real artifact. Make “cite the source or flag it as unknown” a standing instruction in every security prompt.
MCP supply-chain risk. A malicious third-party MCP server can poison the agent’s context. Run uvx snyk-agent-scan@latest scan after adding any server, and pin versions.

What’s next

Deployment Operations overview — where security gating fits in the broader deploy pipeline.
The MCP Ecosystem — how to wire up, scope, and audit MCP servers safely.
Pipeline Automation with AI — building the pipeline your security gate plugs into.