Glossary

So “agent”, “tool”, and “policy” mean exactly one thing to both the developers who integrate Shield and the security team who operates it.


Agent — an AI system that takes actions by calling tools. Identified to Shield by an agent_key.

Agent key — the string an agent presents on every request (X-Agent-Key). How Shield knows who is acting.

Tool — a single capability an agent can invoke (e.g. get_account, wire_transfer). May come from an MCP server or be generated from an API.

Tool call — one invocation of a tool with arguments. The unit Shield authorizes.

Role / RBAC — the caller’s role (X-User-Role, e.g. reader, admin) gates which tools it may call. RBAC = role-based access control.

Role permissions — the map, per agent, of role → allowed tools. Set once when the agent is registered.

Capability minting — defining what an agent can do (its tools and role-permissions) before it runs. Done once at registration.

Guardrail — one check Shield runs. Fast-tier guardrails (RBAC, kill switch) are CPU-only and sub-millisecond; slow-tier (content inspection, output sanitization) use an LLM.

Risk tier — a zero-config low / medium / high score Shield assigns every tool call from its name, HTTP method, and arguments — so dangerous calls surface before any policy is written.

Policy modemonitor or enforce per tenant. Monitor evaluates everything and records what would block, but blocks nothing (a dry run). Enforce actually blocks. Default is enforce.

Would-block — in monitor mode, the list of guardrails that would have blocked a call. The review queue before turning on enforcement.

Kill switch — an operator’s one-click disable of a tool, fleet-wide and instant. An administrative block: enforced even in monitor mode.

Agent disable — toggling an agent off in the registry; it cannot act at all, regardless of role. Also administrative.

Shadow agent / shadow tool — an agent or tool that called Shield without being registered. Surfaced for the operator to approve or block.

Tenant — an isolated customer/workspace. Its agents, policies, and audit log are scoped to it. Resolved from the API key.

Data policy / sanitization — rules that redact or block sensitive data (PII, secrets) in a tool’s output before the agent sees it.

Taint graph — the per-session lineage of tool calls, used to trace a result back through the chain to the originating prompt (forensics).

Audit log / SIEM feed — the record of every decision (agent, tool, role, result, reason, timestamp), kept locally and streamed to the security team’s SIEM (Splunk, Elastic).

Upstream MCP server — an existing MCP server (third-party or internal) that Shield’s proxy sits in front of.

MCP proxy — Shield mediating between an MCP client and upstream MCP servers: filtering tools/list, enforcing tools/call, sanitizing output.

Generated server — a standalone MCP server Shield emits from an OpenAPI spec (Python or TypeScript), with the enforcement hook baked in.

Sandbox key — any sk-test-* API key; resolves to a shared sandbox tenant Shield auto-provisions, for zero-setup trials.