Security operations quickstart
You operate Shield from the portal — no code. The posture is observe first: watch what your agents and tools actually do, see the dangerous calls ranked, then turn on enforcement for what you don’t like.
Table of contents
The eight concepts you need
- Agent — an AI system that acts by calling tools. Each has an identity.
- Tool — one capability an agent can invoke (
get_account,wire_transfer). - Role (RBAC) — the caller’s role gates which tools it may use.
- Risk tier — every tool call is auto-scored
low/medium/highwith no policy needed, so dangerous activity surfaces on day one. - Policy mode —
monitor(dry run, blocks nothing, records what would block) orenforce(actually blocks). You move tools from one to the other. - Kill switch — disable a tool fleet-wide, instantly, one click.
- Shadow agent/tool — something that called Shield without being registered; flagged for you to approve or block.
- Audit / SIEM — every decision recorded and streamed to your SIEM.
Fuller definitions: the glossary (shared with your developers).
The lifecycle: template → monitor → review → enforce
This is the safe path to production. Nobody flips a new policy to “block” blind.
- Start from a template. Pick your industry (healthcare/HIPAA, financial/PCI, general) instead of a blank policy.
- Run in monitor mode. Shield evaluates everything and records a would-block list, but blocks nothing. Zero risk to live traffic.
- Review. In the portal, look at what would have been blocked and the high-risk calls. Confirm the policy matches reality.
- Enforce. Flip the tools you’re confident about to enforce. Leave the rest in monitor. Repeat.
Details and the operator walkthrough: policy lifecycle.
Why this matters: monitor mode is the single biggest trust-builder. You prove a policy is safe against real traffic before it can break anything.
Reading a block
Every block answers four questions, in the audit row and the SIEM event:
- Which agent acted (
agent_key) and as which role. - Which tool it tried to call.
- Which guardrail stopped it and why (the reason message).
- When (
trace_id+ timestamp), correlatable across the whole request.
If a block looks wrong, the reason tells you exactly which rule to change — you never have to guess.
Tabletop: “an agent goes rogue”
Run this with your team to learn the incident path.
- Detect. An agent starts calling a high-risk tool (e.g.
delete_records) it shouldn’t. You see it ranked high in the discovery / activity view, or get a webhook/SIEM alert. - Contain. One click: kill switch the tool (fleet-wide, instant) and/or disable the agent. Both are administrative — they enforce even if the tenant is still in monitor mode.
- Investigate. Open the audit log for that
trace_id. The taint graph traces the action back through the tool-call chain to the originating prompt — prompt → execution, end to end. - Remediate. Tighten the role’s permissions or move the tool to enforce. Re-enable the agent when safe.
The point to internalize: disable is real enforcement, not a label — a toggled-off agent cannot act through any path, and a kill-switched tool is dead fleet-wide, immediately.
Compliance
Shield’s controls map to NIST AI RMF, OWASP LLM Top 10, and ISO 42001; the audit trail and evidence pack support those frameworks. See the compliance mapping in the docs for the control-by-control table.
Where to next
- Policy lifecycle — monitor → enforce in the portal
- Glossary — shared vocabulary with developers
- Developer guide — how your developers connect agents