RUNTIME PROTECTION FOR AI AGENTS
Runtime protection that steers AI agents away from dangerous actions while preserving their ability to complete the task.
Attacks blocked
Claude Code on AgentDojo (n=20)
Attacks blocked
OpenHands on AgentDojo (n=20)
Latency per call
median response time
WHAT WE PROTECT AGAINST
AI coding agents can be hijacked through prompt injection to perform unauthorized actions. AgentSteer intercepts every tool call and blocks threats before they execute.
Agents tricked into sending sensitive data to external endpoints via email, HTTP requests, or file uploads.
Detected by delegation policy (P2) and target verification (P4)
Injected commands that run arbitrary code, install backdoors, or execute remote payloads.
Detected by category mismatch policy (P3)
Agents deleting files, dropping databases, or modifying configurations they should not touch.
Detected by read-only policy (P1) and category mismatch (P3)
Sending task details to external URLs, emails, or services the agent was never asked to contact.
Detected by delegation policy (P2)
HOW IT WORKS
AgentSteer doesn't just block. When a dangerous action is prevented, the agent receives the reason and continues its task on the right path.
A PreToolUse hook captures every tool call before execution. Works with Claude Code, OpenHands, or any Python agent.
A security model scores the action against the task description across four policies: read-only, delegation, category, and target.
Dangerous actions are blocked with a clear reason. The agent adapts and completes the task safely, preserving capability.
IN ACTION
A prompt injection tries to exfiltrate data. AgentSteer blocks the malicious action and the agent continues with the legitimate task.
GET STARTED
Continuous monitoring for every AI agent action. Audit trail of all scored actions with block decisions, policy violations, and reasoning.
100% attack blocking on Claude Code, 95% on OpenHands (AgentDojo, n=20)
Install in minutes. Works with Claude Code and OpenHands out of the box. Three lines of code for any Python agent framework.
pip install agentsteer