Detect prompt injection, jailbreak, and social-engineering attacks in LLM agents.
Official MCP (Model Context Protocol) server for AgentShield — the runtime gateway and real-time classifier that detects prompt-injection, jailbreak, and social-engineering attempts in text while your agent is running, not in an offline audit pass. Works with any MCP-compatible client: Claude Desktop, Cursor, Cline, Zed, Continue, and custom agents. Single-shot per request, p50 ~2.4 ms — designed…
Verification confirms publisher identity (repo ownership), not code safety. The security scan covers known CVEs and suspicious install scripts — it cannot prove the absence of malicious code.
Official MCP (Model Context Protocol) server for AgentShield — the runtime gateway and real-time classifier that detects prompt-injection, jailbreak, and social-engineering attempts in text while your agent is running, not in an offline audit pass. Works with any MCP-compatible client: Claude Desktop, Cursor, Cline, Zed, Continue, and custom agents. Single-shot per request, p50 ~2.4 ms — designed to sit in the agent's hot path on every untrusted input. Exposes one tool to the agent: . Call it…