Detect prompt injection and jailbreak attacks in real-time. F1 0.92, 2.4ms latency.
AgentShield protects your AI agent from prompt injection, jailbreak, and social-engineering attacks. One tool call classifies any untrusted text before it reaches your LLM.
HOW IT WORKS Call classify_text with any untrusted input — user messages, RAG documents, email bodies, tool-call results. AgentShield returns a verdict with confidence score in ~2.4ms (p50).
PUBLIC BENCHMARK (5,972 samples, 5 datasets) Aggregate F1: 0.921 | Precision: 0.905 | Recall: 0.936 Datasets: deepset, gandalf, jackhhao, SPML, safeguard Full methodology and failure analysis: https://agentshield.pro/benchmark
DECISION RULE If is_injection=true AND confidence >= 0.8 → refuse or quarantine the input. Below 0.8 → log the verdict and proceed with caution.
MULTI-LAYER PIPELINE → Input sanitization (homoglyphs, invisible Unicode, encoding tricks) → Pattern detection (200+ known injection templates) → Semantic ML classification (embedding-based intent analysis) → Policy engine (custom rules per application)
Free tier: 100 classifications/day, no credit card needed. EU-hosted (Frankfurt), GDPR compliant, input data not stored after processing.
Works with Claude Desktop, Cursor, Cline, and any MCP-compatible client.
Valid MCP server (2 strong, 4 medium validity signals). 1 code issue detected. 7 known CVEs in dependencies (0 critical, 2 high severity) Package registry verified. Imported from the Official MCP Registry.
10 files analyzed · 9 issues found
Security scores are indicators to help you make informed decisions, not guarantees. Always review permissions before connecting any MCP server.
This plugin requests these system permissions. Most are normal for its category.
Set these up before or after installing:
Environment variable: AGENTSHIELD_API_KEY
Sign up freeAdd this to your MCP configuration file:
{
"mcpServers": {
"io-github-dl-eigenart-agentshield-mcp": {
"env": {
"AGENTSHIELD_API_KEY": "your-agentshield-api-key-here",
"AGENTSHIELD_BASE_URL": "your-agentshield-base-url-here"
},
"args": [
"-y",
"@eigenart/agentshield-mcp"
],
"command": "npx"
}
}
}Once installed, try these example prompts and explore these capabilities:
Be the first to review this server!
by Toleno · Developer Tools
Toleno Network MCP Server — Manage your Toleno mining account with Claude AI using natural language.
by mcp-marketplace · Developer Tools
Create, build, and publish Python MCP servers to PyPI — conversationally.
by Microsoft · Content & Media
Convert files (PDF, Word, Excel, images, audio) to Markdown for LLM consumption
by mcp-marketplace · Developer Tools
Scaffold, build, and publish TypeScript MCP servers to npm — conversationally
by mcp-marketplace · Finance
Free stock data and market news for any MCP-compatible AI assistant.
by Taylorwilsdon · Productivity
Control Gmail, Calendar, Docs, Sheets, Drive, and more from your AI