Iris is the first MCP-native eval and observability tool for AI agents. Any MCP-compatible agent discovers and uses Iris automatically, with no SDK and no code changes. Log traces as hierarchical span trees, evaluate output quality with 12 built-in rules (including PII detection, prompt injection, and cost thresholds), and track what your agents are doing and what they cost. It ships with a real-time dark-mode dashboard and an OpenTelemetry-compatible span structure, and is self-hosted with SQLite. MIT licensed.
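To make the idea of a hierarchical span tree with a cost threshold concrete, here is a minimal Python sketch. It is not Iris's implementation or API: the `Span` class, the `cost_usd` attribute, and the helpers `build_tree`, `total_cost`, and `exceeds_threshold` are all hypothetical names chosen for illustration. Only the `span_id` / `parent_span_id` linkage follows the general OpenTelemetry convention of spans pointing at their parent.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Span:
    # Hypothetical span record; cost_usd is an assumed attribute, not an Iris field.
    span_id: str
    parent_span_id: Optional[str]
    name: str
    cost_usd: float = 0.0
    children: List["Span"] = field(default_factory=list)


def build_tree(spans: List[Span]) -> List[Span]:
    """Attach each span to its parent and return the root spans."""
    by_id = {s.span_id: s for s in spans}
    roots = []
    for s in spans:
        parent = by_id.get(s.parent_span_id) if s.parent_span_id else None
        if parent is None:
            # No parent (or parent not in this batch): treat as a root.
            roots.append(s)
        else:
            parent.children.append(s)
    return roots


def total_cost(span: Span) -> float:
    """Sum the cost of a span and all of its descendants."""
    return span.cost_usd + sum(total_cost(c) for c in span.children)


def exceeds_threshold(span: Span, limit_usd: float) -> bool:
    """Return True when the whole subtree costs more than limit_usd."""
    return total_cost(span) > limit_usd
```

A rule in this style would flag a trace whenever `exceeds_threshold(root, limit_usd)` is true for one of its root spans; the actual built-in rules may work differently.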
Server Config
{
  "mcpServers": {
    "iris": {
      "command": "npx",
      "args": [
        "-y",
        "@iris-eval/mcp-server"
      ]
    }
  }
}
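If your MCP client already has a config with other servers, the `iris` entry needs to be merged into the existing `mcpServers` object rather than replacing it. The Python sketch below shows one way to do that in memory. The helper name `add_iris_server` and the sample `other` server are hypothetical; only the `iris` entry itself is taken from the config above. Where the config file lives depends on your client, so file handling is left out.

```python
import copy
import json

# Copied from the server config above.
IRIS_ENTRY = {"command": "npx", "args": ["-y", "@iris-eval/mcp-server"]}


def add_iris_server(config: dict) -> dict:
    """Return a copy of an MCP client config with the iris server added.

    Hypothetical helper: existing servers are preserved and the input
    dict is left unmodified.
    """
    merged = copy.deepcopy(config)
    servers = merged.setdefault("mcpServers", {})
    servers["iris"] = copy.deepcopy(IRIS_ENTRY)
    return merged


if __name__ == "__main__":
    # Example client config with one pre-existing (made-up) server.
    existing = {"mcpServers": {"other": {"command": "node", "args": ["server.js"]}}}
    print(json.dumps(add_iris_server(existing), indent=2))
```

Working on a deep copy keeps the original config intact, so the merged result can be reviewed before it is written back to disk.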