Iris

@iris-eval

Visit Server

3 months ago

The first MCP-native eval and observability tool for AI agents. Any MCP-compatible agent discovers and uses Iris automatically — no SDK, no code changes. Log traces with hierarchical span trees, evaluate output quality with 12 built-in rules (PII detection, prompt injection, cost thresholds), and track what your agents are actually doing and costing you. Real-time dark-mode dashboard, OpenTelemetry-compatible span structure, self-hosted with SQLite. MIT licensed.

Overview Tools Comments

Comments

Try in Playground

Server Config

{
  "mcpServers": {
    "iris": {
      "command": "npx",
      "args": [
        "-y",
        "@iris-eval/mcp-server"
      ]
    }
  }
}

Build with ShipAny.

Explore
Playground
Blog
Cases
DXT
Partners

Privacy
Terms