#benchmarks
6 results found
Gecko Tests Mcp Server
Daily tests covering censorship, race bias, political orientation, IQ, rules vs human survival, real-life judgment, and model drift. 16 frontier & widely used models · 7 tests prepared · Censorship Index launching first · raw answers public after each run BenchGecko asks the questions people actually worry about: what AI refuses, who it protects, what it believes, and whether it changes over time.
PricePilot
Cross-channel CPG pricing intelligence — Amazon category benchmarks (percentile rank, trend, tier breakdown) for multi-channel brands selling on Amazon plus retail, DTC, or wholesale. Six tools across 800+ tracked products in Grocery, Health & Beauty, Household, and Pet Supplies. Free alternative to enterprise category data.
AssayChain
Mineral Intelligence, USGS commodity benchmarks, attested field run logs, and mining/geologic district data for AI agents. Nine data endpoints gated by x402 ($0.10 USDC on Base). Tools: get_commodity_benchmark (gold, silver, copper, and 17 more critical minerals), get_ultrasound_run_data (on-chain EAS-attested gravity-separation field runs), district.history (MRDS-sourced deposit and assay history by country/state/district), and ask_sales_agent. No API keys — pay per call in USDC.
Brightbean - Youtube intelligence MCP for AI agents
Youtube intelligence for AI agents - score titles & thumbnails, surface niche content gaps, benchmark channels and videos