agent-benchmark

Framework for measuring and tracking agent response quality over time. Detects regressions before they reach production. Use when evaluating agent changes, auditing quality, or establishing performanc

by vibeeval· Repository·other
Also installable via skills CLI
npx skills add vibeeval/vibecosystem/skills/agent-benchmark

Source

Path:skills/agent-benchmark/SKILL.md(main)

Related in other

agent-benchmark | AgentArea Skills