anthropic-evaluations

This skill should be used when the user asks to "create evals", "evaluate an agent", "build evaluation suite", or mentions agent testing, graders, or benchmarks. Also suggest when building coding agen

by dwmkerr · Repository · other
Also installable via the skills CLI:
npx skills add dwmkerr/claude-toolkit/plugins/toolkit/skills/anthropic-evaluations

Source

Path: plugins/toolkit/skills/anthropic-evaluations (main)
