agent-evaluation
Evaluate and improve Claude Code commands, skills, and agents. Use when testing prompt effectiveness, validating context engineering choices, or measuring improvement quality.
Also installable via skills CLI
npx skills add NeoLabHQ/context-engineering-kit/plugins/customaize-agent/skills/agent-evaluation