eval-recipes-runner-rysweet-amplihack
13Run Microsoft's eval-recipes benchmarks to validate amplihack improvements against baseline agents.
Also installable via skills CLI
npx skills add rysweet/amplihack/docs/claude/skills/eval-recipes-runner
Run Microsoft's eval-recipes benchmarks to validate amplihack improvements against baseline agents.