ai-evals
Create an AI Evals Pack (eval PRD, test set, rubric, judge plan, results + iteration loop). Use for LLM evaluation, benchmarks, rubrics, error analysis/open coding, and ship/no-ship quality gates for
Also installable via the skills CLI:
npx skills add liqiongyu/lenny_skills_plus/skills/ai-evals