ai-evals

Create an AI Evals Pack (eval PRD, test set, rubric, judge plan, results + iteration loop). Use for LLM evaluation, benchmarks, rubrics, error analysis/open coding, and ship/no-ship quality gates for

by liqiongyu · Repository · other
Also installable via the skills CLI:
npx skills add liqiongyu/lenny_skills_plus/skills/ai-evals

Source

Path: skills/ai-evals/SKILL.md (main)
