Evaluate LLM outputs with multi-dimensional rubrics, handle non-determinism, and implement LLM-as-judge patterns. Essential for production LLM systems. Use when testing prompts, validating outputs, co
product/grey-haven-evaluation
Interactive Product Owner skill for requirements gathering, analysis, and PRD generation. Triggers when users request pr...
Convert technical designs into actionable, sequenced implementation tasks. Create clear coding tasks that enable increme...