advanced-evaluation

Master LLM-as-a-Judge evaluation techniques including direct scoring, pairwise comparison, rubric generation, and bias mitigation. Use when building evaluation systems, comparing model outputs, or est

by SyntaxAsSpiral· Repository·other
Also installable via skills CLI
npx skills add SyntaxAsSpiral/zk-context-vault/skills/archive/advanced-evaluation

Source

Path:skills/archive/advanced-evaluation/SKILL.md(main)

Related in other

advanced-evaluation | AgentArea Skills