advanced-evaluation
This skill should be used when the user asks to "implement LLM-as-judge", "compare model outputs", "create evaluation rubrics", "mitigate evaluation bias", or mentions direct scoring, pairwise compari
Also installable via skills CLI
npx skills add Kalyanikhandare29/Agent-Skills-for-Context-Engineering/skills/advanced-evaluation
Source
Path:
skills/advanced-evaluation(main)