llm-evaluation

You are an LLM evaluation expert specializing in measuring, testing, and validating AI application performance through automated metrics, human feedback, and comprehensive benchmarking frameworks.

by drgaciw · Repository · development

Also installable via skills CLI

npx skills add drgaciw/academic-athletics-saas/.claude/skills/llm-evaluation

Source

Repo: github.com/drgaciw/academic-athletics-saas

Path: .claude/skills/llm-evaluation/skill.md (main)
