Master LLM-as-a-Judge evaluation techniques, including direct scoring, pairwise comparison, rubric generation, and bias mitigation. Use when building evaluation systems, comparing model outputs, or est
.claude/skills/advanced-evaluation
Use when you have lint errors, formatting issues, or before committing code to ensure it passes CI.
This skill should be used when the user asks to "update documentation for my changes", "check docs for this PR", "what d...
Write docstrings for PyTorch functions and methods following PyTorch conventions. Use when writing or updating docstring...