agent-evaluation-mlflow-majiayu000-claude-skill-regist
48Implement agent evaluation and safety gates using MLflow 3.x. Use for creating LLM-as-Judge scorers, evaluation datasets, quality gates, tracing, and continuous evaluation. Triggers on "evaluate agent
Also installable via skills CLI
npx skills add majiayu000/claude-skill-registry/skills/testing/agent-evaluation-mlflow
Source
Path:
skills/testing/agent-evaluation-mlflow/SKILL.md(main)