agent-evaluation-mlflow-majiayu000-claude-skill-regist

Implement agent evaluation and safety gates using MLflow 3.x. Use for creating LLM-as-Judge scorers, evaluation datasets, quality gates, tracing, and continuous evaluation. Triggers on "evaluate agent

by majiayu000 · Repository · testing

Also installable via skills CLI

npx skills add majiayu000/claude-skill-registry/skills/testing/agent-evaluation-mlflow

Source

Repo: github.com/majiayu000/claude-skill-registry

Path: skills/testing/agent-evaluation-mlflow/SKILL.md (main)

Related in testing

frontend-testing-langgenius-dify

Generate Vitest + React Testing Library tests for Dify frontend components, hooks, and utilities. Triggers on testing, s...

by langgenius

127,079

skill-creator-langgenius-dify

Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an exist...

by langgenius

127,079

browser-use-browser-use-browser-use

Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs...

by browser-use

76,633