ai-evaluation-suite

Comprehensive AI/LLM evaluation toolkit for production AI systems. Covers LLM output quality, prompt engineering, RAG evaluation, agent performance, hallucination detection, bias assessment, cost/toke

by doctorduke · Repository · other
Also installable via skills CLI
npx skills add doctorduke/claude-config/skills/ai-evaluation-suite

Source

Path: skills/ai-evaluation-suite/SKILL.md (main)

ai-evaluation-suite | AgentArea Skills