verl

Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexib

by MesferAli · Repository · other
Also installable via skills CLI
npx skills add MesferAli/XCircle/.claude/skills/verl

Source

Path: .claude/skills/verl/SKILL.md (main)

verl | AgentArea Skills