trl
This skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward mod
trl
Also installable via skills CLI
npx skills add evalstate/skills-dev/trl