This skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward mod
trl
Guide for performing Chromium version upgrades in the Electron project. Use when working on the roller/chromium/main bra...
指导Claude按照二哥的风格撰写求职类文章,包括公司薪资爆料、年终奖盘点、求职攻略、offer选择建议等内容。
The chat will provide safety guides, equipment suggestions, reviews, and techniques, with context-driven summary and mul...