training-check
Periodically check WandB metrics during training to catch problems early (NaN, loss divergence, idle GPUs). Avoids wasting GPU hours on broken runs. Use when training is running and you want automated
Also installable via skills CLI
npx skills add wanshuiyin/Auto-claude-code-research-in-sleep/skills/training-check
Source
Path:
skills/training-check/SKILL.md(main)