megatron-core
Trains large language models (2B-462B parameters) using NVIDIA Megatron-Core with advanced parallelism strategies. Use when training models >1B parameters or when you need maximum GPU efficiency (47% MFU on H100
Also installable via the skills CLI:
npx skills add ovachiever/droid-tings/skills/megatron-core