unsloth-inference
Unsloth models can be deployed using native optimized inference or through production serving engines like vLLM and SGLang. Native inference is accelerated 2x via forinference(), while production serv
Also installable via skills CLI
npx skills add cuba6112/skillfactory/skills/unsloth-inference