unsloth-inference

Unsloth models can be deployed using native optimized inference or through production serving engines like vLLM and SGLang. Native inference is accelerated 2x via for_inference(), while production serv

by cuba6112 · Repository · other
Also installable via skills CLI
npx skills add cuba6112/skillfactory/skills/unsloth-inference
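
The two deployment paths named in the summary can be sketched roughly as follows. This is a minimal illustration, not the skill's own code: the helper names (load_for_inference, generate_native, generate_with_vllm), the default values, and the example model name are assumptions made here, and the vLLM helper assumes the model has already been exported to a directory or Hub repository that vLLM can load. Both libraries need a CUDA GPU, so they are imported lazily inside the functions.

```python
def load_for_inference(model_name: str, max_seq_length: int = 2048):
    """Load a model with Unsloth and switch it to its faster inference mode."""
    # Lazy import: unsloth requires a CUDA GPU and is slow to import.
    from unsloth import FastLanguageModel

    model, tokenizer = FastLanguageModel.from_pretrained(
        model_name=model_name,
        max_seq_length=max_seq_length,
        load_in_4bit=True,  # assumption: 4-bit weights to reduce memory use
    )
    # Enables Unsloth's native inference optimizations on the loaded model.
    FastLanguageModel.for_inference(model)
    return model, tokenizer


def generate_native(model, tokenizer, prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a completion and return only the newly produced text."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens so that only the continuation is decoded.
    prompt_len = inputs["input_ids"].shape[1]
    return tokenizer.decode(output_ids[0][prompt_len:], skip_special_tokens=True)


def generate_with_vllm(model_path: str, prompt: str, max_tokens: int = 64) -> str:
    """Run the same kind of request through vLLM's offline engine."""
    # Lazy import: vllm is a separate install and also needs a GPU.
    from vllm import LLM, SamplingParams

    llm = LLM(model=model_path)  # model_path: exported model dir or Hub repo id
    outputs = llm.generate([prompt], SamplingParams(max_tokens=max_tokens))
    return outputs[0].outputs[0].text


# Example usage (model name is illustrative only):
# model, tokenizer = load_for_inference("unsloth/llama-3-8b-bnb-4bit")
# print(generate_native(model, tokenizer, "Explain LoRA in one sentence."))
```

Keeping the load step separate from generation means the for_inference() call happens once per process, while generate_native can be called repeatedly on the same model.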

Source

Path: skills/unsloth-inference (main)
