inference

Fast inference with Unsloth and vLLM backend. Covers model loading, fast_generate(),thinking model output parsing, and memory management for efficient inference.

by atrawog· Repository·devops

Run in AgentArea Browse All Skills

Also installable via skills CLI

npx skills add atrawog/bazzite-ai-plugins/bazzite-ai-jupyter/skills/inference

Source

Repo:SkillsMP + GitHub Raw

Path:bazzite-ai-jupyter/skills/inference(main)

Related in devops

create-pr-n8n-io-n8n

Creates GitHub pull requests with properly formatted titles that pass the check-pr-title CI validation. Use when creating PRs, sub...

by n8n-io

170,998

create-database-migration-tryghost-ghost

Create a database migration to add a table, add columns to an existing table, add a setting, or otherwise change the schema of Gho...

by TryGhost

51,672

internal-comms

A set of resources to help me write all kinds of internal communications, using the formats that my company likes to use. Claude s...