ml-inference-optimization
ML inference latency optimization, model compression, distillation, caching strategies, and edge deployment patterns. Use when optimizing inference performance, reducing model size, or deploying ML at the edge.
Also installable via the skills CLI:
npx skills add melodic-software/claude-code-plugins/devops/ml-inference-optimization