Guidance for optimizing LLM inference request batching and scheduling problems. This skill applies when designing batch schedulers that minimize cost while meeting latency and padding constraints, inv
design/llm-inference-batching-scheduler
Add unsigned integer (uint) type support to PyTorch operators by updating AT_DISPATCH macros. Use when adding support fo...
Guide users through creating Agent Skills for Claude Code. Use when the user wants to create, write, author, or design a...
Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to...