Guidance for implementing tensor parallelism in PyTorch, including ColumnParallelLinear and RowParallelLinear layers. This skill should be used when implementing distributed tensor parallel operations
design/torch-tensor-parallelism
Add unsigned integer (uint) type support to PyTorch operators by updating AT_DISPATCH macros. Use when adding support fo...
Guide users through creating Agent Skills for Claude Code. Use when the user wants to create, write, author, or design a...
Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to...