vllm-skill

Serves LLMs with high throughput using vLLM's PagedAttention and continuous batching. Use when deploying production LLM APIs, optimizing inference latency/throughput, or serving models with limited GP

by DoanNgocCuong · Repository · other


Also installable via skills CLI

npx skills add DoanNgocCuong/home/3 - THE ROAD/3.2 [STRUCTURES - B- MILESTONES]/3.2.2 MONEYGame/3.2.1.1 KIẾM TIỀN - SKILL/your_project/claude/skills/DataScienceAndAI/6 - Applications/1_LLMs/vllm-skill

Source

Repo: github.com/DoanNgocCuong/home

Path: 3 - THE ROAD/3.2 [STRUCTURES - B- MILESTONES]/3.2.2 MONEYGame/3.2.1.1 KIẾM TIỀN - SKILL/your_project/claude/skills/DataScienceAndAI/6 - Applications/1_LLMs/vllm-skill/SKILL.md (main)

Related in other

agent-memory-yamadashy-repomix

Use this skill when the user asks to save, remember, recall, or organize memories. Triggers on: 'remember this', 'save t...

by yamadashy

21,427

task-execution-engine

CLI tool for configuring and monitoring Claude Code

by davila7

18,218

qiuzhi

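Guides Claude in writing job-hunting articles in the style of 二哥 (Er Ge), including company salary leaks, year-end bonus roundups, job-search guides, and advice on choosing between offers.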

by itwanger

16,619