Optimizes LLM inference with NVIDIA TensorRT for maximum throughput and lowest latency. Use for production deployment on NVIDIA GPUs (A100/H100), when you need 10-100x faster inference than PyTorch, o
.claude/skills/tensorrt-llm/SKILL.md
Use this skill when the user asks to save, remember, recall, or organize memories. Triggers on: 'remember this', 'save t...
CLI tool for configuring and monitoring Claude Code
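Guides Claude in writing job-search articles in the style of 二哥, including company salary leaks, year-end bonus roundups, job-hunting guides, and offer selection advice.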