Guidance for optimizing LLM inference request batching and scheduling problems. This skill applies when designing batch schedulers that minimize cost while meeting latency and padding constraints, inv
letta/benchmarks/trajectory-only/llm-inference-batching-scheduler(main)