vllm

vLLM integration for high-performance inference.

Submodules

vllm - Main vLLM wrapper
batching - Batching support
sampling - Sampling utilities
engines - Engine implementations
model_runners - Model runners
workers - Worker implementations