llama-stack-mirror/llama_stack/templates/vllm-gpu
2025-01-16 15:09:20 -08:00
..
__init__.py Update more distribution docs to be simpler and partially codegen'ed 2024-11-20 22:03:44 -08:00
build.yaml template update 2025-01-16 15:09:20 -08:00
run.yaml template update 2025-01-16 15:09:20 -08:00
vllm.py rename LLAMASTACK_PORT to LLAMA_STACK_PORT for consistency with other env vars (#744) 2025-01-10 11:09:49 -08:00