llama-stack-mirror/llama_stack/templates/vllm/build.yaml
2024-10-25 12:37:15 -07:00

name: vllm
distribution_spec:
  description: Like local, but use vLLM for running LLM inference
  providers:
    inference: vllm
    memory: meta-reference
    safety: meta-reference
    agents: meta-reference
    telemetry: meta-reference