llama-stack-mirror/llama_stack/templates/remote-vllm/build.yaml

name: remote-vllm
distribution_spec:
  description: Use (an external) vLLM server for running LLM inference
  providers:
    inference: remote::vllm
    memory:
    - meta-reference
    - remote::chromadb
    - remote::pgvector
    safety: inline::llama-guard
    agents: meta-reference
    telemetry: meta-reference
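
# A minimal sketch of how a build template like this is typically turned into a
# runnable distribution, assuming the standard `llama stack` CLI (exact flags
# may differ between llama-stack versions):
#
#   llama stack build --template remote-vllm --image-type conda
#
# The generated run configuration is then expected to point the remote::vllm
# inference provider at an already-running vLLM server (e.g. via a VLLM_URL
# environment setting); the variable name here is an assumption based on the
# companion run.yaml, not defined in this file.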