llama-stack-mirror/llama_stack/templates/hf-endpoint/build.yaml at 4971113f923597a39738c66f9b2e578d975089cd - phoenix-oss/llama-stack-mirror - Git for basel.kvant.cloud

phoenix-oss/llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-15 14:43:48 +00:00

Ashwin Bharambe 4971113f92 Update provider_type -> inline::llama-guard in templates, update run.yaml

2024-11-11 09:28:07 -08:00

9 lines

337 B

YAML

Raw Blame History

 name: hf-endpoint
 distribution_spec:
   description: "Like local, but use Hugging Face Inference Endpoints for running LLM inference.\nSee https://hf.co/docs/api-endpoints."
   providers:
     inference: remote::hf::endpoint
     memory: inline::faiss
     safety: inline::llama-guard
     agents: meta-reference
     telemetry: meta-reference