# What does this PR do?

Create a distribution template that uses Groq as the inference provider.

Link to issue: https://github.com/meta-llama/llama-stack/issues/958

## Test Plan

Run `python llama_stack/scripts/distro_codegen.py` to generate `run.yaml` and `build.yaml`.

Test the newly created template by running:

- `llama stack build --template <template-name>`
- `llama stack run <template-name>`
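A minimal end-to-end sketch of the test plan, assuming the new template is named `groq` and that its provider reads the API key from a `GROQ_API_KEY` environment variable (both are assumptions, not confirmed by this description):

```bash
# Regenerate run.yaml and build.yaml for all distribution templates.
python llama_stack/scripts/distro_codegen.py

# The Groq provider needs an API key at runtime; the env var name used here
# (GROQ_API_KEY) is an assumption and depends on the template's run.yaml.
export GROQ_API_KEY=<your-groq-api-key>

# Build and launch the newly created template ("groq" is the assumed template name).
llama stack build --template groq
llama stack run groq
```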