# What does this PR do?

Create a distribution template using Groq as the inference provider.

Link to issue: https://github.com/meta-llama/llama-stack/issues/958

## Test Plan

Run `python llama_stack/scripts/distro_codegen.py` to generate `run.yaml` and `build.yaml`.

Test the newly created template by running `llama stack build --template <template-name>` followed by `llama stack run <template-name>` (see the sketch below).
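A minimal sketch of the test plan as a shell session, assuming the new template is named `groq` (substitute the actual `<template-name>` if it differs):

```bash
# Regenerate run.yaml and build.yaml for the distribution templates
python llama_stack/scripts/distro_codegen.py

# Build and then run the newly created template;
# the name `groq` is assumed here, not confirmed by the PR
llama stack build --template groq
llama stack run groq
```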
Distribution template docs in this directory (`groq.md` is added by this PR):

- bedrock.md
- cerebras.md
- dell-tgi.md
- dell.md
- fireworks.md
- groq.md
- meta-reference-gpu.md
- meta-reference-quantized-gpu.md
- nvidia.md
- ollama.md
- remote-vllm.md
- sambanova.md
- tgi.md
- together.md