Commit graph

5 commits

Author SHA1 Message Date
Martin Hickey
73e33fb747 Fix conda env names in distribution example run template
The Self-Hosted Distribution documentation contains steps
to start the llama server via a conda environment. If the user
generates the conda environment using the llama build command and template,
it generates an environment named after the distribution and not
the default name of local used in the example run yaml file. The
server will therefore fail when the user tries to run it.

This PR fixes the conda env name in the run yaml file.

Signed-off-by: Martin Hickey <martin.hickey@ie.ibm.com>
2024-11-15 15:32:52 +00:00
Ashwin Bharambe
3d7561e55c
Rename all inline providers with an inline:: prefix (#423) 2024-11-11 22:19:16 -08:00
Ashwin Bharambe
c1f7ba3aed
Split safety into (llama-guard, prompt-guard, code-scanner) (#400)
Splits the meta-reference safety implementation into three distinct providers:

- inline::llama-guard
- inline::prompt-guard
- inline::code-scanner

Note that this PR is a backward incompatible change to the llama stack server. I have added deprecation_error field to ProviderSpec -- the server reads it and immediately barfs. This is used to direct the user with a specific message on what action to perform. An automagical "config upgrade" is a bit too much work to implement right now :/

(Note that we will be gradually prefixing all inline providers with inline:: -- I am only doing this for this set of new providers because otherwise existing configuration files will break even more badly.)
2024-11-11 09:29:18 -08:00
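
The fail-fast behavior described in c1f7ba3aed (#400) might look roughly like the sketch below. This is an illustration only, not the actual llama stack code: the simplified `ProviderSpec`, the `resolve_provider` helper, the `REGISTRY` contents, and the error text are all hypothetical. Only the `deprecation_error` field name and the `inline::` provider names come from the commit message.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class ProviderSpec:
    # Hypothetical, simplified stand-in for the real ProviderSpec.
    provider_type: str
    # When set, the provider must no longer be used; the text tells the
    # user what action to take instead.
    deprecation_error: Optional[str] = None


def resolve_provider(registry: Dict[str, ProviderSpec], provider_type: str) -> ProviderSpec:
    """Look up a provider spec and fail immediately if it is deprecated."""
    spec = registry[provider_type]
    if spec.deprecation_error:
        # Surface the specific message instead of attempting a config upgrade.
        raise ValueError(spec.deprecation_error)
    return spec


# Hypothetical registry: one deprecated entry and one of the new providers.
REGISTRY = {
    "meta-reference": ProviderSpec(
        provider_type="meta-reference",
        deprecation_error=(
            "This provider has been split. Update your run config to use "
            "inline::llama-guard, inline::prompt-guard, or inline::code-scanner."
        ),
    ),
    "inline::llama-guard": ProviderSpec(provider_type="inline::llama-guard"),
}
```

Calling `resolve_provider(REGISTRY, "meta-reference")` raises a `ValueError` carrying the migration message, while `resolve_provider(REGISTRY, "inline::llama-guard")` returns the spec normally.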
Ashwin Bharambe
161aef0aae Small updates to quantization config 2024-10-24 12:08:56 -07:00
Ashwin Bharambe
05a8d47b98 Add a meta-reference-quantized-gpu distribution 2024-10-23 21:45:50 -07:00