llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-17 02:32:37 +00:00

Author	SHA1	Message	Date
Martin Hickey	73e33fb747	Fix conda env names in distribution example run template The Self-Hosted Distribution documentation contain steps to start llama server via conda environment. If the user generates conda environment using llaama build command and template, it generate an environment with name of the distribution and not defaulyt name of local as per the example run yaml file. It will thereore fail when user tried to run the server. This PR fixes the conda env name in the run yaml file. Signed-off-by: Martin Hickey <martin.hickey@ie.ibm.com>	2024-11-15 15:32:52 +00:00
Ashwin Bharambe	3d7561e55c	Rename all inline providers with an inline:: prefix (#423 )	2024-11-11 22:19:16 -08:00
Ashwin Bharambe	c1f7ba3aed	Split safety into (llama-guard, prompt-guard, code-scanner) (#400 ) Splits the meta-reference safety implementation into three distinct providers: - inline::llama-guard - inline::prompt-guard - inline::code-scanner Note that this PR is a backward incompatible change to the llama stack server. I have added deprecation_error field to ProviderSpec -- the server reads it and immediately barfs. This is used to direct the user with a specific message on what action to perform. An automagical "config upgrade" is a bit too much work to implement right now :/ (Note that we will be gradually prefixing all inline providers with inline:: -- I am only doing this for this set of new providers because otherwise existing configuration files will break even more badly.)	2024-11-11 09:29:18 -08:00
Ashwin Bharambe	161aef0aae	Small updates to quantization config	2024-10-24 12:08:56 -07:00
Ashwin Bharambe	05a8d47b98	Add a meta-reference-quantized-gpu distribution	2024-10-23 21:45:50 -07:00

5 commits