llama-stack-mirror/llama_stack/providers/remote/inference/vllm
2024-11-17 19:49:15 -08:00
__init__.py  impls -> inline, adapters -> remote (#381)                               2024-11-06 14:54:05 -08:00
config.py    Allow setting environment variables from llama stack run and fix ollama  2024-11-17 19:49:15 -08:00
vllm.py      unregister for memory banks and remove update API (#458)                 2024-11-14 17:12:11 -08:00