llama-stack-mirror/llama_stack/providers/utils/inference
2024-11-07 12:43:55 -05:00
..
__init__.py Use inference APIs for executing Llama Guard (#121) 2024-09-28 15:40:06 -07:00
model_registry.py Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
openai_compat.py Merge branch 'main' into santiagxf/azure-ai-inference 2024-11-07 12:43:55 -05:00
prompt_adapter.py remote::vllm now works with vision models 2024-11-06 16:07:17 -08:00