llama-stack-mirror/llama_stack/providers/utils/inference
2024-11-08 12:21:02 -08:00
__init__.py | Use inference APIs for executing Llama Guard (#121) | 2024-09-28 15:40:06 -07:00
model_registry.py | resource oriented object design for models | 2024-11-08 12:21:02 -08:00
openai_compat.py | Enable vision models for (Together, Fireworks, Meta-Reference, Ollama) (#376) | 2024-11-05 16:22:33 -08:00
prompt_adapter.py | remote::vllm now works with vision models | 2024-11-06 16:07:17 -08:00