llama-stack-mirror/llama_stack/providers/utils/inference
2024-10-23 11:44:04 -07:00
..
__init__.py Use inference APIs for executing Llama Guard (#121) 2024-09-28 15:40:06 -07:00
model_registry.py Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
openai_compat.py dont set num_predict for all providers (#294) 2024-10-23 11:44:04 -07:00
prompt_adapter.py Add support for Structured Output / Guided decoding (#281) 2024-10-22 12:53:34 -07:00