llama-stack-mirror/llama_stack/providers/adapters/inference
2024-10-23 19:11:04 -07:00
bedrock      Add support for Structured Output / Guided decoding (#281)             2024-10-22 12:53:34 -07:00
databricks   refactor get_max_tokens and build_options                              2024-10-23 19:11:04 -07:00
fireworks    refactor get_max_tokens and build_options                              2024-10-23 19:11:04 -07:00
ollama       refactor get_max_tokens and build_options                              2024-10-23 19:11:04 -07:00
sample       Remove "routing_table" and "routing_key" concepts for the user (#201)  2024-10-10 10:24:13 -07:00
tgi          refactor get_max_tokens and build_options                              2024-10-23 19:11:04 -07:00
together     refactor get_max_tokens and build_options                              2024-10-23 19:11:04 -07:00
vllm         refactor get_max_tokens and build_options                              2024-10-23 19:11:04 -07:00
__init__.py  API Updates (#73)                                                      2024-09-17 19:51:35 -07:00