llama-stack-mirror/llama_stack/core/routers
Matthew Farrellee f754e1b65b chore: remove deprecated inference.chat_completion implementations
vllm -
 - requires max_tokens be set, use config value
 - set tool_choice to none if no tools provided
2025-10-02 10:39:30 -04:00
..
__init__.py chore: introduce write queue for inference_store (#3383) 2025-09-10 11:57:42 -07:00
datasets.py refactor(logging): rename llama_stack logger categories (#3065) 2025-08-21 17:31:04 -07:00
eval_scoring.py refactor(logging): rename llama_stack logger categories (#3065) 2025-08-21 17:31:04 -07:00
inference.py chore: remove deprecated inference.chat_completion implementations 2025-10-02 10:39:30 -04:00
safety.py refactor(logging): rename llama_stack logger categories (#3065) 2025-08-21 17:31:04 -07:00
tool_runtime.py refactor(logging): rename llama_stack logger categories (#3065) 2025-08-21 17:31:04 -07:00
vector_io.py feat(api): Add Vector Store File batches api stub (#3615) 2025-09-30 12:07:33 -07:00