llama-stack-mirror/llama_stack/providers/utils/inference
2025-09-15 12:32:34 -07:00
__init__.py chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
embedding_mixin.py fix: Make SentenceTransformer embedding operations non-blocking (#3335) 2025-09-04 13:58:41 -04:00
inference_store.py fix(inference_store): on duplicate chat completion IDs, replace (#3408) 2025-09-10 14:34:18 -07:00
litellm_openai_mixin.py chore: indicate to mypy that InferenceProvider.batch_completion/batch_chat_completion is concrete (#3239) 2025-08-22 14:17:30 -07:00
model_registry.py fix issue 3330 2025-09-15 12:32:34 -07:00
openai_compat.py refactor(logging): rename llama_stack logger categories (#3065) 2025-08-21 17:31:04 -07:00
openai_mixin.py chore: update the vLLM inference impl to use OpenAIMixin for openai-compat functions (#3404) 2025-09-11 09:04:38 -04:00
prompt_adapter.py refactor(logging): rename llama_stack logger categories (#3065) 2025-08-21 17:31:04 -07:00