llama-stack-mirror/llama_stack/providers/utils/inference
Eric Huang a1b356a2a2 impl
# What does this PR do?


## Test Plan
# What does this PR do?


## Test Plan
2025-05-21 22:02:54 -07:00
..
__init__.py chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
embedding_mixin.py chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
inference_store.py impl 2025-05-21 22:02:54 -07:00
litellm_openai_mixin.py feat: introduce APIs for retrieving chat completion requests (#2145) 2025-05-18 21:43:19 -07:00
model_registry.py chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
openai_compat.py fix: multiple tool calls in remote-vllm chat_completion (#2161) 2025-05-15 11:23:29 -07:00
prompt_adapter.py chore: more mypy fixes (#2029) 2025-05-06 09:52:31 -07:00
stream_utils.py impl 2025-05-21 22:02:54 -07:00