llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-05 04:17:32 +00:00

History

ehhuang ee5e9b935a feat: better using get_default_tool_prompt_format (#1360 ) Summary: https://github.com/meta-llama/llama-stack/pull/1214 introduced `get_default_tool_prompt_format` but tried to use it on the raw identifier. Here we move calling this func later in the stack and rely on the inference provider to resolve the raw identifier into llama model, then call get_default_tool_prompt_format. Test Plan: ``` LLAMA_STACK_CONFIG=ollama pytest -s -v tests/client-sdk/inference/test_text_inference.py::test_text_chat_completion_with_tool_calling_and_non_streaming --inference-model=llama3.2:3b-instruct-fp16 --vision-inference-model="" ``` Before: <img width="1288" alt="image" src="https://github.com/user-attachments/assets/918c7839-1f45-4540-864e-4b842cc367df" /> After: <img width="1522" alt="image" src="https://github.com/user-attachments/assets/447d78af-b3b9-4837-8cb7-6ac549005efe" />		2025-03-03 14:50:06 -08:00
..
__init__.py	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
embedding_mixin.py	fix: dont assume SentenceTransformer is imported	2025-02-25 16:53:01 -08:00
litellm_openai_mixin.py	feat: add a configurable category-based logger (#1352 )	2025-03-02 18:51:14 -08:00
model_registry.py	feat(providers): support non-llama models for inference providers (#1200 )	2025-02-21 13:21:28 -08:00
openai_compat.py	chore(lint): update Ruff ignores for project conventions and maintainability (#1184 )	2025-02-28 09:36:49 -08:00
prompt_adapter.py	feat: better using get_default_tool_prompt_format (#1360 )	2025-03-03 14:50:06 -08:00