llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 18:00:36 +00:00

Matthew Farrellee 62e0aef7bc fix: return llama stack model id from embeddings (#3525)	2025-09-23 12:30:00 -04:00

# What does this PR do?

the openai_embeddings method on OpenAIMixin was returning the provider's model id instead of the llama stack name

## Test Plan

before -
```
$ ./scripts/integration-tests.sh --stack-config server:ci-tests --setup gpt --subdirs inference --inference-mode live --pattern test_openai_embeddings_single_string
...
FAILED tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_single_string[openai_client-emb=openai/text-embedding-3-small] - AssertionError: assert 'text-embedding-3-small' == 'openai/text-...dding-3-small'
FAILED tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_single_string[llama_stack_client-emb=openai/text-embedding-3-small] - AssertionError: assert 'text-embedding-3-small' == 'openai/text-...dding-3-small'
========================================== 2 failed, 95 deselected, 4 warnings in 3.87s ===========================================
```

after -
```
$ ./scripts/integration-tests.sh --stack-config server:ci-tests --setup gpt --subdirs inference --inference-mode live --pattern test_openai_embeddings_single_string
...
========================================== 2 passed, 95 deselected, 4 warnings in 2.12s ===========================================
```
__init__.py	chore: enable pyupgrade fixes (#1806)	2025-05-01 14:23:50 -07:00
embedding_mixin.py	fix: Make SentenceTransformer embedding operations non-blocking (#3335)	2025-09-04 13:58:41 -04:00
inference_store.py	chore: simplify authorized sqlstore (#3496)	2025-09-19 16:13:56 -07:00
litellm_openai_mixin.py	chore: indicate to mypy that InferenceProvider.batch_completion/batch_chat_completion is concrete (#3239)	2025-08-22 14:17:30 -07:00
model_registry.py	fix: change ModelRegistryHelper to use ProviderModelEntry instead of hardcoded ModelType.llm (#3451)	2025-09-22 12:55:32 -04:00
openai_compat.py	refactor(logging): rename llama_stack logger categories (#3065)	2025-08-21 17:31:04 -07:00
openai_mixin.py	fix: return llama stack model id from embeddings (#3525)	2025-09-23 12:30:00 -04:00
prompt_adapter.py	refactor(logging): rename llama_stack logger categories (#3065)	2025-08-21 17:31:04 -07:00
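
The latest commit (#3525) describes an embeddings response that carried the provider's model id (`text-embedding-3-small`) where callers expected the llama stack name (`openai/text-embedding-3-small`). The sketch below illustrates that kind of fix in isolation. It is not the code in `openai_mixin.py`: `EmbeddingsResponse`, `FakeProviderClient`, `EmbeddingsAdapter`, and the `stack_to_provider` mapping are all hypothetical names invented for this example.

```python
from dataclasses import dataclass


@dataclass
class EmbeddingsResponse:
    """Hypothetical response type: the model name plus one vector per input."""
    model: str
    data: list[list[float]]


class FakeProviderClient:
    """Hypothetical stand-in for a provider SDK; it echoes the provider model id."""

    def embed(self, model: str, inputs: list[str]) -> EmbeddingsResponse:
        return EmbeddingsResponse(model=model, data=[[0.0] for _ in inputs])


class EmbeddingsAdapter:
    """Hypothetical adapter that maps stack model ids to provider model ids."""

    def __init__(self, client: FakeProviderClient, stack_to_provider: dict[str, str]):
        self.client = client
        self.stack_to_provider = stack_to_provider

    def embeddings(self, model: str, inputs: list[str]) -> EmbeddingsResponse:
        # Translate the caller-facing id into the id the provider understands.
        provider_model_id = self.stack_to_provider[model]
        raw = self.client.embed(provider_model_id, inputs)
        # Report the id the caller asked for, not raw.model (the provider's id).
        return EmbeddingsResponse(model=model, data=raw.data)


adapter = EmbeddingsAdapter(
    FakeProviderClient(),
    {"openai/text-embedding-3-small": "text-embedding-3-small"},
)
response = adapter.embeddings("openai/text-embedding-3-small", ["hello"])
print(response.model)
```

Returning `raw.model` on the last line of `embeddings` would reproduce the mismatch shown in the failing assertion of the test plan; passing the caller's `model` through keeps the response consistent with the id used in the request.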