llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-11 19:56:03 +00:00

Ashwin Bharambe d089a6d106 fix(inference): enable routing of models with provider_data alone		2025-10-27 18:58:32 -07:00

Assume a remote inference provider which works only when users provide their own API keys via provider_data. By definition, we cannot list models and hence update our routing registry. But because we _require_ a provider ID in the models now, we can identify which provider to route to and let that provider decide.

Note that we still try to look up our registry since it may have a pre-registered alias. Just that we don't outright fail when we are not able to look it up.

Also, updated inference router so that the responses have the _exact_ model that the request had.

Added an integration test
recordings	feat(api)!: BREAKING CHANGE: support passing `extra_body` through to providers (#3777)	2025-10-10 16:21:44 -07:00
__init__.py	fix: remove ruff N999 (#1388)	2025-03-07 11:14:04 -08:00
dog.png	refactor: tests/unittests -> tests/unit; tests/api -> tests/integration	2025-03-04 09:57:00 -08:00
test_openai_completion.py	fix: Fixed WatsonX remote inference provider (#3801)	2025-10-14 14:52:32 +02:00
test_openai_embeddings.py	fix(inference): enable routing of models with provider_data alone	2025-10-27 18:58:32 -07:00
test_openai_vision_inference.py	feat(internal): add image_url download feature to OpenAIMixin (#3516)	2025-09-26 17:32:16 -04:00
test_provider_data_routing.py	fix(inference): enable routing of models with provider_data alone	2025-10-27 18:58:32 -07:00
test_tools_with_schemas.py	feat(tools)!: substantial clean up of "Tool" related datatypes (#3627)	2025-10-02 15:12:03 -07:00
test_vision_inference.py	chore(apis): unpublish deprecated /v1/inference apis (#3297)	2025-09-27 11:20:06 -07:00
vision_test_1.jpg	feat: introduce llama4 support (#1877)	2025-04-05 11:53:35 -07:00
vision_test_2.jpg	feat: introduce llama4 support (#1877)	2025-04-05 11:53:35 -07:00
vision_test_3.jpg	feat: introduce llama4 support (#1877)	2025-04-05 11:53:35 -07:00
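
The routing behavior described in the latest commit message (d089a6d106) can be illustrated with a rough sketch. This is not the code from the repository: `Route`, `resolve_route`, `route_request`, the dict-based registry, and the `<provider_id>/<provider_model_id>` model format are all assumptions made for illustration. It only shows the idea of trying the registry first, falling back to the provider ID carried in the model name, and echoing the request's exact model string in the response.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    """Hypothetical routing target: which provider, and which model on it."""
    provider_id: str
    provider_model_id: str


def resolve_route(model: str, registry: dict[str, Route]) -> Route:
    """Pick a provider for `model`, preferring a pre-registered entry."""
    # 1. The registry may hold a pre-registered model or alias; use it if so.
    if model in registry:
        return registry[model]
    # 2. Do not fail outright on a registry miss: fall back to the provider ID
    #    embedded in the model name (assumed format "<provider_id>/<model>").
    provider_id, sep, provider_model_id = model.partition("/")
    if not sep or not provider_id or not provider_model_id:
        raise ValueError(f"cannot route {model!r}: not registered, no provider prefix")
    return Route(provider_id, provider_model_id)


def route_request(model: str, registry: dict[str, Route]) -> dict[str, str]:
    """Resolve a route and build a stand-in response for the request."""
    route = resolve_route(model, registry)
    # A real router would call the provider here, and the provider would decide
    # whether the model is usable with the caller's provider_data (API key).
    return {
        "model": model,  # echo the exact model string from the request
        "provider_id": route.provider_id,
        "provider_model_id": route.provider_model_id,
    }
```

In this sketch a registered alias such as `"my-alias"` resolves through the registry, while an unregistered `"provider-b/model-y"` is routed to `provider-b` purely from its prefix; both responses carry the model string exactly as requested.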