llama-stack-mirror/llama_stack
Dinesh Yeduguru fdff24e77a
Inference to use provider resource id to register and validate (#428)
This PR changes the way model id gets translated to the final model name
that gets passed through the provider.
Major changes include:
1) Providers are responsible for registering an object and as part of
the registration returning the object with the correct provider specific
name of the model provider_resource_id
2) To help with the common look ups different names a new ModelLookup
class is created.



Tested all inference providers including together, fireworks, vllm,
ollama, meta reference and bedrock
2024-11-12 20:02:00 -08:00
..
apis Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
cli Rename all inline providers with an inline:: prefix (#423) 2024-11-11 22:19:16 -08:00
distribution Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
providers Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
scripts Add a test for CLI, but not fully done so disabled 2024-09-19 13:27:07 -07:00
templates Update provider types and prefix with inline:: 2024-11-12 12:54:44 -08:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00