llama-stack/llama_stack/providers/remote/inference
Dinesh Yeduguru fdff24e77a
Inference to use provider resource id to register and validate (#428)
This PR changes the way model id gets translated to the final model name
that gets passed through the provider.
Major changes include:
1) Providers are responsible for registering an object and as part of
the registration returning the object with the correct provider specific
name of the model provider_resource_id
2) To help with the common look ups different names a new ModelLookup
class is created.



Tested all inference providers including together, fireworks, vllm,
ollama, meta reference and bedrock
2024-11-12 20:02:00 -08:00
..
bedrock Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
databricks Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
fireworks Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
ollama Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
sample migrate model to Resource and new registration signature (#410) 2024-11-08 16:12:57 -08:00
tgi migrate model to Resource and new registration signature (#410) 2024-11-08 16:12:57 -08:00
together Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
vllm Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
__init__.py impls -> inline, adapters -> remote (#381) 2024-11-06 14:54:05 -08:00