llama-stack/llama_stack/providers/inline
Dinesh Yeduguru fdff24e77a
Inference to use provider resource id to register and validate (#428)
This PR changes how a model id is translated into the final model name
that is passed to the provider.
Major changes include:
1) Providers are responsible for registering an object and, as part of
registration, returning the object with provider_resource_id set to the
correct provider-specific name of the model.
2) To help with common lookups across the different names a model can
have, a new ModelLookup class is created.
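
The two changes above can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: the `Model` dataclass, `ModelLookupSketch`, its `get_provider_model_id` method, `ExampleInferenceProvider`, and the `example-vendor/...` model name are all hypothetical stand-ins, and registration is shown as a plain synchronous call for brevity.

```python
from dataclasses import dataclass, replace
from typing import Dict, Iterable, Optional


@dataclass(frozen=True)
class Model:
    # Hypothetical stand-in for the stack's model resource.
    identifier: str
    provider_resource_id: Optional[str] = None


class ModelLookupSketch:
    """Hypothetical alias table; not the ModelLookup class from the PR."""

    def __init__(self, aliases: Dict[str, Iterable[str]]) -> None:
        # aliases maps a provider-specific model name to the other
        # names a caller might use for the same model.
        self._by_name: Dict[str, str] = {}
        for provider_name, names in aliases.items():
            self._by_name[provider_name] = provider_name
            for name in names:
                self._by_name[name] = provider_name

    def get_provider_model_id(self, name: str) -> str:
        # Validation step: unknown names are rejected at registration time.
        try:
            return self._by_name[name]
        except KeyError:
            raise ValueError(f"Unknown model: {name!r}") from None


class ExampleInferenceProvider:
    """Hypothetical provider that owns the name translation."""

    def __init__(self, lookup: ModelLookupSketch) -> None:
        self._lookup = lookup

    def register_model(self, model: Model) -> Model:
        # Resolve whichever name was supplied, then return a copy of the
        # object carrying the provider-specific name.
        requested = model.provider_resource_id or model.identifier
        provider_name = self._lookup.get_provider_model_id(requested)
        return replace(model, provider_resource_id=provider_name)


# Usage: the caller registers by a common name and gets back the
# object with provider_resource_id filled in by the provider.
lookup = ModelLookupSketch(
    {"example-vendor/llama-3.1-8b-instruct": ["Llama3.1-8B-Instruct"]}
)
provider = ExampleInferenceProvider(lookup)
registered = provider.register_model(Model(identifier="Llama3.1-8B-Instruct"))
```

In this sketch, the name that reaches the provider's backend is always `registered.provider_resource_id`, so the translation happens once at registration rather than on every request.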



Tested all inference providers, including together, fireworks, vllm,
ollama, meta reference, and bedrock.
2024-11-12 20:02:00 -08:00
agents Allow specifying resources in StackRunConfig (#425) 2024-11-12 10:58:49 -08:00
datasetio/localfs migrate dataset to resource (#420) 2024-11-11 17:14:41 -08:00
eval/meta_reference Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
inference Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
ios/inference impls -> inline, adapters -> remote (#381) 2024-11-06 14:54:05 -08:00
memory versioned persistence key prefixes 2024-11-12 10:30:39 -08:00
meta_reference Folder restructure for evals/datasets/scoring (#419) 2024-11-11 17:35:40 -05:00
safety Remove the "ShieldType" concept (#430) 2024-11-12 12:37:24 -08:00
scoring Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
__init__.py impls -> inline, adapters -> remote (#381) 2024-11-06 14:54:05 -08:00