llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 09:53:45 +00:00

History

Dinesh Yeduguru fdff24e77a Inference to use provider resource id to register and validate (#428 ) This PR changes the way model id gets translated to the final model name that gets passed through the provider. Major changes include: 1) Providers are responsible for registering an object and as part of the registration returning the object with the correct provider specific name of the model provider_resource_id 2) To help with the common look ups different names a new ModelLookup class is created. Tested all inference providers including together, fireworks, vllm, ollama, meta reference and bedrock		2024-11-12 20:02:00 -08:00
..
apis	Inference to use provider resource id to register and validate (#428 )	2024-11-12 20:02:00 -08:00
cli	Rename all inline providers with an inline:: prefix (#423 )	2024-11-11 22:19:16 -08:00
distribution	Inference to use provider resource id to register and validate (#428 )	2024-11-12 20:02:00 -08:00
providers	Inference to use provider resource id to register and validate (#428 )	2024-11-12 20:02:00 -08:00
scripts	Add a test for CLI, but not fully done so disabled	2024-09-19 13:27:07 -07:00
templates	Update provider types and prefix with inline::	2024-11-12 12:54:44 -08:00
__init__.py	API Updates (#73 )	2024-09-17 19:51:35 -07:00