llama-stack

forked from phoenix-oss/llama-stack-mirror

Dinesh Yeduguru fdff24e77a Inference to use provider resource id to register and validate (#428)	2024-11-12 20:02:00 -08:00

This PR changes how a model id is translated into the final model name that is passed to the provider. Major changes include:
1) Providers are responsible for registering an object and, as part of the registration, returning the object with the correct provider-specific name of the model (provider_resource_id).
2) To help with common lookups across the different names, a new ModelLookup class is created.

Tested all inference providers, including together, fireworks, vllm, ollama, meta reference and bedrock.

bedrock	Inference to use provider resource id to register and validate (#428)	2024-11-12 20:02:00 -08:00
databricks	Inference to use provider resource id to register and validate (#428)	2024-11-12 20:02:00 -08:00
fireworks	Inference to use provider resource id to register and validate (#428)	2024-11-12 20:02:00 -08:00
ollama	Inference to use provider resource id to register and validate (#428)	2024-11-12 20:02:00 -08:00
sample	migrate model to Resource and new registration signature (#410)	2024-11-08 16:12:57 -08:00
tgi	migrate model to Resource and new registration signature (#410)	2024-11-08 16:12:57 -08:00
together	Inference to use provider resource id to register and validate (#428)	2024-11-12 20:02:00 -08:00
vllm	Inference to use provider resource id to register and validate (#428)	2024-11-12 20:02:00 -08:00
__init__.py	`impls` -> `inline`, `adapters` -> `remote` (#381)	2024-11-06 14:54:05 -08:00
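
The registration flow described in the commit message for #428 (a provider registers a model and returns it with its provider-specific name filled in, helped by a lookup over alternative names) might look roughly like the sketch below. This is an illustration only and does not reproduce the actual llama-stack code: `Model`, `ModelAlias`, `SketchModelLookup`, `SketchProvider`, and the model names are all hypothetical, and the real `ModelLookup` class may be shaped quite differently.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Model:
    # Hypothetical stand-in for a model resource; not the real llama-stack type.
    identifier: str
    provider_resource_id: Optional[str] = None


@dataclass
class ModelAlias:
    # Hypothetical: one provider-specific name plus the other names it answers to.
    provider_model_id: str
    aliases: List[str] = field(default_factory=list)


class SketchModelLookup:
    """Maps any known name of a model to the provider-specific name (assumed design)."""

    def __init__(self, entries: List[ModelAlias]) -> None:
        self._to_provider_id: Dict[str, str] = {}
        for entry in entries:
            # The provider-specific name always resolves to itself.
            self._to_provider_id[entry.provider_model_id] = entry.provider_model_id
            for alias in entry.aliases:
                self._to_provider_id[alias] = entry.provider_model_id

    def get_provider_model_id(self, name: str) -> Optional[str]:
        # Returns None when the name is unknown to this provider.
        return self._to_provider_id.get(name)


class SketchProvider:
    """Hypothetical provider that validates and resolves names at registration time."""

    def __init__(self, lookup: SketchModelLookup) -> None:
        self._lookup = lookup

    def register_model(self, model: Model) -> Model:
        provider_id = self._lookup.get_provider_model_id(model.identifier)
        if provider_id is None:
            raise ValueError(f"model {model.identifier!r} is not supported by this provider")
        # Return the object with the provider-specific name filled in.
        model.provider_resource_id = provider_id
        return model


if __name__ == "__main__":
    lookup = SketchModelLookup(
        [ModelAlias("vendor/example-8b-instruct", aliases=["example-8b-instruct"])]
    )
    provider = SketchProvider(lookup)
    registered = provider.register_model(Model(identifier="example-8b-instruct"))
    print(registered.provider_resource_id)
```

Resolving the name once, at registration, means later inference calls in this sketch could read `provider_resource_id` directly instead of repeating the translation on every request.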