forked from phoenix-oss/llama-stack-mirror
This PR changes the way model id gets translated to the final model name that gets passed through the provider. Major changes include: 1) Providers are responsible for registering an object and as part of the registration returning the object with the correct provider specific name of the model provider_resource_id 2) To help with the common look ups different names a new ModelLookup class is created. Tested all inference providers including together, fireworks, vllm, ollama, meta reference and bedrock |
||
|---|---|---|
| .. | ||
| quantization | ||
| __init__.py | ||
| config.py | ||
| generation.py | ||
| inference.py | ||
| model_parallel.py | ||
| parallel_utils.py | ||