llama-stack/llama_stack/providers/tests/inference
Dinesh Yeduguru fdff24e77a
Inference to use provider resource id to register and validate (#428)
This PR changes how a model id gets translated into the final model name
that is passed to the provider.
Major changes include:
1) Providers are responsible for registering an object; as part of
registration they return the object with the correct provider-specific
model name set in provider_resource_id.
2) To help with common lookups across the different names, a new
ModelLookup class is created.

Tested all inference providers, including Together, Fireworks, vLLM,
Ollama, meta-reference, and Bedrock.
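A minimal sketch of the two ideas in this change. The class and field names below other than ModelLookup and provider_resource_id (e.g. Model, register_model, the alias table) are illustrative assumptions, not the actual llama-stack API: a provider's registration step returns the object with the provider-specific name filled into provider_resource_id, and a lookup helper resolves either name back to the registered object.

```python
from dataclasses import dataclass


@dataclass
class Model:
    identifier: str                # user-facing model id (assumed field name)
    provider_resource_id: str = "" # provider-specific model name


class ModelLookup:
    """Hypothetical lookup: resolves a registered model by either
    its user-facing identifier or its provider_resource_id."""

    def __init__(self, models):
        self._by_alias = {}
        for m in models:
            self._by_alias[m.identifier] = m
            self._by_alias[m.provider_resource_id] = m

    def get(self, name: str) -> Model:
        if name not in self._by_alias:
            raise ValueError(f"unknown model: {name}")
        return self._by_alias[name]


def register_model(model: Model) -> Model:
    # Illustrative provider-side registration: map the generic id to
    # this provider's naming scheme and return the updated object.
    aliases = {"llama3.1-8b": "meta-llama/Llama-3.1-8B-Instruct"}
    model.provider_resource_id = aliases.get(model.identifier, model.identifier)
    return model
```

With this split, callers register a model once and can then address it by whichever name they know, while the provider always receives its own resource id.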
2024-11-12 20:02:00 -08:00
__init__.py Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
conftest.py Enable vision models for (Together, Fireworks, Meta-Reference, Ollama) (#376) 2024-11-05 16:22:33 -08:00
fixtures.py Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
pasta.jpeg Enable vision models for (Together, Fireworks, Meta-Reference, Ollama) (#376) 2024-11-05 16:22:33 -08:00
test_prompt_adapter.py Added tests for persistence (#274) 2024-10-22 19:41:46 -07:00
test_text_inference.py Inference to use provider resource id to register and validate (#428) 2024-11-12 20:02:00 -08:00
test_vision_inference.py remote::vllm now works with vision models 2024-11-06 16:07:17 -08:00
utils.py Enable vision models for (Together, Fireworks, Meta-Reference, Ollama) (#376) 2024-11-05 16:22:33 -08:00