llama-stack

505 commits 21 branches 64 tags 62 MiB

Author	SHA1	Message	Date
Dinesh Yeduguru	fdff24e77a	Inference to use provider resource id to register and validate (#428 ) This PR changes the way model id gets translated to the final model name that gets passed through the provider. Major changes include: 1) Providers are responsible for registering an object and as part of the registration returning the object with the correct provider specific name of the model provider_resource_id 2) To help with the common look ups different names a new ModelLookup class is created. Tested all inference providers including together, fireworks, vllm, ollama, meta reference and bedrock	2024-11-12 20:02:00 -08:00
Xi Yan	ba82021d4b	precommit	2024-11-08 17:58:58 -08:00
Ashwin Bharambe	694c142b89	Add provider deprecation support; change directory structure (#397 ) * Add provider deprecation support; change directory structure * fix a couple dangling imports * move the meta_reference safety dir also	2024-11-07 13:04:53 -08:00

Author

SHA1

Message

Date

Dinesh Yeduguru

fdff24e77a

Inference to use provider resource id to register and validate (#428 )

This PR changes the way model id gets translated to the final model name
that gets passed through the provider.
Major changes include:
1) Providers are responsible for registering an object and as part of
the registration returning the object with the correct provider specific
name of the model provider_resource_id
2) To help with the common look ups different names a new ModelLookup
class is created.



Tested all inference providers including together, fireworks, vllm,
ollama, meta reference and bedrock

2024-11-12 20:02:00 -08:00

Xi Yan

ba82021d4b

precommit

2024-11-08 17:58:58 -08:00

Ashwin Bharambe

694c142b89

Add provider deprecation support; change directory structure (#397 )

* Add provider deprecation support; change directory structure

* fix a couple dangling imports

* move the meta_reference safety dir also

2024-11-07 13:04:53 -08:00

Renamed from llama_stack/providers/inline/meta_reference/inference/generation.py (Browse further)

3 commits