llama-stack/llama_stack/providers/remote/inference
Henry Tu 0e2a99e223
Update Cerebras from Llama 3.1 to 3.3 (#645)
# What does this PR do?

Cerebras is rolling out support for llama 3.3 70b and deprecating llama
3.1 70b. This PR updates the documentation, config, and internal mapping
to reflect this change.

cc: @ashwinb @raghotham
2024-12-17 16:28:24 -08:00
..
bedrock Update the "InterleavedTextMedia" type (#635) 2024-12-17 11:18:31 -08:00
cerebras Update Cerebras from Llama 3.1 to 3.3 (#645) 2024-12-17 16:28:24 -08:00
databricks Update the "InterleavedTextMedia" type (#635) 2024-12-17 11:18:31 -08:00
fireworks Fix conversion to RawMessage everywhere 2024-12-17 14:00:43 -08:00
nvidia Update the "InterleavedTextMedia" type (#635) 2024-12-17 11:18:31 -08:00
ollama Fix conversion to RawMessage everywhere 2024-12-17 14:00:43 -08:00
sample migrate model to Resource and new registration signature (#410) 2024-11-08 16:12:57 -08:00
tgi Fix conversion to RawMessage everywhere 2024-12-17 14:00:43 -08:00
together Fix conversion to RawMessage everywhere 2024-12-17 14:00:43 -08:00
vllm Fix conversion to RawMessage everywhere 2024-12-17 14:00:43 -08:00
__init__.py impls -> inline, adapters -> remote (#381) 2024-11-06 14:54:05 -08:00