llama-stack/llama_stack/providers/remote/inference
Xi Yan 7a4383e4c1
add 3.3 to together inference provider (#729)
# What does this PR do?

- Add the Llama 3.3 model to the Together inference provider (see the registry sketch below).
- Fix the Fireworks `distro_codegen` output; distribution templates were regenerated with:

```
python llama_stack/scripts/distro_codegen.py
```
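
For context, adding the model comes down to a new alias in the Together provider's model registry. Below is a minimal sketch of that kind of entry, assuming the `build_model_alias` helper and `MODEL_ALIASES` list used by the remote inference providers; the Together-side identifier shown is an assumption, not a verbatim quote from the diff.

```python
# Sketch of the registry entry this PR adds (identifiers are assumptions).
from llama_models.datatypes import CoreModelId
from llama_stack.providers.utils.inference.model_registry import build_model_alias

MODEL_ALIASES = [
    # ... existing Together aliases ...
    build_model_alias(
        "meta-llama/Llama-3.3-70B-Instruct-Turbo",  # assumed Together-side model name
        CoreModelId.llama3_3_70b_instruct.value,    # canonical Llama Stack model id
    ),
]
```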

## Test Plan

<img width="1132" alt="image"
src="https://github.com/user-attachments/assets/bf94b933-9200-4e73-878e-d1a95d450a88"
/>

**Tests**
```
pytest -v -s -k "together" --inference-model="meta-llama/Llama-3.3-70B-Instruct" ./llama_stack/providers/tests/inference/test_text_inference.py
```
<img width="1139" alt="image"
src="https://github.com/user-attachments/assets/407dc98b-8de3-4841-8cb1-75e4b5128544"
/>
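
For reference, the filtered tests amount to a chat-completion round trip against the new model. A minimal standalone smoke test against a locally running stack, using the `llama_stack_client` package, would look roughly like this (the base URL is an assumption):

```python
# Minimal smoke-test sketch (not the repo's test suite): one chat
# completion against the newly registered model served by Together.
from llama_stack_client import LlamaStackClient

client = LlamaStackClient(base_url="http://localhost:5000")  # assumed local stack endpoint

response = client.inference.chat_completion(
    model_id="meta-llama/Llama-3.3-70B-Instruct",
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.completion_message.content)
```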


## Before submitting

- [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
- [ ] Ran pre-commit to handle lint / formatting issues.
- [ ] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section?
- [ ] Updated relevant documentation.
- [ ] Wrote necessary unit or integration tests.
2025-01-06 15:39:41 -08:00
| Name | Last commit | Date |
|------|-------------|------|
| bedrock | [remove import *] clean up import *'s (#689) | 2024-12-27 15:45:44 -08:00 |
| cerebras | Redact sensitive information from configs when printing, etc. | 2025-01-02 13:54:02 -08:00 |
| databricks | [remove import *] clean up import *'s (#689) | 2024-12-27 15:45:44 -08:00 |
| fireworks | add 3.3 to together inference provider (#729) | 2025-01-06 15:39:41 -08:00 |
| groq | Fix Groq invalid self.config reference (#719) | 2025-01-03 15:47:10 -08:00 |
| nvidia | Redact sensitive information from configs when printing, etc. | 2025-01-02 13:54:02 -08:00 |
| ollama | Add JSON structured outputs to Ollama Provider (#680) | 2025-01-02 09:05:51 -08:00 |
| sample | [remove import *] clean up import *'s (#689) | 2024-12-27 15:45:44 -08:00 |
| tgi | Redact sensitive information from configs when printing, etc. | 2025-01-02 13:54:02 -08:00 |
| together | add 3.3 to together inference provider (#729) | 2025-01-06 15:39:41 -08:00 |
| vllm | Fix assert message and call to completion_request_to_prompt in remote:vllm (#709) | 2025-01-03 13:44:49 -08:00 |
| `__init__.py` | impls -> inline, adapters -> remote (#381) | 2024-11-06 14:54:05 -08:00 |