forked from phoenix-oss/llama-stack-mirror
# What does this PR do?

Fixes https://github.com/meta-llama/llama-stack/issues/697

## Test Plan

Run the 405b model. `accounts/fireworks/models/<model_id>` is the full model name for Fireworks; `fireworks/<model_id>` is just a shorthand and sometimes has routing issues.

## Sources

Please link relevant resources if necessary.

## Before submitting

- [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
- [ ] Ran pre-commit to handle lint / formatting issues.
- [ ] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section?
- [ ] Updated relevant documentation.
- [ ] Wrote necessary unit or integration tests.
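
The naming note in the Test Plan can be illustrated with a minimal sketch. This is not the code changed by this PR: the helper `to_full_fireworks_model_name` and the model id `example-model` are hypothetical, and the only facts taken from the description are the two prefixes `accounts/fireworks/models/` and `fireworks/`.

```python
# Hypothetical helper, for illustration only; not part of llama-stack.
FULL_PREFIX = "accounts/fireworks/models/"  # full Fireworks model name prefix
SHORT_PREFIX = "fireworks/"                 # shorthand prefix


def to_full_fireworks_model_name(name: str) -> str:
    """Return the full Fireworks model name for either accepted form."""
    if name.startswith(FULL_PREFIX):
        # Already the full name; leave it unchanged.
        return name
    if name.startswith(SHORT_PREFIX):
        # Expand the shorthand by swapping the prefix.
        return FULL_PREFIX + name[len(SHORT_PREFIX):]
    raise ValueError(f"not a Fireworks model name: {name!r}")


if __name__ == "__main__":
    # "example-model" is a placeholder id, not a real model.
    print(to_full_fireworks_model_name("fireworks/example-model"))
    # prints: accounts/fireworks/models/example-model
```

Expanding the shorthand up front means the rest of the code only ever sees one form of the name, which is one way to sidestep the routing issues mentioned above.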

| Name |
|---|
| bedrock |
| cerebras |
| databricks |
| fireworks |
| groq |
| nvidia |
| ollama |
| sample |
| tgi |
| together |
| vllm |
| __init__.py |