forked from phoenix-oss/llama-stack-mirror
# What does this PR do? Generate distro reports to cover inference, agents, and vector_io. ## Test Plan Report generated through `/opt/miniconda3/envs/stack/bin/pytest -s -v tests/client-sdk/ --report` ## Sources Please link relevant resources if necessary. ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [ ] Ran pre-commit to handle lint / formatting issues. - [ ] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section? - [ ] Updated relevant documentation. - [ ] Wrote necessary unit or integration tests.
2 KiB
2 KiB
Report for ollama distribution
Supported Models:
Model Descriptor | ollama |
---|---|
Llama-3-8B-Instruct | ❌ |
Llama-3-70B-Instruct | ❌ |
Llama3.1-8B-Instruct | ✅ |
Llama3.1-70B-Instruct | ✅ |
Llama3.1-405B-Instruct | ✅ |
Llama3.2-1B-Instruct | ✅ |
Llama3.2-3B-Instruct | ✅ |
Llama3.2-11B-Vision-Instruct | ✅ |
Llama3.2-90B-Vision-Instruct | ✅ |
Llama3.3-70B-Instruct | ✅ |
Llama-Guard-3-11B-Vision | ❌ |
Llama-Guard-3-1B | ✅ |
Llama-Guard-3-8B | ✅ |
Llama-Guard-2-8B | ❌ |
Inference:
Model | API | Capability | Test | Status |
---|---|---|---|---|
Llama-3.1-8B-Instruct | /chat_completion | streaming | test_text_chat_completion_streaming | ✅ |
Llama-3.2-11B-Vision-Instruct | /chat_completion | streaming | test_image_chat_completion_streaming | ❌ |
Llama-3.2-11B-Vision-Instruct | /chat_completion | non_streaming | test_image_chat_completion_non_streaming | ❌ |
Llama-3.1-8B-Instruct | /chat_completion | non_streaming | test_text_chat_completion_non_streaming | ✅ |
Llama-3.1-8B-Instruct | /chat_completion | tool_calling | test_text_chat_completion_with_tool_calling_and_streaming | ✅ |
Llama-3.1-8B-Instruct | /chat_completion | tool_calling | test_text_chat_completion_with_tool_calling_and_non_streaming | ✅ |
Llama-3.1-8B-Instruct | /completion | streaming | test_text_completion_streaming | ✅ |
Llama-3.1-8B-Instruct | /completion | non_streaming | test_text_completion_non_streaming | ✅ |
Llama-3.1-8B-Instruct | /completion | structured_output | test_text_completion_structured_output | ✅ |
Vector_io:
API | Capability | Test | Status |
---|---|---|---|
/retrieve | test_vector_db_retrieve | ✅ |
Agents:
API | Capability | Test | Status |
---|---|---|---|
/create_agent_turn | rag | test_rag_agent | ✅ |
/create_agent_turn | custom_tool | test_custom_tool | ✅ |
/create_agent_turn | code_execution | test_code_interpreter_for_attachments | ✅ |