forked from phoenix-oss/llama-stack-mirror

# What does this PR do?

Generate distro reports to cover inference, agents, and vector_io. 


## Test Plan

Report generated by running `/opt/miniconda3/envs/stack/bin/pytest -s -v tests/client-sdk/ --report`.
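
The reporting code behind `--report` is not shown in this PR description. As a rough illustration only, the sketch below shows how per-test outcomes could be rendered into the Markdown tables used in the report. `ReportRow` and `render_report` are hypothetical names, not part of llama-stack.

```python
from dataclasses import dataclass


@dataclass
class ReportRow:
    """One test outcome (hypothetical structure, for illustration only)."""
    model: str
    api: str
    capability: str
    test: str
    passed: bool


def render_report(rows):
    """Render rows as a Markdown table with a check or cross per test."""
    lines = [
        "| Model | API | Capability | Test | Status |",
        "|:---|:---|:---|:---|:---|",
    ]
    for r in rows:
        status = "✅" if r.passed else "❌"
        lines.append(
            f"| {r.model} | {r.api} | {r.capability} | {r.test} | {status} |"
        )
    return "\n".join(lines)


if __name__ == "__main__":
    demo = [
        ReportRow(
            "Llama-3.1-8B-Instruct",
            "/chat_completion",
            "streaming",
            "test_text_chat_completion_streaming",
            True,
        ),
    ]
    print(render_report(demo))
```

In a real pytest plugin the rows would presumably be collected from test outcomes during the run. That wiring is omitted here because the actual implementation is not part of this description.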


## Sources

Please link relevant resources if necessary.


## Before submitting

- [ ] This PR fixes a typo or improves the docs (you can dismiss the
other checks if that's the case).
- [ ] Ran pre-commit to handle lint / formatting issues.
- [ ] Read the [contributor
guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md),
      Pull Request section?
- [ ] Updated relevant documentation.
- [ ] Wrote necessary unit or integration tests.

2025-01-22 19:20:49 -08:00


Report for ollama distribution

Supported Models:

| Model Descriptor | ollama |
|:---|:---|
| Llama-3-8B-Instruct | ❌ |
| Llama-3-70B-Instruct | ❌ |
| Llama3.1-8B-Instruct | ✅ |
| Llama3.1-70B-Instruct | ✅ |
| Llama3.1-405B-Instruct | ✅ |
| Llama3.2-1B-Instruct | ✅ |
| Llama3.2-3B-Instruct | ✅ |
| Llama3.2-11B-Vision-Instruct | ✅ |
| Llama3.2-90B-Vision-Instruct | ✅ |
| Llama3.3-70B-Instruct | ✅ |
| Llama-Guard-3-11B-Vision | ❌ |
| Llama-Guard-3-1B | ✅ |
| Llama-Guard-3-8B | ✅ |
| Llama-Guard-2-8B | ❌ |

Inference:

| Model | API | Capability | Test | Status |
|:---|:---|:---|:---|:---|
| Llama-3.1-8B-Instruct | /chat_completion | streaming | test_text_chat_completion_streaming | ✅ |
| Llama-3.2-11B-Vision-Instruct | /chat_completion | streaming | test_image_chat_completion_streaming | ❌ |
| Llama-3.2-11B-Vision-Instruct | /chat_completion | non_streaming | test_image_chat_completion_non_streaming | ❌ |
| Llama-3.1-8B-Instruct | /chat_completion | non_streaming | test_text_chat_completion_non_streaming | ✅ |
| Llama-3.1-8B-Instruct | /chat_completion | tool_calling | test_text_chat_completion_with_tool_calling_and_streaming | ✅ |
| Llama-3.1-8B-Instruct | /chat_completion | tool_calling | test_text_chat_completion_with_tool_calling_and_non_streaming | ✅ |
| Llama-3.1-8B-Instruct | /completion | streaming | test_text_completion_streaming | ✅ |
| Llama-3.1-8B-Instruct | /completion | non_streaming | test_text_completion_non_streaming | ✅ |
| Llama-3.1-8B-Instruct | /completion | structured_output | test_text_completion_structured_output | ✅ |
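
To read the inference results at a glance, the rows above can be tallied per model. This is a small sketch for the reader, not part of the report generator. The `(model, passed)` pairs are transcribed from the table, and `tally` is a hypothetical helper.

```python
from collections import Counter

# (model, passed) pairs transcribed from the Inference table, in row order.
INFERENCE_RESULTS = [
    ("Llama-3.1-8B-Instruct", True),           # test_text_chat_completion_streaming
    ("Llama-3.2-11B-Vision-Instruct", False),  # test_image_chat_completion_streaming
    ("Llama-3.2-11B-Vision-Instruct", False),  # test_image_chat_completion_non_streaming
    ("Llama-3.1-8B-Instruct", True),           # test_text_chat_completion_non_streaming
    ("Llama-3.1-8B-Instruct", True),           # ..._with_tool_calling_and_streaming
    ("Llama-3.1-8B-Instruct", True),           # ..._with_tool_calling_and_non_streaming
    ("Llama-3.1-8B-Instruct", True),           # test_text_completion_streaming
    ("Llama-3.1-8B-Instruct", True),           # test_text_completion_non_streaming
    ("Llama-3.1-8B-Instruct", True),           # test_text_completion_structured_output
]


def tally(results):
    """Return {model: (passed_count, failed_count)} for the given results."""
    passed, failed = Counter(), Counter()
    for model, ok in results:
        (passed if ok else failed)[model] += 1
    return {m: (passed[m], failed[m]) for m in sorted(set(passed) | set(failed))}


summary = tally(INFERENCE_RESULTS)
for model, (ok, bad) in summary.items():
    print(f"{model}: {ok} passed, {bad} failed")
# Llama-3.1-8B-Instruct: 7 passed, 0 failed
# Llama-3.2-11B-Vision-Instruct: 0 passed, 2 failed
```

All text tests pass on Llama-3.1-8B-Instruct, while both image chat completion tests fail on Llama-3.2-11B-Vision-Instruct.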

Vector_io:

| API | Capability | Test | Status |
|:---|:---|:---|:---|
| /retrieve |  | test_vector_db_retrieve | ✅ |

Agents:

| API | Capability | Test | Status |
|:---|:---|:---|:---|
| /create_agent_turn | rag | test_rag_agent | ✅ |
| /create_agent_turn | custom_tool | test_custom_tool | ✅ |
| /create_agent_turn | code_execution | test_code_interpreter_for_attachments | ✅ |
