llama-stack-mirror/llama_stack/templates/cerebras/report.md
2025-05-01 14:25:07 -07:00

2.1 KiB

Report for cerebras distribution

Supported Models

Model Descriptor cerebras
meta-llama/Llama-3-8B-Instruct
meta-llama/Llama-3-70B-Instruct
meta-llama/Llama-3.1-8B-Instruct
meta-llama/Llama-3.1-70B-Instruct
meta-llama/Llama-3.1-405B-Instruct-FP8
meta-llama/Llama-3.2-1B-Instruct
meta-llama/Llama-3.2-3B-Instruct
meta-llama/Llama-3.2-11B-Vision-Instruct
meta-llama/Llama-3.2-90B-Vision-Instruct
meta-llama/Llama-3.3-70B-Instruct
meta-llama/Llama-Guard-3-11B-Vision
meta-llama/Llama-Guard-3-1B
meta-llama/Llama-Guard-3-8B
meta-llama/Llama-Guard-2-8B

Inference

Model API Capability Test Status
Llama-3.1-8B-Instruct /chat_completion streaming test_text_chat_completion_streaming
Llama-3.2-11B-Vision-Instruct /chat_completion streaming test_image_chat_completion_streaming
Llama-3.2-11B-Vision-Instruct /chat_completion non_streaming test_image_chat_completion_non_streaming
Llama-3.1-8B-Instruct /chat_completion non_streaming test_text_chat_completion_non_streaming
Llama-3.1-8B-Instruct /chat_completion tool_calling test_text_chat_completion_with_tool_calling_and_streaming
Llama-3.1-8B-Instruct /chat_completion tool_calling test_text_chat_completion_with_tool_calling_and_non_streaming
Llama-3.1-8B-Instruct /completion streaming test_text_completion_streaming
Llama-3.1-8B-Instruct /completion non_streaming test_text_completion_non_streaming
Llama-3.1-8B-Instruct /completion structured_output test_text_completion_structured_output

Vector IO

API Capability Test Status
/retrieve test_vector_db_retrieve

Agents

API Capability Test Status
/create_agent_turn rag test_rag_agent
/create_agent_turn custom_tool test_custom_tool