llama-stack-mirror/llama_stack/templates/ramalama/report.md
Daniel J Walsh c9a41288a3 feat: RamaLama Documentation and Templates
RamaLama is a fully open source AI model tool that facilitates
local management of AI models.

https://github.com/containers/ramalama

It supports pulling models from HuggingFace, Ollama, and OCI images,
and via file://, http://, and https:// URIs.

It uses the llama.cpp and vLLM AI engines to run the models.

By default, it runs the models inside containers.

Signed-off-by: Charlie Doern <cdoern@redhat.com>
2025-04-18 12:55:52 -04:00


Report for ramalama distribution

Supported Models

| Model Descriptor | ramalama |
|:---|:---|
| Llama-3-8B-Instruct | |
| Llama-3-70B-Instruct | |
| Llama3.1-8B-Instruct | |
| Llama3.1-70B-Instruct | |
| Llama3.1-405B-Instruct | |
| Llama3.2-1B-Instruct | |
| Llama3.2-3B-Instruct | |
| Llama3.2-11B-Vision-Instruct | |
| Llama3.2-90B-Vision-Instruct | |
| Llama3.3-70B-Instruct | |
| Llama-Guard-3-11B-Vision | |
| Llama-Guard-3-1B | |
| Llama-Guard-3-8B | |
| Llama-Guard-2-8B | |

Inference

| Model | API | Capability | Test | Status |
|:---|:---|:---|:---|:---|
| Llama-3.1-8B-Instruct | /chat_completion | streaming | test_text_chat_completion_streaming | |
| Llama-3.2-11B-Vision-Instruct | /chat_completion | streaming | test_image_chat_completion_streaming | |
| Llama-3.2-11B-Vision-Instruct | /chat_completion | non_streaming | test_image_chat_completion_non_streaming | |
| Llama-3.1-8B-Instruct | /chat_completion | non_streaming | test_text_chat_completion_non_streaming | |
| Llama-3.1-8B-Instruct | /chat_completion | tool_calling | test_text_chat_completion_with_tool_calling_and_streaming | |
| Llama-3.1-8B-Instruct | /chat_completion | tool_calling | test_text_chat_completion_with_tool_calling_and_non_streaming | |
| Llama-3.1-8B-Instruct | /completion | streaming | test_text_completion_streaming | |
| Llama-3.1-8B-Instruct | /completion | non_streaming | test_text_completion_non_streaming | |
| Llama-3.1-8B-Instruct | /completion | structured_output | test_text_completion_structured_output | |
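
To see at a glance how the inference tests are spread across APIs and capabilities, the rows above can be grouped programmatically. The following is a minimal sketch, not part of llama-stack or RamaLama: the helper name `group_tests` and the `INFERENCE_ROWS` constant are hypothetical, and the rows are copied by hand from the table above. The Status column is not represented because no status values are recorded here.

```python
from collections import defaultdict

# Rows copied from the Inference table: (model, API, capability, test name).
INFERENCE_ROWS = [
    ("Llama-3.1-8B-Instruct", "/chat_completion", "streaming",
     "test_text_chat_completion_streaming"),
    ("Llama-3.2-11B-Vision-Instruct", "/chat_completion", "streaming",
     "test_image_chat_completion_streaming"),
    ("Llama-3.2-11B-Vision-Instruct", "/chat_completion", "non_streaming",
     "test_image_chat_completion_non_streaming"),
    ("Llama-3.1-8B-Instruct", "/chat_completion", "non_streaming",
     "test_text_chat_completion_non_streaming"),
    ("Llama-3.1-8B-Instruct", "/chat_completion", "tool_calling",
     "test_text_chat_completion_with_tool_calling_and_streaming"),
    ("Llama-3.1-8B-Instruct", "/chat_completion", "tool_calling",
     "test_text_chat_completion_with_tool_calling_and_non_streaming"),
    ("Llama-3.1-8B-Instruct", "/completion", "streaming",
     "test_text_completion_streaming"),
    ("Llama-3.1-8B-Instruct", "/completion", "non_streaming",
     "test_text_completion_non_streaming"),
    ("Llama-3.1-8B-Instruct", "/completion", "structured_output",
     "test_text_completion_structured_output"),
]


def group_tests(rows):
    """Group test names by (API, capability). Hypothetical helper."""
    grouped = defaultdict(list)
    for _model, api, capability, test in rows:
        grouped[(api, capability)].append(test)
    return dict(grouped)


if __name__ == "__main__":
    # Print one summary line per (API, capability) pair.
    for (api, capability), tests in sorted(group_tests(INFERENCE_ROWS).items()):
        print(f"{api} {capability}: {len(tests)} test(s)")
```

Keying on the (API, capability) pair rather than on the model keeps the text and vision variants of the same capability in one bucket, which matches how the table is organized.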

Vector IO

| API | Capability | Test | Status |
|:---|:---|:---|:---|
| /retrieve | | test_vector_db_retrieve | |

Agents

| API | Capability | Test | Status |
|:---|:---|:---|:---|
| /create_agent_turn | rag | test_rag_agent | |
| /create_agent_turn | custom_tool | test_custom_tool | |
| /create_agent_turn | code_execution | test_code_interpreter_for_attachments | |