llama-stack-mirror/llama_stack/distribution
Wen Zhou 8c3f2762fb
build: update temp. created Containerfile (#2492)
# What does this PR do?
- conditionally create the `/.llama/providers.d` folder only when
`external_providers_dir` is set (see the sketch below)
- do not create the `/.cache` folder, since it is not used anywhere
- combine the chmod and copy steps into one command
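
For context, a minimal sketch of what the generator change could look like in `build_container.sh`. The `add_to_container` helper and the `external_providers_dir` variable are assumptions based on reading the script, not the PR's verbatim diff:

```bash
# Only emit the providers.d mount point when external providers are configured.
if [ -n "$external_providers_dir" ]; then
  add_to_container << EOF
RUN mkdir -p /.llama/providers.d
EOF
fi

# Combining copy and chmod into one instruction drops the separate RUN layer;
# the file name is illustrative, and 0664 is the octal equivalent of adding
# group read/write (g+rw) on a regular file.
add_to_container << EOF
COPY --chmod=0664 run.yaml /app/run.yaml
EOF
```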



## Test Plan
Updated test:

```
export CONTAINER_BINARY=podman
LLAMA_STACK_DIR=. uv run llama stack build --template remote-vllm --image-type container --image-name <name>
```
Log:
```
Containerfile created successfully in /tmp/tmp.rPMunE39Aw/Containerfile

FROM python:3.11-slim
WORKDIR /app

RUN apt-get update && apt-get install -y        iputils-ping net-tools iproute2 dnsutils telnet        curl wget telnet git       procps psmisc lsof        traceroute        bubblewrap        gcc        && rm -rf /var/lib/apt/lists/*

ENV UV_SYSTEM_PYTHON=1
RUN pip install uv
RUN uv pip install --no-cache sentencepiece pillow pypdf transformers pythainlp faiss-cpu opentelemetry-sdk requests datasets chardet scipy nltk numpy matplotlib psycopg2-binary aiosqlite langdetect autoevals tree_sitter tqdm pandas chromadb-client opentelemetry-exporter-otlp-proto-http redis scikit-learn openai pymongo emoji sqlalchemy[asyncio] mcp aiosqlite fastapi fire httpx uvicorn opentelemetry-sdk opentelemetry-exporter-otlp-proto-http
RUN uv pip install --no-cache sentence-transformers --no-deps
RUN uv pip install --no-cache torch torchvision --index-url https://download.pytorch.org/whl/cpu
# Allows running as non-root user
RUN mkdir -p /.llama/providers.d /.cache
RUN uv pip install --no-cache llama-stack
RUN pip uninstall -y uv
ENTRYPOINT ["python", "-m", "llama_stack.distribution.server.server", "--template", "remote-vllm"]

RUN chmod -R g+rw /app /.llama /.cache

PWD: /tmp/llama-stack
Containerfile: /tmp/tmp.rPMunE39Aw/Containerfile
+ podman build --progress=plain --security-opt label=disable --platform linux/amd64 -t distribution-remote-vllm:0.2.12 -f /tmp/tmp.rPMunE39Aw/Containerfile /tmp/llama-stack
....
Success!
Build Successful!
You can find the newly-built template here: /tmp/llama-stack/llama_stack/templates/remote-vllm/run.yaml
You can run the new Llama Stack distro via: llama stack run /tmp/llama-stack/llama_stack/templates/remote-vllm/run.yaml --image-type container
```
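
A quick way to sanity-check the directories inside the built image (the tag comes from the build log above; `stat` is available in the Debian-based `python:3.11-slim` base):

```bash
# Inspect the permissions the Containerfile sets up, and whether /.cache exists.
podman run --rm --entrypoint sh distribution-remote-vllm:0.2.12 -c \
  'stat -c "%A %n" /app /.llama; ls -ld /.cache 2>/dev/null || echo "/.cache absent"'
```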

```
podman tag localhost/distribution-remote-vllm:dev quay.io/wenzhou/distribution-remote-vllm:2492_2
podman push quay.io/wenzhou/distribution-remote-vllm:2492_2

docker run --rm -p 8321:8321 -e INFERENCE_MODEL="meta-llama/Llama-2-7b-chat-hf" -e VLLM_URL="http://localhost:8000/v1" quay.io/wenzhou/distribution-remote-vllm:2492_2 --port 8321

INFO     2025-06-26 13:47:31,813 __main__:436 server: Using template remote-vllm config file:                                                         
         /app/llama-stack-source/llama_stack/templates/remote-vllm/run.yaml                                                                           
INFO     2025-06-26 13:47:31,818 __main__:438 server: Run configuration:                                                                              
INFO     2025-06-26 13:47:31,826 __main__:440 server: apis:                                                                                           
         - agents                                                                                                                                     
         - datasetio                                                                                                                                  
         - eval                                                                                                                                       
         - inference                                                                                                                                  
         - safety                                                                                                                                     
         - scoring                                                                                                                                    
         - telemetry                                                                                                                                  
         - tool_runtime                                                                                                                               
         - vector_io                                                                                                                                  
         benchmarks: []                                                                                                                               
         container_image: null                                                                                                                        
....                                                                                                 
```
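
Once the container is up, a small smoke test against the published port; `/v1/health` and `/v1/models` are the llama-stack server routes as I understand them, so treat the exact paths as assumptions:

```bash
# The docker run above maps the server to localhost:8321.
curl -s http://localhost:8321/v1/health
curl -s http://localhost:8321/v1/models
```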
-----
Previous test:
local run: `llama stack build --template remote-vllm --image-type container`
image stored in `quay.io/wenzhou/distribution-remote-vllm:2492`

---------

Signed-off-by: Wen Zhou <wenzhou@redhat.com>
2025-06-27 10:23:12 +02:00
| Name | Last commit | Date |
|------|-------------|------|
| `access_control` | fix: finish conversion to StrEnum (#2514) | 2025-06-26 08:01:26 +05:30 |
| `routers` | chore: remove nested imports (#2515) | 2025-06-26 08:01:05 +05:30 |
| `routing_tables` | feat: fine grained access control policy (#2264) | 2025-06-03 14:51:12 -07:00 |
| `server` | feat: drop python 3.10 support (#2469) | 2025-06-19 12:07:14 +05:30 |
| `store` | fix(tools): do not index tools, only index toolgroups (#2261) | 2025-05-25 13:27:52 -07:00 |
| `ui` | chore: remove nested imports (#2515) | 2025-06-26 08:01:05 +05:30 |
| `utils` | refactor: remove container from list of run image types (#2178) | 2025-06-02 09:57:55 +02:00 |
| `__init__.py` | API Updates (#73) | 2024-09-17 19:51:35 -07:00 |
| `build.py` | chore: bump python supported version to 3.12 (#2475) | 2025-06-24 09:22:04 +05:30 |
| `build_conda_env.sh` | chore: fix build script bug (#2507) | 2025-06-24 12:05:22 -07:00 |
| `build_container.sh` | build: update temp. created Containerfile (#2492) | 2025-06-27 10:23:12 +02:00 |
| `build_venv.sh` | chore: remove straggler references to llama-models (#1345) | 2025-03-01 14:26:03 -08:00 |
| `client.py` | chore: make cprint write to stderr (#2250) | 2025-05-24 23:39:57 -07:00 |
| `common.sh` | feat(pre-commit): enhance pre-commit hooks with additional checks (#2014) | 2025-04-30 11:35:49 -07:00 |
| `configure.py` | feat: refactor external providers dir (#2049) | 2025-05-15 20:17:03 +02:00 |
| `datatypes.py` | fix: Some missed env variable changes from PR 2490 (#2538) | 2025-06-26 17:59:15 -07:00 |
| `distribution.py` | ci: fix external provider test (#2438) | 2025-06-12 16:14:32 +02:00 |
| `inspect.py` | chore: use starlette built-in Route class (#2267) | 2025-05-28 09:53:33 -07:00 |
| `library_client.py` | refactor: unify stream and non-stream impls for responses (#2388) | 2025-06-05 17:48:09 +02:00 |
| `providers.py` | fix: Some missed env variable changes from PR 2490 (#2538) | 2025-06-26 17:59:15 -07:00 |
| `request_headers.py` | feat: fine grained access control policy (#2264) | 2025-06-03 14:51:12 -07:00 |
| `resolver.py` | fix: Some missed env variable changes from PR 2490 (#2538) | 2025-06-26 17:59:15 -07:00 |
| `stack.py` | refactor(env)!: enhanced environment variable substitution (#2490) | 2025-06-26 08:20:08 +05:30 |
| `start_stack.sh` | refactor: remove container from list of run image types (#2178) | 2025-06-02 09:57:55 +02:00 |