llama-stack-mirror/llama_stack
Ashwin Bharambe 7519b73fcc
feat(distro): fork off a starter-gpu distribution (#3240)
The starter distribution added post-training, which added torch
dependencies that pull in all the NVIDIA CUDA libraries. This made our
starter container very big. We have worked hard to keep the starter
container small so it serves its purpose as a starter. This PR tries to
bring it back to its previous size by forking off duplicate "-gpu" providers for
post-training. These forked providers are then used for a new
`starter-gpu` distribution which can pull in all dependencies.
2025-08-22 15:47:15 -07:00
apis feat: implement query_metrics (#3074) 2025-08-22 14:19:24 -07:00
cli refactor(logging): rename llama_stack logger categories (#3065) 2025-08-21 17:31:04 -07:00
core refactor(logging): rename llama_stack logger categories (#3065) 2025-08-21 17:31:04 -07:00
distributions feat(distro): fork off a starter-gpu distribution (#3240) 2025-08-22 15:47:15 -07:00
models refactor(logging): rename llama_stack logger categories (#3065) 2025-08-21 17:31:04 -07:00
providers feat(distro): fork off a starter-gpu distribution (#3240) 2025-08-22 15:47:15 -07:00
strong_typing chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
testing fix(ci, tests): ensure uv environments in CI are kosher, record tests (#3193) 2025-08-18 17:02:24 -07:00
ui feat(UI): Adding a session manager (#3203) 2025-08-21 21:11:03 -04:00
__init__.py chore(rename): move llama_stack.distribution to llama_stack.core (#2975) 2025-07-30 23:30:53 -07:00
env.py refactor(test): move tools, evals, datasetio, scoring and post training tests (#1401) 2025-03-04 14:53:47 -08:00
log.py chore(pre-commit): add pre-commit hook to enforce llama_stack logger usage (#3061) 2025-08-20 07:15:35 -04:00
schema_utils.py feat(auth): API access control (#2822) 2025-07-24 15:30:48 -07:00