mirror of https://github.com/meta-llama/llama-stack.git, synced 2025-10-25 17:11:12 +00:00

6 commits

| Author | SHA1 | Message | Date | |
|---|---|---|---|---|
|  | 3344d8a9e5 | fix: separate build and run provider types (#2917) # What does this PR do? In #2637, I combined the run and build config provider types so that both use `Provider`. Because `Provider` includes a `provider_id`, a user must now specify one when writing a build YAML. This is confusing, because all a user should care about at build time is the code to be installed (the module and the `provider_type`). Introduce `BuildProvider` and fix up the parts of the code impacted by this. Signed-off-by: Charlie Doern <cdoern@redhat.com> | ||
|  | de6919ecdd | refactor: install external providers from module (#2637) # What does this PR do?
Today, external providers are installed via the `external_providers_dir`
in the config. This requires users to understand the `ProviderSpec`
and set up their directories accordingly. This process splits up the
config for the stack across multiple files, directories, and formats.
Most (if not all) external providers today have a
[get_provider_spec]( | ||
|  | 43c1f39bd6 | refactor(env)!: enhanced environment variable substitution (#2490) # What does this PR do?
This commit significantly improves the environment variable substitution
functionality in Llama Stack configuration files:
* The version field in configuration files has been changed from string
to integer type for better type consistency across build and run
configurations.
* The environment variable substitution system for ${env.FOO:} was fixed
and now properly returns an error.
* The environment variable substitution system for ${env.FOO+} returns
None instead of an empty string, which better matches the type annotations
in config fields.
* The system includes automatic type conversion for boolean, integer,
and float values.
* The error messages have been enhanced to provide clearer guidance when
environment variables are missing, including suggestions for using
default values or conditional syntax.
* Comprehensive documentation has been added to the configuration guide
explaining all supported syntax patterns, best practices, and runtime
override capabilities.
* Multiple provider configurations have been updated to use the new
conditional syntax for optional API keys, making the system more
flexible for different deployment scenarios. The telemetry configuration
has been improved to properly handle optional endpoints with appropriate
validation, ensuring that required endpoints are specified when their
corresponding sinks are enabled.
* There were many instances of ${env.NVIDIA_API_KEY:} that should have
caused the code to fail. However, due to a bug, the distro server was
still being started, and early validation wasn’t triggered. As a result,
failures were likely being handled downstream by the providers. I’ve
maintained similar behavior by using ${env.NVIDIA_API_KEY:+}, though I
believe this is incorrect for many configurations. I’ll leave it to each
provider to correct it as needed.
* Environment variable substitution now uses the same syntax as Bash
parameter expansion.
Signed-off-by: Sébastien Han <seb@redhat.com> | ||
|  | 446893f791 | feat: add deps dynamically based on metastore config (#2405) # What does this PR do? ## Test Plan Changed the metastore in one of the templates, reran distro gen, and observed the change in build.yaml. | ||
|  | a58c0639d5 | chore: update postgres_demo distro config (#2396) # What does this PR do? ## Test Plan | ||
|  | 2603f10f95 | feat: support postgresql inference store (#2310) # What does this PR do? * Added support for a PostgreSQL inference store * Added an 'oracle' template that demos how to configure PostgreSQL stores (except for telemetry, which is not currently supported) ## Test Plan `llama stack build --template oracle --image-type conda --run` `LLAMA_STACK_CONFIG=http://localhost:8321 pytest -s -v tests/integration/ --text-model accounts/fireworks/models/llama-v3p3-70b-instruct -k 'inference_store'` |
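
The substitution rules described in the message for 43c1f39bd6 (#2490) can be illustrated with a small Python sketch. It is not the Llama Stack implementation: the function name `substitute_env`, the `MissingEnvVar` exception, and the regular expression are hypothetical. It assumes Bash-style `:=` (default) and `:+` (conditional) operators, as the message's reference to Bash parameter expansion suggests. It models only the behaviors the message names: an error for a malformed `${env.FOO:}`, `None` instead of an empty string, and automatic conversion to boolean, integer, and float.

```python
import re

# Hypothetical pattern: ${env.NAME}, optionally followed by ":" and an operator part.
_PATTERN = re.compile(r"\$\{env\.(?P<name>[A-Za-z_][A-Za-z0-9_]*)(?::(?P<rest>[^}]*))?\}")


class MissingEnvVar(ValueError):
    """Raised when a required variable is not set (illustrative name)."""


def _convert(text):
    """Convert a string to bool, int, or float when it looks like one."""
    lowered = text.lower()
    if lowered in ("true", "false"):
        return lowered == "true"
    for cast in (int, float):
        try:
            return cast(text)
        except ValueError:
            pass
    return text


def substitute_env(value, env):
    """Resolve a config value that consists entirely of one ${env...} reference."""
    match = _PATTERN.fullmatch(value)
    if match is None:
        return value  # not a substitution expression; leave untouched

    name, rest = match.group("name"), match.group("rest")
    current = env.get(name)
    is_set = current not in (None, "")  # Bash ":" forms treat empty as unset

    if rest is None:
        # Plain ${env.NAME}: the variable is required.
        if current is None:
            raise MissingEnvVar(
                f"{name} is not set; consider ${{env.{name}:=default}} "
                f"or ${{env.{name}:+value}}"
            )
        result = current
    elif rest.startswith("="):
        result = current if is_set else rest[1:]
    elif rest.startswith("+"):
        result = rest[1:] if is_set else ""
    else:
        # Covers the malformed ${env.NAME:} form mentioned in the message.
        raise ValueError(f"malformed substitution: {value!r}")

    # An empty result becomes None rather than an empty string.
    return None if result == "" else _convert(result)
```

For example, under these assumptions `substitute_env("${env.PORT:=8321}", {})` yields the integer `8321`, while `substitute_env("${env.API_KEY:+}", {})` yields `None`, which fits an optional API key field.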