forked from phoenix-oss/llama-stack-mirror
		
	# What does this PR do? Before this change, `distro_codegen.py` would only work if the user manually installed multiple provider-specific dependencies (see #1122). Now, users can run `distro_codegen.py` without any provider-specific dependencies because we avoid importing the entire provider implementations just to get the config needed to build the provider template. Concretely, this mostly means moving the MODEL_ALIASES (and related variants) definitions to a new models.py class within the provider implementation for those providers that require additional dependencies. It also meant moving a couple of imports from top-level imports to inside `get_adapter_impl` for some providers, which follows the pattern used by multiple existing providers. To ensure we don't regress and accidentally add new imports that cause distro_codegen.py to fail, the stubbed-in pre-commit hook for distro_codegen.py was uncommented and slightly tweaked to run via `uv run python ...` to ensure it runs with only the project's default dependencies and to run automatically instead of manually. Lastly, this updates distro_codegen.py itself to keep track of paths it might have changed and to only `git diff` those specific paths when checking for changed files instead of doing a diff on the entire working tree. The latter was overly broad and would require a user have no other unstaged changes in their working tree, even if those unstaged changes were unrelated to generated code. Now it only flags uncommitted changes for paths distro_codegen.py actually writes to. Our generated code was also out-of-date, presumably because of these issues, so this commit also has some updates to the generated code purely because it was out of sync, and the pre-commit hook now enforces things to be updated. (Closes #1122) ## Test Plan I manually tested distro_codegen.py and the pre-commit hook to verify those work as expected, flagging any uncommited changes and catching any imports that attempt to pull in provider-specific dependencies. However, I do not have valid api keys to the impacted provider implementations, and am unable to easily run the inference tests against each changed provider. There are no functional changes to the provider implementations here, but I'd appreciate a second set of eyes on the changed import statements and moving of MODEL_ALIASES type code to a separate models.py to ensure I didn't make any obvious errors. --------- Signed-off-by: Ben Browning <bbrownin@redhat.com> Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
		
			
				
	
	
		
			78 lines
		
	
	
	
		
			2.3 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
			
		
		
	
	
			78 lines
		
	
	
	
		
			2.3 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
| <!-- This file was auto-generated by distro_codegen.py, please edit source -->
 | |
| # Bedrock Distribution
 | |
| 
 | |
| ```{toctree}
 | |
| :maxdepth: 2
 | |
| :hidden:
 | |
| 
 | |
| self
 | |
| ```
 | |
| 
 | |
| The `llamastack/distribution-bedrock` distribution consists of the following provider configurations:
 | |
| 
 | |
| | API | Provider(s) |
 | |
| |-----|-------------|
 | |
| | agents | `inline::meta-reference` |
 | |
| | datasetio | `remote::huggingface`, `inline::localfs` |
 | |
| | eval | `inline::meta-reference` |
 | |
| | inference | `remote::bedrock` |
 | |
| | safety | `remote::bedrock` |
 | |
| | scoring | `inline::basic`, `inline::llm-as-judge`, `inline::braintrust` |
 | |
| | telemetry | `inline::meta-reference` |
 | |
| | tool_runtime | `remote::brave-search`, `remote::tavily-search`, `inline::code-interpreter`, `inline::rag-runtime`, `remote::model-context-protocol` |
 | |
| | vector_io | `inline::faiss`, `remote::chromadb`, `remote::pgvector` |
 | |
| 
 | |
| 
 | |
| 
 | |
| ### Environment Variables
 | |
| 
 | |
| The following environment variables can be configured:
 | |
| 
 | |
| - `LLAMA_STACK_PORT`: Port for the Llama Stack distribution server (default: `5001`)
 | |
| 
 | |
| ### Models
 | |
| 
 | |
| The following models are available by default:
 | |
| 
 | |
| - `meta-llama/Llama-3.1-8B-Instruct (meta.llama3-1-8b-instruct-v1:0)`
 | |
| - `meta-llama/Llama-3.1-70B-Instruct (meta.llama3-1-70b-instruct-v1:0)`
 | |
| - `meta-llama/Llama-3.1-405B-Instruct-FP8 (meta.llama3-1-405b-instruct-v1:0)`
 | |
| 
 | |
| 
 | |
| ### Prerequisite: API Keys
 | |
| 
 | |
| Make sure you have access to a AWS Bedrock API Key. You can get one by visiting [AWS Bedrock](https://aws.amazon.com/bedrock/).
 | |
| 
 | |
| 
 | |
| ## Running Llama Stack with AWS Bedrock
 | |
| 
 | |
| You can do this via Conda (build code) or Docker which has a pre-built image.
 | |
| 
 | |
| ### Via Docker
 | |
| 
 | |
| This method allows you to get started quickly without having to build the distribution code.
 | |
| 
 | |
| ```bash
 | |
| LLAMA_STACK_PORT=5001
 | |
| docker run \
 | |
|   -it \
 | |
|   -p $LLAMA_STACK_PORT:$LLAMA_STACK_PORT \
 | |
|   llamastack/distribution-bedrock \
 | |
|   --port $LLAMA_STACK_PORT \
 | |
|   --env AWS_ACCESS_KEY_ID=$AWS_ACCESS_KEY_ID \
 | |
|   --env AWS_SECRET_ACCESS_KEY=$AWS_SECRET_ACCESS_KEY \
 | |
|   --env AWS_SESSION_TOKEN=$AWS_SESSION_TOKEN \
 | |
|   --env AWS_DEFAULT_REGION=$AWS_DEFAULT_REGION
 | |
| ```
 | |
| 
 | |
| ### Via Conda
 | |
| 
 | |
| ```bash
 | |
| llama stack build --template bedrock --image-type conda
 | |
| llama stack run ./run.yaml \
 | |
|   --port $LLAMA_STACK_PORT \
 | |
|   --env AWS_ACCESS_KEY_ID=$AWS_ACCESS_KEY_ID \
 | |
|   --env AWS_SECRET_ACCESS_KEY=$AWS_SECRET_ACCESS_KEY \
 | |
|   --env AWS_SESSION_TOKEN=$AWS_SESSION_TOKEN \
 | |
|   --env AWS_DEFAULT_REGION=$AWS_DEFAULT_REGION
 | |
| ```
 |