Fix bedrock inference impl

2024-12-16 14:22:34 -08:00 · c2f7905fa4
commit c2f7905fa4
parent eb37fba9da
5 changed files with 47 additions and 8 deletions
--- a/docs/source/distributions/self_hosted_distro/bedrock.md
+++ b/docs/source/distributions/self_hosted_distro/bedrock.md
@@ -28,6 +28,13 @@ The following environment variables can be configured:

 - `LLAMASTACK_PORT`: Port for the Llama Stack distribution server (default: `5001`)

+### Models
+
+The following models are available by default:
+
+- `meta-llama/Llama-3.1-8B-Instruct (meta.llama3-1-8b-instruct-v1:0)`
+- `meta-llama/Llama-3.1-70B-Instruct (meta.llama3-1-70b-instruct-v1:0)`
+- `meta-llama/Llama-3.1-405B-Instruct-FP8 (meta.llama3-1-405b-instruct-v1:0)`


 ### Prerequisite: API Keys