docs

2025-10-15 06:37:58 +00:00 · 2024-10-29 14:14:05 -07:00 · 2024-10-29 14:14:05 -07:00 · 4d8bfcea1b
commit 4d8bfcea1b
parent eeab1278f2
3 changed files with 65 additions and 52 deletions
--- a/docs/source/api_providers/index.md
+++ b/docs/source/api_providers/index.md
@ -7,7 +7,8 @@ As an example, for Inference, we could have the implementation be backed by open
 A provider can also be just a pointer to a remote REST service -- for example, cloud providers or dedicated inference providers could serve these APIs.
 ```{toctree}
-:maxdepth: 2
+:maxdepth: 1
 new_api_provider
 memory_api
 ```
--- a/docs/source/api_providers/memory_api.md
+++ b/docs/source/api_providers/memory_api.md
@ -0,0 +1,53 @@
 # Memory API Providers
 ### Start Distribution With pgvector/chromadb Memory Provider
 ##### pgvector
 1. Start running the pgvector server:
 ```
 docker run --network host --name mypostgres -it -p 5432:5432 -e POSTGRES_PASSWORD=mysecretpassword -e POSTGRES_USER=postgres -e POSTGRES_DB=postgres pgvector/pgvector:pg16
 ```
 2. Edit the `run.yaml` file to point to the pgvector server.
 ```
 memory:
  - provider_id: pgvector
    provider_type: remote::pgvector
    config:
      host: 127.0.0.1
      port: 5432
      db: postgres
      user: postgres
      password: mysecretpassword
 ```
 > [!NOTE]
 > If you get a `RuntimeError: Vector extension is not installed.`. You will need to run `CREATE EXTENSION IF NOT EXISTS vector;` to include the vector extension. E.g.
 ```
 docker exec -it mypostgres ./bin/psql -U postgres
 postgres=# CREATE EXTENSION IF NOT EXISTS vector;
 postgres=# SELECT extname from pg_extension;
 extname
 ```
 3. Run `docker compose up` with the updated `run.yaml` file.
 ##### chromadb
 1. Start running chromadb server
 ```
 docker run -it --network host --name chromadb -p 6000:6000 -v ./chroma_vdb:/chroma/chroma -e IS_PERSISTENT=TRUE chromadb/chroma:latest
 ```
 2. Edit the `run.yaml` file to point to the chromadb server.
 ```
 memory:
  - provider_id: remote::chromadb
    provider_type: remote::chromadb
    config:
      host: localhost
      port: 6000
 ```
 3. Run `docker compose up` with the updated `run.yaml` file.
--- a/docs/source/getting_started/distributions/meta-reference-gpu.md
+++ b/docs/source/getting_started/distributions/meta-reference-gpu.md
@ -17,13 +17,10 @@ Llama3.1-8B           Llama3.2-11B-Vision-Instruct  Llama3.2-1B-Instruct  Llama3
 Llama3.1-8B-Instruct  Llama3.2-1B                   Llama3.2-3B-Instruct  Llama-Guard-3-1B              Prompt-Guard-86M
 ```
-### Start the Distribution (Single Node GPU)
+### Docker: Start the Distribution
 ```
-$ cd distributions/meta-reference-gpu
+$ cd distributions/meta-reference-gpu && docker compose up
 $ ls
 build.yaml  compose.yaml  README.md  run.yaml
 $ docker compose up
 ```
 > [!NOTE]
@ -40,60 +37,22 @@ This will download and start running a pre-built docker container. Alternatively
 docker run -it -p 5000:5000 -v ~/.llama:/root/.llama -v ./run.yaml:/root/my-run.yaml --gpus=all distribution-meta-reference-gpu --yaml_config /root/my-run.yaml
 ```
-### Alternative (Build and start distribution locally via conda)
+### Conda: Start the Distribution
 - You may checkout the [Getting Started](../../docs/getting_started.md) for more details on building locally via conda and starting up a meta-reference distribution.
-### Start Distribution With pgvector/chromadb Memory Provider
+1. Install the `llama` CLI. See [CLI Reference]()
-##### pgvector
+
-1. Start running the pgvector server:
+2. Build the `meta-reference-gpu` distribution
 ```
-docker run --network host --name mypostgres -it -p 5432:5432 -e POSTGRES_PASSWORD=mysecretpassword -e POSTGRES_USER=postgres -e POSTGRES_DB=postgres pgvector/pgvector:pg16
+$ llama stack build --template meta-reference-gpu --image-type conda
 ```
-2. Edit the `run.yaml` file to point to the pgvector server.
+3. Start running distribution
 ```
-memory:
+$ cd distributions/meta-reference-gpu
-  - provider_id: pgvector
+$ llama stack run ./run.yaml
    provider_type: remote::pgvector
    config:
      host: 127.0.0.1
      port: 5432
      db: postgres
      user: postgres
      password: mysecretpassword
 ```
 > [!NOTE]
 > If you get a `RuntimeError: Vector extension is not installed.`. You will need to run `CREATE EXTENSION IF NOT EXISTS vector;` to include the vector extension. E.g.
 ```
 docker exec -it mypostgres ./bin/psql -U postgres
 postgres=# CREATE EXTENSION IF NOT EXISTS vector;
 postgres=# SELECT extname from pg_extension;
 extname
 ```
 3. Run `docker compose up` with the updated `run.yaml` file.
 ##### chromadb
 1. Start running chromadb server
 ```
 docker run -it --network host --name chromadb -p 6000:6000 -v ./chroma_vdb:/chroma/chroma -e IS_PERSISTENT=TRUE chromadb/chroma:latest
 ```
 2. Edit the `run.yaml` file to point to the chromadb server.
 ```
 memory:
  - provider_id: remote::chromadb
    provider_type: remote::chromadb
    config:
      host: localhost
      port: 6000
 ```
 3. Run `docker compose up` with the updated `run.yaml` file.
 ### Serving a new model
 You may change the `config.model` in `run.yaml` to update the model currently being served by the distribution. Make sure you have the model checkpoint downloaded in your `~/.llama`.
 ```