llama-stack-mirror/tests/client-sdk
Sixian Yi 82a28f3a24
update doc for client-sdk testing (#849)
As title


## Before submitting

- [ ] This PR fixes a typo or improves the docs (you can dismiss the
other checks if that's the case).
- [ ] Ran pre-commit to handle lint / formatting issues.
- [ ] Read the [contributor
guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md),
      Pull Request section?
- [ ] Updated relevant documentation.
- [ ] Wrote necessary unit or integration tests.
2025-01-23 00:17:16 -08:00
..
agents Rename builtin::memory -> builtin::rag 2025-01-22 20:22:51 -08:00
inference Fix meta-reference GPU implementation for inference 2025-01-22 18:31:59 -08:00
safety nitpick 2025-01-22 18:09:46 -08:00
tool_runtime Update OpenAPI generator to output discriminator (#848) 2025-01-22 22:15:23 -08:00
vector_io add distro report (#847) 2025-01-22 19:20:49 -08:00
__init__.py [tests] add client-sdk pytests & delete client.py (#638) 2024-12-16 12:04:56 -08:00
conftest.py If initialization fails for library client, error the test 2025-01-22 18:12:15 -08:00
metadata.py add distro report (#847) 2025-01-22 19:20:49 -08:00
README.md update doc for client-sdk testing (#849) 2025-01-23 00:17:16 -08:00
report.py Kill colons 2025-01-22 22:59:30 -08:00

Llama Stack Integration Tests

You can run llama stack integration tests on either a Llama Stack Library or a Llama Stack endpoint.

To test on a Llama Stack library with certain configuration, run

LLAMA_STACK_CONFIG=./llama_stack/templates/cerebras/run.yaml
pytest -s -v tests/client-sdk/inference/test_inference.py

To test on a Llama Stack endpoint, run

LLAMA_STACK_BASE_URL=http//localhost:8089
pytest -s -v tests/client-sdk/inference/test_inference.py

Common options

Depending on the API, there are custom options enabled

  • For tests in inference/ and agents/, we support --inference-model(to be used in text inference tests) and--vision-inference-model` (only used in image inference tests) overrides
  • For tests in vector_io/, we support --embedding-model override
  • For tests in safety/, we support --safety-shield override