Codemod from llama_toolchain -> llama_stack

- added providers/registry
- cleaned up api/ subdirectories and moved impls away
- restructured api/api.py
- from llama_stack.apis.<api> import foo should work now (see the import sketch after this list)
- update imports to use llama_stack.apis.<api>
- update many other imports
- added __init__, fixed some registry imports
- updated registry imports
- create_agentic_system -> create_agent
- AgenticSystem -> Agent
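
A rough before/after sketch of what this codemod means for downstream code; the specific symbols imported here (ChatCompletionRequest, RunShieldRequest) are assumptions for illustration and may not match the exact exports:

```
# Before (llama_toolchain era), imports looked roughly like:
#   from llama_toolchain.inference.api import ChatCompletionRequest

# After the codemod, APIs live under llama_stack.apis.<api>:
from llama_stack.apis.inference import ChatCompletionRequest  # assumed export
from llama_stack.apis.safety import RunShieldRequest          # assumed export

# Agentic-system names were shortened as well:
#   create_agentic_system -> create_agent
#   AgenticSystem         -> Agent
```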
Ashwin Bharambe 2024-09-16 17:34:07 -07:00
parent 2cf731faea
commit 76b354a081
128 changed files with 381 additions and 376 deletions

@@ -296,7 +296,7 @@ Once the server is setup, we can test it with a client to see the example output
 cd /path/to/llama-stack
 conda activate <env> # any environment containing the llama-toolchain pip package will work
-python -m llama_stack.inference.client localhost 5000
+python -m llama_stack.apis.inference.client localhost 5000
 ```
 This will run the chat completion client and query the distributions /inference/chat_completion API.
@@ -314,7 +314,7 @@ You know what's even more hilarious? People like you who think they can just Goo
 Similarly you can test safety (if you configured llama-guard and/or prompt-guard shields) by:
 ```
-python -m llama_stack.safety.client localhost 5000
+python -m llama_stack.apis.safety.client localhost 5000
 ```
 You can find more example scripts with client SDKs to talk with the Llama Stack server in our [llama-stack-apps](https://github.com/meta-llama/llama-stack-apps/tree/main/sdk_examples) repo.
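
For reference, a minimal sketch of hitting the same /inference/chat_completion endpoint directly over HTTP instead of through the bundled client module; the payload shape and model identifier here are assumptions, and the client module shown in the diff remains the supported path:

```
import requests

# Hypothetical direct HTTP call to the /inference/chat_completion endpoint.
# The request schema and model name are assumptions for illustration only.
response = requests.post(
    "http://localhost:5000/inference/chat_completion",
    json={
        "model": "Meta-Llama3.1-8B-Instruct",  # assumed model identifier
        "messages": [{"role": "user", "content": "Hello, what can you do?"}],
        "stream": False,
    },
    timeout=60,
)
response.raise_for_status()
print(response.json())
```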