llama-stack


Author	SHA1	Message	Date
Ihar Hrachyshka	cc700b2f68	feat: support listing all for `llama stack list-providers` (#1056 ) # What does this PR do? Support listing all for `llama stack list-providers`. For ease of reading, sort the output rows by type. Before the change. ```  llama stack list-providers usage: llama stack list-providers [-h] {inference,safety,agents,vector_io,datasetio,scoring,eval,post_training,tool_runtime,telemetry} llama stack list-providers: error: the following arguments are required: api ``` After the change. ``` +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| API Type \| Provider Type \| PIP Package Dependencies \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| agents \| inline::meta-reference \| matplotlib,pillow,pandas,scikit-learn,aiosqlite,psycopg2-binary,redis \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| datasetio \| inline::localfs \| pandas \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| datasetio \| remote::huggingface \| datasets \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| eval \| inline::meta-reference \| \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| inline::meta-reference \| accelerate,blobfile,fairscale,torch,torchvision,transformers,zmq,lm-format- \| \| \| \| enforcer,sentence-transformers \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| inline::meta-reference-quantized \| 
accelerate,blobfile,fairscale,torch,torchvision,transformers,zmq,lm-format- \| \| \| \| enforcer,sentence-transformers,fbgemm-gpu,torchao==0.5.0 \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| inline::sentence-transformers \| sentence-transformers \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| inline::vllm \| vllm \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::bedrock \| boto3 \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::cerebras \| cerebras_cloud_sdk \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::databricks \| openai \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::fireworks \| fireworks-ai \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::groq \| groq \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::hf::endpoint \| huggingface_hub,aiohttp \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::hf::serverless \| huggingface_hub,aiohttp \| 
+---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::nvidia \| openai \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::ollama \| ollama,aiohttp \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::runpod \| openai \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::sambanova \| openai \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::tgi \| huggingface_hub,aiohttp \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::together \| together \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| inference \| remote::vllm \| openai \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| post_training \| inline::torchtune \| torch,torchtune==0.5.0,torchao==0.8.0,numpy \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| safety \| inline::code-scanner \| codeshield \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| safety \| inline::llama-guard \| \| 
+---------------+----------------------------------+----------------------------------------------------------------------------------+ \| safety \| inline::meta-reference \| transformers,torch --index-url https://download.pytorch.org/whl/cpu \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| safety \| inline::prompt-guard \| transformers,torch --index-url https://download.pytorch.org/whl/cpu \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| safety \| remote::bedrock \| boto3 \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| scoring \| inline::basic \| \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| scoring \| inline::braintrust \| autoevals,openai \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| scoring \| inline::llm-as-judge \| \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| telemetry \| inline::meta-reference \| opentelemetry-sdk,opentelemetry-exporter-otlp-proto-http \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| tool_runtime \| inline::code-interpreter \| \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| tool_runtime \| inline::rag-runtime \| \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| tool_runtime \| remote::bing-search \| requests 
\| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| tool_runtime \| remote::brave-search \| requests \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| tool_runtime \| remote::model-context-protocol \| mcp \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| tool_runtime \| remote::tavily-search \| requests \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| tool_runtime \| remote::wolfram-alpha \| requests \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| vector_io \| inline::chromadb \| blobfile,chardet,pypdf,tqdm,numpy,scikit- \| \| \| \| learn,scipy,nltk,sentencepiece,transformers,torch torchvision --index-url \| \| \| \| https://download.pytorch.org/whl/cpu,sentence-transformers --no-deps,chromadb \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| vector_io \| inline::faiss \| blobfile,chardet,pypdf,tqdm,numpy,scikit- \| \| \| \| learn,scipy,nltk,sentencepiece,transformers,torch torchvision --index-url \| \| \| \| https://download.pytorch.org/whl/cpu,sentence-transformers --no-deps,faiss-cpu \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| vector_io \| inline::meta-reference \| blobfile,chardet,pypdf,tqdm,numpy,scikit- \| \| \| \| learn,scipy,nltk,sentencepiece,transformers,torch torchvision --index-url \| \| \| \| https://download.pytorch.org/whl/cpu,sentence-transformers --no-deps,faiss-cpu \| 
+---------------+----------------------------------+----------------------------------------------------------------------------------+ \| vector_io \| remote::chromadb \| blobfile,chardet,pypdf,tqdm,numpy,scikit- \| \| \| \| learn,scipy,nltk,sentencepiece,transformers,torch torchvision --index-url \| \| \| \| https://download.pytorch.org/whl/cpu,sentence-transformers --no-deps,chromadb- \| \| \| \| client \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| vector_io \| remote::pgvector \| blobfile,chardet,pypdf,tqdm,numpy,scikit- \| \| \| \| learn,scipy,nltk,sentencepiece,transformers,torch torchvision --index-url \| \| \| \| https://download.pytorch.org/whl/cpu,sentence-transformers --no- \| \| \| \| deps,psycopg2-binary \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| vector_io \| remote::qdrant \| blobfile,chardet,pypdf,tqdm,numpy,scikit- \| \| \| \| learn,scipy,nltk,sentencepiece,transformers,torch torchvision --index-url \| \| \| \| https://download.pytorch.org/whl/cpu,sentence-transformers --no-deps,qdrant- \| \| \| \| client \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ \| vector_io \| remote::weaviate \| blobfile,chardet,pypdf,tqdm,numpy,scikit- \| \| \| \| learn,scipy,nltk,sentencepiece,transformers,torch torchvision --index-url \| \| \| \| https://download.pytorch.org/whl/cpu,sentence-transformers --no-deps,weaviate- \| \| \| \| client \| +---------------+----------------------------------+----------------------------------------------------------------------------------+ ``` [//]: # (If resolving an issue, uncomment and update the line below) [//]: # (Closes #[issue-number]) ## Test Plan Manually. 
[//]: # (## Documentation) Signed-off-by: Ihar Hrachyshka <ihar.hrachyshka@gmail.com>	2025-02-12 22:03:28 -08:00
Ihar Hrachyshka	24385cfd03	fix: filter out remote::sample providers when listing (#1057) # What does this PR do? Before: ```  llama stack list-providers agents +------------------------+-----------------------------------------------------------------------+ \| Provider Type \| PIP Package Dependencies \| +------------------------+-----------------------------------------------------------------------+ \| inline::meta-reference \| matplotlib,pillow,pandas,scikit-learn,aiosqlite,psycopg2-binary,redis \| +------------------------+-----------------------------------------------------------------------+ \| remote::sample \| \| +------------------------+-----------------------------------------------------------------------+ ``` After: ```  llama stack list-providers agents +------------------------+-----------------------------------------------------------------------+ \| Provider Type \| PIP Package Dependencies \| +------------------------+-----------------------------------------------------------------------+ \| inline::meta-reference \| matplotlib,pillow,pandas,scikit-learn,aiosqlite,psycopg2-binary,redis \| +------------------------+-----------------------------------------------------------------------+ ``` [//]: # (If resolving an issue, uncomment and update the line below) [//]: # (Closes #[issue-number]) ## Test Plan Manually. [//]: # (## Documentation) Signed-off-by: Ihar Hrachyshka <ihar.hrachyshka@gmail.com>	2025-02-11 16:12:46 -08:00
Yuan Tang	3f9764d50c	fix: List providers command prints out non-existing APIs from registry. Fixes #966 (#969) Fixes #966. Verified that: 1. The correct list of APIs is printed when running `llama stack list-providers`. 2. `llama stack list-providers <api>` works as expected. --------- Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>	2025-02-07 09:02:15 -08:00
Ashwin Bharambe	fe4aabd690	provider_id => provider_type, adapter_id => adapter_type	2024-10-02 14:05:59 -07:00
Ashwin Bharambe	df68db644b	Refactoring distribution/distribution.py This file was becoming too large and unclear what it housed. Split it into pieces.	2024-10-02 14:03:02 -07:00
Ashwin Bharambe	fe460ba103	Avoid importing a lot of stuff	2024-09-28 16:06:10 -07:00
Ashwin Bharambe	ec4fc800cc	[API Updates] Model / shield / memory-bank routing + agent persistence + support for private headers (#92) This is yet another of those large PRs (hopefully we will have fewer and fewer of them as things mature fast). This one introduces substantial improvements and some simplifications to the stack. Most important bits: * Agents reference implementation now has support for session / turn persistence. The default implementation uses sqlite but there's also support for using Redis. * We have re-architected the structure of the Stack APIs to allow for more flexible routing. The motivating use cases are: - routing model A to ollama and model B to a remote provider like Together - routing shield A to a local implementation and shield B to a remote provider like Bedrock - routing a vector memory bank to Weaviate while routing a keyvalue memory bank to Redis * Support for provider-specific parameters to be passed from the clients. A client can pass data using the `x_llamastack_provider_data` parameter, which can be type-checked and provided to the Adapter implementations.	2024-09-23 14:22:22 -07:00
Ashwin Bharambe	9487ad8294	API Updates (#73) * API Keys passed from Client instead of distro configuration * delete distribution registry * Rename the "package" word away * Introduce a "Router" layer for providers Some providers need to be factorized and considered as thin routing layers on top of other providers. Consider two examples: - The inference API should be a routing layer over inference providers, routed using the "model" key - The memory banks API is another instance where various memory bank types will be provided by independent providers (e.g., a vector store is served by Chroma while a keyvalue memory can be served by Redis or PGVector) This commit introduces a generalized routing layer for this purpose. * update `apis_to_serve` * llama_toolchain -> llama_stack * Codemod from llama_toolchain -> llama_stack - added providers/registry - cleaned up api/ subdirectories and moved impls away - restructured api/api.py - from llama_stack.apis.<api> import foo should work now - update imports to do llama_stack.apis.<api> - update many other imports - added __init__, fixed some registry imports - updated registry imports - create_agentic_system -> create_agent - AgenticSystem -> Agent * Moved some stuff out of common/; re-generated OpenAPI spec * llama-toolchain -> llama-stack (hyphens) * add control plane API * add redis adapter + sqlite provider * move core -> distribution * Some more toolchain -> stack changes * small naming shenanigans * Removing custom tool and agent utilities and moving them client side * Move control plane to distribution server for now * Remove control plane from API list * no codeshield dependency randomly plzzzzz * Add "fire" as a dependency * add back event loggers * stack configure fixes * use brave instead of bing in the example client * add init file so it gets packaged * add init files so it gets packaged * Update MANIFEST * bug fix --------- Co-authored-by: Hardik Shah <hjshah@fb.com> Co-authored-by: Xi Yan <xiyan@meta.com> 
Co-authored-by: Ashwin Bharambe <ashwin@meta.com>	2024-09-17 19:51:35 -07:00
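
As a rough illustration of the "Router" layer described in the API Updates commits (#73 and #92), the sketch below maps a routing key, such as a model name, to the provider that should serve it. Everything in it is an assumption made for illustration: the `RoutingTable` class, its `register` and `resolve` methods, and the model names `model-a` and `model-b` are hypothetical and are not taken from the llama-stack codebase.

```python
from dataclasses import dataclass, field


@dataclass
class RoutingTable:
    """Hypothetical routing table: routing key (e.g. a model name) -> provider name."""

    routes: dict[str, str] = field(default_factory=dict)

    def register(self, routing_key: str, provider: str) -> None:
        # Record which provider serves this routing key.
        self.routes[routing_key] = provider

    def resolve(self, routing_key: str) -> str:
        # Look up the provider for a key; fail loudly if none is registered.
        try:
            return self.routes[routing_key]
        except KeyError:
            raise ValueError(
                f"no provider registered for routing key {routing_key!r}"
            ) from None


# Illustrative setup mirroring "model A to ollama and model B to Together".
inference_routes = RoutingTable()
inference_routes.register("model-a", "ollama")
inference_routes.register("model-b", "together")
```

A caller would use `inference_routes.resolve("model-a")` to pick the provider before dispatching a request. The same shape could plausibly serve shields or memory banks, each keyed by its own identifier, though how the real router is structured is not shown in this history.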

Renamed from llama_toolchain/cli/stack/list_providers.py

8 commits