llama-stack-mirror/llama_stack/providers/remote/inference
Ashwin Bharambe 2e544ecd8a feat(tools)!: substantial clean up of "Tool" related datatypes (#3627)
This is a sweeping change to clean up some gunk around our "Tool"
definitions.

First, we had two types, `Tool` and `ToolDef`. The former was a
"Resource" type for the registry, but we stopped registering tools
inside the Registry long ago (and only registered ToolGroups). The
latter was for specifying tools for the Agents API. This PR removes the
former and adds an optional `toolgroup_id` field to the latter.

Secondly, as pointed out by @bbrowning in
https://github.com/llamastack/llama-stack/pull/3003#issuecomment-3245270132,
we were doing a lossy conversion from the full JSON schema in the MCP
tool specification into our ToolDefinition before sending it to the
model. This conversion is unnecessary: we do not execute anything
ourselves but merely pass the schema to the chat completions API, which
supports it. By converting (and doing so poorly), we ran into
limitations such as not supporting array items and not resolving $refs.

To fix this, we replaced the `parameters` field with `{ input_schema,
output_schema }`, both of which can be full-blown JSON schemas.
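
As a rough illustration of the pass-through idea, here is a minimal
sketch. `ToolDefSketch` and `to_chat_completions_tool` are hypothetical
names invented for this example and are not the actual llama-stack
classes or helpers; only the field names `input_schema`, `output_schema`
and `toolgroup_id` come from the description above. The point is that a
schema using array `items` and `$ref` reaches the chat completions tool
payload unchanged.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class ToolDefSketch:
    """Hypothetical stand-in for the reshaped ToolDef (not the real class)."""

    name: str
    description: Optional[str] = None
    input_schema: Optional[dict[str, Any]] = None   # full JSON schema
    output_schema: Optional[dict[str, Any]] = None  # full JSON schema
    toolgroup_id: Optional[str] = None              # optional, per this PR


def to_chat_completions_tool(tool: ToolDefSketch) -> dict[str, Any]:
    """Build an OpenAI-style tool entry, forwarding the schema untouched."""
    empty_schema: dict[str, Any] = {"type": "object", "properties": {}}
    return {
        "type": "function",
        "function": {
            "name": tool.name,
            "description": tool.description or "",
            # No per-parameter conversion: the schema object is passed as-is.
            "parameters": tool.input_schema or empty_schema,
        },
    }


# An MCP-style schema with array items and a $ref, which a lossy
# parameter-by-parameter conversion would have trouble representing.
mcp_schema = {
    "type": "object",
    "properties": {
        "tags": {"type": "array", "items": {"$ref": "#/$defs/Tag"}},
    },
    "required": ["tags"],
    "$defs": {"Tag": {"type": "string"}},
}

tool = ToolDefSketch(name="search", input_schema=mcp_schema, toolgroup_id="mcp::demo")
payload = to_chat_completions_tool(tool)
```

Because nothing is rewritten along the way, any construct the
downstream API accepts in a JSON schema is preserved without extra
conversion code.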

Finally, there were some types in our llama-related chat format
conversion which needed some cleanup. We are taking this opportunity to
clean those up.

This PR is a substantial breaking change to the API. However, given our
window for introducing breaking changes, this suits us just fine. I will
be landing a concurrent `llama-stack-client` change as well since API
shapes are changing.
2025-10-02 21:50:13 -07:00
anthropic feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547) 2025-09-25 17:17:00 -04:00
azure feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547) 2025-09-25 17:17:00 -04:00
bedrock chore: remove /v1/inference/completion and implementations (#3622) 2025-10-01 11:36:53 -04:00
cerebras chore: remove /v1/inference/completion and implementations (#3622) 2025-10-01 11:36:53 -04:00
databricks chore: remove /v1/inference/completion and implementations (#3622) 2025-10-01 11:36:53 -04:00
fireworks chore: remove /v1/inference/completion and implementations (#3622) 2025-10-01 11:36:53 -04:00
gemini feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547) 2025-09-25 17:17:00 -04:00
groq chore: prune mypy exclude list (#3561) 2025-09-26 11:44:43 -04:00
llama_openai_compat feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547) 2025-09-25 17:17:00 -04:00
nvidia chore: remove /v1/inference/completion and implementations (#3622) 2025-10-01 11:36:53 -04:00
ollama chore: remove /v1/inference/completion and implementations (#3622) 2025-10-01 11:36:53 -04:00
openai chore(api): remove batch inference (#3261) 2025-09-26 14:35:34 -07:00
passthrough chore: remove /v1/inference/completion and implementations (#3622) 2025-10-01 11:36:53 -04:00
runpod chore: remove /v1/inference/completion and implementations (#3622) 2025-10-01 11:36:53 -04:00
sambanova chore: prune mypy exclude list (#3561) 2025-09-26 11:44:43 -04:00
tgi chore: remove /v1/inference/completion and implementations (#3622) 2025-10-01 11:36:53 -04:00
together chore: remove /v1/inference/completion and implementations (#3622) 2025-10-01 11:36:53 -04:00
vertexai feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547) 2025-09-25 17:17:00 -04:00
vllm feat(tools)!: substantial clean up of "Tool" related datatypes (#3627) 2025-10-02 21:50:13 -07:00
watsonx chore: remove /v1/inference/completion and implementations (#3622) 2025-10-01 11:36:53 -04:00
__init__.py impls -> inline, adapters -> remote (#381) 2024-11-06 14:54:05 -08:00