llama-stack-mirror/tests/unit/providers/inline/__init__.py at 2daae3e3d4ebf47fa960f567ceed1fb904911a2e - phoenix-oss/llama-stack-mirror - Git for basel.kvant.cloud

phoenix-oss/llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-24 00:47:00 +00:00

ehhuang 6cce553c93

Test External Providers Installed via Module / test-external-providers-from-module (venv) (push) Has been skipped

Details

Test External API and Providers / test-external (venv) (push) Failing after 6s

Details

SqlStore Integration Tests / test-postgres (3.12) (push) Failing after 11s

Details

Integration Auth Tests / test-matrix (oauth2_token) (push) Failing after 17s

Details

Unit Tests / unit-tests (3.13) (push) Failing after 14s

Details

Vector IO Integration Tests / test-matrix (push) Failing after 19s

Details

SqlStore Integration Tests / test-postgres (3.13) (push) Failing after 21s

Details

Python Package Build Test / build (3.12) (push) Failing after 20s

Details

Python Package Build Test / build (3.13) (push) Failing after 23s

Details

Integration Tests (Replay) / Integration Tests (, , , client=, ) (push) Failing after 28s

Details

Unit Tests / unit-tests (3.12) (push) Failing after 25s

Details

API Conformance Tests / check-schema-compatibility (push) Successful in 32s

Details

UI Tests / ui-tests (22) (push) Successful in 57s

Details

Pre-commit / pre-commit (push) Successful in 1m18s

Details

fix: mcp tool with array type should include items (#3602 )

# What does this PR do?
Fixes error:
```
[ERROR] Error executing endpoint route='/v1/openai/v1/responses'  
         method='post': Error code: 400 - {'error': {'message': "Invalid schema for function 'pods_exec': In context=('properties', 'command'), array 
         schema missing items.", 'type': 'invalid_request_error', 'param': 'tools[7].function.parameters', 'code': 'invalid_function_parameters'}} 
```

From script:
```
#!/usr/bin/env python3
"""
Script to test Responses API with kubernetes-mcp-server.

This script:
1. Connects to the llama stack server
2. Uses the Responses API with MCP tools
3. Asks for the list of Kubernetes namespaces using the kubernetes-mcp-server
"""

import json

from openai import OpenAI

# Connect to the llama stack server
base_url = "http://localhost:8321/v1/openai/v1"
client = OpenAI(base_url=base_url, api_key="fake")

# Define the MCP tool pointing to the kubernetes-mcp-server
# The kubernetes-mcp-server is running on port 3000 with SSE endpoint at /sse
mcp_server_url = "http://localhost:3000/sse"

tools = [
    {
        "type": "mcp",
        "server_label": "k8s",
        "server_url": mcp_server_url,
    }
]

# Create a response request asking for k8s namespaces
print("Sending request to list Kubernetes namespaces...")
print(f"Using MCP server at: {mcp_server_url}")
print("Available tools will be listed automatically by the MCP server.")
print()

response = client.responses.create(
    # model="meta-llama/Llama-3.2-3B-Instruct",  # Using the vllm model
    model="openai/gpt-4o",
    input="what are all the Kubernetes namespaces? Use tool call to `namespaces_list`. make sure to adhere to the tool calling format.",
    tools=tools,
    stream=False,
)

print("\n" + "=" * 80)
print("RESPONSE OUTPUT:")
print("=" * 80)

# Print the output
for i, output in enumerate(response.output):
    print(f"\n[Output {i + 1}] Type: {output.type}")
    if output.type == "mcp_list_tools":
        print(f"  Server: {output.server_label}")
        print(f"  Tools available: {[t.name for t in output.tools]}")
    elif output.type == "mcp_call":
        print(f"  Tool called: {output.name}")
        print(f"  Arguments: {output.arguments}")
        print(f"  Result: {output.output}")
        if output.error:
            print(f"  Error: {output.error}")
    elif output.type == "message":
        print(f"  Role: {output.role}")
        print(f"  Content: {output.content}")

print("\n" + "=" * 80)
print("FINAL RESPONSE TEXT:")
print("=" * 80)
print(response.output_text)
```


## Test Plan
new unit tests
script now runs successfully

2025-09-29 23:11:41 -07:00

5 lines

200 B

Python

Raw Blame History

 # Copyright (c) Meta Platforms, Inc. and affiliates.
 # All rights reserved.
 #
 # This source code is licensed under the terms described in the LICENSE file in
 # the root directory of this source tree.