# What does this PR do?

When running a Llama Stack server and invoking the `/v1/safety/run-shield` endpoint, the NVIDIA Guardrails endpoint in some cases errors with a `422: Unprocessable Entity` due to malformed input. For example, given a request body like:

```
{
  "model": "test",
  "messages": [
    { "role": "user", "content": "You are stupid." }
  ]
}
```

`convert_pydantic_to_json_value` converts the message to:

```
{
  "role": "user",
  "content": "You are stupid.",
  "context": null
}
```

which causes NVIDIA Guardrails to return `HTTPError: 422 Client Error: Unprocessable Entity for url: http://nemo.test/v1/guardrail/checks`, because `context` should not be included in the request body.

[//]: # (If resolving an issue, uncomment and update the line below)
[//]: # (Closes #[issue-number])

## Test Plan

I ran the Llama Stack server locally and manually verified that the endpoint now succeeds:

```
message = {"role": "user", "content": "You are stupid."}
response = client.safety.run_shield(messages=[message], shield_id=shield_id, params={})
```

Server logs:

```
14:29:09.656 [START] /v1/safety/run-shield
INFO:     127.0.0.1:54616 - "POST /v1/safety/run-shield HTTP/1.1" 200 OK
14:29:09.918 [END] /v1/safety/run-shield [StatusCode.OK] (262.26ms)
```

[//]: # (## Documentation)

Co-authored-by: Jash Gulabrai <jgulabrai@nvidia.com>
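
For illustration, a minimal sketch of the kind of payload sanitization this fix implies. The helper name, the allowed-key set, and the filtering strategy are assumptions for this example and are not taken from the provider's actual code, which may instead drop `None`-valued fields during conversion:

```python
from typing import Any


def _sanitize_guardrails_messages(messages: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Drop keys that the NeMo Guardrails checks endpoint rejects.

    Hypothetical helper illustrating the fix described above; `allowed_keys`
    is an assumption, not the provider's documented schema.
    """
    allowed_keys = {"role", "content"}
    return [
        {key: value for key, value in message.items() if key in allowed_keys}
        for message in messages
    ]


# The converted message with the extra "context" field is reduced to the
# shape NVIDIA Guardrails accepts.
converted = [{"role": "user", "content": "You are stupid.", "context": None}]
print(_sanitize_guardrails_messages(converted))
# -> [{'role': 'user', 'content': 'You are stupid.'}]
```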