llama-stack

forked from phoenix-oss/llama-stack-mirror

History

Michael Dawson a654467552 feat: add cpu/cuda config for prompt guard (#2194 ) # What does this PR do? Previously prompt guard was hard coded to require cuda which prevented it from being used on an instance without a cuda support. This PR allows prompt guard to be configured to use either cpu or cuda. [//]: # (If resolving an issue, uncomment and update the line below) Closes [#2133](https://github.com/meta-llama/llama-stack/issues/2133) ## Test Plan (Edited after incorporating suggestion) 1) started stack configured with prompt guard as follows on a system without a GPU and validated prompt guard could be used through the APIs 2) validated on a system with a gpu (but without llama stack) that the python selecting between cpu and cuda support returned the right value when a cuda device was available. 3) ran the unit tests as per - https://github.com/meta-llama/llama-stack/blob/main/tests/unit/README.md [//]: # (## Documentation) --------- Signed-off-by: Michael Dawson <mdawson@devrus.com>		2025-05-28 12:23:15 -07:00
..
inline	feat: add cpu/cuda config for prompt guard (#2194 )	2025-05-28 12:23:15 -07:00
registry	feat: accept MCP authorization headers for MCP toolgroups (#2230 )	2025-05-23 08:52:18 -07:00
remote	fix: convert boolean string to boolean (#2284 )	2025-05-27 13:05:38 -07:00
utils	fix: chat completion with more than one choice (#2288 )	2025-05-27 15:39:15 -07:00
__init__.py	API Updates (#73 )	2024-09-17 19:51:35 -07:00
datatypes.py	fix(tools): do not index tools, only index toolgroups (#2261 )	2025-05-25 13:27:52 -07:00