llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 18:00:36 +00:00

History

Jiayi Ni b72169ca47 docs: update the docs for NVIDIA Inference provider (#3227 ) # What does this PR do? - Documentation update and fix for the NVIDIA Inference provider. - Update the `run_moderation` for safety API with a `NotImplementedError` placeholder. Otherwise initialization NVIDIA inference client will raise an error. ## Test Plan N/A		2025-08-21 15:59:39 -07:00
..
inline	fix: handle mcp tool calls in previous response correctly (#3155 )	2025-08-20 14:12:15 -07:00
registry	feat: add batches API with OpenAI compatibility (with inference replay) (#3162 )	2025-08-15 15:34:15 -07:00
remote	docs: update the docs for NVIDIA Inference provider (#3227 )	2025-08-21 15:59:39 -07:00
utils	fix: Use `pool_pre_ping=True` in SQLAlchemy engine creation (#3208 )	2025-08-20 13:52:05 -07:00
__init__.py	API Updates (#73 )	2024-09-17 19:51:35 -07:00
datatypes.py	feat: create unregister shield API endpoint in Llama Stack (#2853 )	2025-08-05 07:33:46 -07:00