llama-stack/llama_stack/providers/remote
Matthew Farrellee 706b4ca651
feat: support nvidia hosted vision models (llama 3.2 11b/90b) (#1278)
# What does this PR do?

Support the NVIDIA-hosted Llama 3.2 11B/90B vision models. They are not served
from the common https://integrate.api.nvidia.com/v1 endpoint; each model is
hosted at its own individual URL.
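
Because these models live at per-model endpoints rather than the shared base URL, the client needs a lookup step before issuing requests. A minimal sketch of that routing, assuming a simple dict-based mapping (the exact URLs and helper name here are illustrative, not the actual llama-stack implementation):

```python
# Sketch of per-model endpoint routing. The special-case URLs below are
# illustrative assumptions, not taken from the real provider code.
DEFAULT_BASE_URL = "https://integrate.api.nvidia.com/v1"

# Hypothetical mapping for the hosted vision models that have their own URLs.
SPECIAL_MODEL_URLS = {
    "meta/llama-3.2-11b-vision-instruct": "https://example.api.nvidia.com/v1/meta/llama-3.2-11b-vision-instruct",
    "meta/llama-3.2-90b-vision-instruct": "https://example.api.nvidia.com/v1/meta/llama-3.2-90b-vision-instruct",
}


def base_url_for(model_id: str) -> str:
    """Return the endpoint for a model, falling back to the common base URL."""
    return SPECIAL_MODEL_URLS.get(model_id, DEFAULT_BASE_URL)
```

A non-vision model falls through to `DEFAULT_BASE_URL`, so existing behavior is unchanged for models served from the common endpoint.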

## Test Plan

`LLAMA_STACK_BASE_URL=http://localhost:8321 pytest -s -v
tests/client-sdk/inference/test_vision_inference.py
--inference-model=meta/llama-3.2-11b-vision-instruct -k image`
2025-03-18 11:54:10 -07:00
| Name | Last commit | Date |
| --- | --- | --- |
| agents | test: add unit test to ensure all config types are instantiable (#1601) | 2025-03-12 22:29:58 -07:00 |
| datasetio | feat(api): (1/n) datasets api clean up (#1573) | 2025-03-17 16:55:45 -07:00 |
| inference | feat: support nvidia hosted vision models (llama 3.2 11b/90b) (#1278) | 2025-03-18 11:54:10 -07:00 |
| safety | feat: added nvidia as safety provider (#1248) | 2025-03-17 14:39:23 -07:00 |
| tool_runtime | test: add unit test to ensure all config types are instantiable (#1601) | 2025-03-12 22:29:58 -07:00 |
| vector_io | test: add unit test to ensure all config types are instantiable (#1601) | 2025-03-12 22:29:58 -07:00 |
| __init__.py | impls -> inline, adapters -> remote (#381) | 2024-11-06 14:54:05 -08:00 |