llama-stack

History

Matthew Farrellee e28cedd833 feat: add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation (#1213 ) # What does this PR do? updates nvidia inference provider's embedding implementation to use new signature add support for task_type, output_dimensions, text_truncation parameters ## Test Plan `LLAMA_STACK_BASE_URL=http://localhost:8321 pytest -v tests/client-sdk/inference/test_embedding.py --embedding-model baai/bge-m3`		2025-02-27 16:58:11 -08:00
..
__init__.py	add NVIDIA NIM inference adapter (#355 )	2024-11-23 15:59:00 -08:00
config.py	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
models.py	fix: register provider model name and HF alias in run.yaml (#1304 )	2025-02-27 16:39:23 -08:00
nvidia.py	feat: add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation (#1213 )	2025-02-27 16:58:11 -08:00
openai_utils.py	refactor: move OpenAI compat utilities from nvidia to openai_compat (#1258 )	2025-02-25 22:02:11 -08:00
utils.py	style: remove prints in codebase (#1146 )	2025-02-18 19:41:37 -08:00