llama-stack

History

Matthew Farrellee e28cedd833 feat: add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation (#1213 ) # What does this PR do? updates nvidia inference provider's embedding implementation to use new signature add support for task_type, output_dimensions, text_truncation parameters ## Test Plan `LLAMA_STACK_BASE_URL=http://localhost:8321 pytest -v tests/client-sdk/inference/test_embedding.py --embedding-model baai/bge-m3`		2025-02-27 16:58:11 -08:00
..
agents	build: format codebase imports using ruff linter (#1028 )	2025-02-13 10:06:21 -08:00
datasetio	build: format codebase imports using ruff linter (#1028 )	2025-02-13 10:06:21 -08:00
inference	feat: add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation (#1213 )	2025-02-27 16:58:11 -08:00
safety	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
tool_runtime	fix: Get distro_codegen.py working with default deps and enabled in pre-commit hooks (#1123 )	2025-02-19 18:39:20 -08:00
vector_io	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
__init__.py	`impls` -> `inline`, `adapters` -> `remote` (#381 )	2024-11-06 14:54:05 -08:00