llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 09:53:45 +00:00

Matthew Farrellee e28cedd833 feat: add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation (#1213)	2025-02-27 16:58:11 -08:00

# What does this PR do?

updates nvidia inference provider's embedding implementation to use new signature

add support for task_type, output_dimensions, text_truncation parameters

## Test Plan

`LLAMA_STACK_BASE_URL=http://localhost:8321 pytest -v tests/client-sdk/inference/test_embedding.py --embedding-model baai/bge-m3`

apis	ci: add mypy for static type checking (#1101)	2025-02-21 13:15:40 -08:00
cli	fix: update notebooks to avoid using the nutsy --image-name __system__ thing (#1308)	2025-02-27 16:39:04 -08:00
distribution	fix(test): update client-sdk tests to handle tool format parametrization better (#1287)	2025-02-26 21:16:00 -08:00
models/llama	chore: move all Llama Stack types from llama-models to llama-stack (#1098)	2025-02-14 09:10:59 -08:00
providers	feat: add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation (#1213)	2025-02-27 16:58:11 -08:00
scripts	ci: add mypy for static type checking (#1101)	2025-02-21 13:15:40 -08:00
strong_typing	Ensure that deprecations for fields follow through to OpenAPI	2025-02-19 13:54:04 -08:00
templates	docs: update the output of llama-stack-client models list (#1271)	2025-02-27 16:46:38 -08:00
__init__.py	export LibraryClient	2024-12-13 12:08:00 -08:00
schema_utils.py	ci: add mypy for static type checking (#1101)	2025-02-21 13:15:40 -08:00