llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-04 02:03:44 +00:00


Matthew Farrellee e28cedd833 feat: add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation (#1213)		2025-02-27 16:58:11 -08:00

# What does this PR do?

updates nvidia inference provider's embedding implementation to use new signature

add support for task_type, output_dimensions, text_truncation parameters

## Test Plan

`LLAMA_STACK_BASE_URL=http://localhost:8321 pytest -v tests/client-sdk/inference/test_embedding.py --embedding-model baai/bge-m3`
inline	fix: Avoid unexpected keyword argument for sentence_transformers (#1269)	2025-02-27 16:47:26 -08:00
registry	fix: groq now depends on litellm	2025-02-27 14:07:12 -08:00
remote	feat: add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation (#1213)	2025-02-27 16:58:11 -08:00
tests	feat(providers): Groq now uses LiteLLM openai-compat (#1303)	2025-02-27 13:16:50 -08:00
utils	feat(providers): Groq now uses LiteLLM openai-compat (#1303)	2025-02-27 13:16:50 -08:00
__init__.py	API Updates (#73)	2024-09-17 19:51:35 -07:00
datatypes.py	chore: move all Llama Stack types from llama-models to llama-stack (#1098)	2025-02-14 09:10:59 -08:00