llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-06 12:37:33 +00:00

Matthew Farrellee e28cedd833 feat: add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation (#1213)	2025-02-27 16:58:11 -08:00

# What does this PR do?

- updates nvidia inference provider's embedding implementation to use new signature
- add support for task_type, output_dimensions, text_truncation parameters

## Test Plan

`LLAMA_STACK_BASE_URL=http://localhost:8321 pytest -v tests/client-sdk/inference/test_embedding.py --embedding-model baai/bge-m3`
__init__.py	[tests] add client-sdk pytests & delete client.py (#638)	2024-12-16 12:04:56 -08:00
dog.png	fix vllm base64 image inference (#815)	2025-01-17 17:07:28 -08:00
test_embedding.py	feat: add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation (#1213)	2025-02-27 16:58:11 -08:00
test_text_inference.py	feat(providers): Groq now uses LiteLLM openai-compat (#1303)	2025-02-27 13:16:50 -08:00
test_vision_inference.py	feat(providers): support non-llama models for inference providers (#1200)	2025-02-21 13:21:28 -08:00