mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-12-04 02:03:44 +00:00
# What does this PR do?

Updates the NVIDIA inference provider's embedding implementation to use the new signature, and adds support for the `task_type`, `output_dimensions`, and `text_truncation` parameters.

## Test Plan

`LLAMA_STACK_BASE_URL=http://localhost:8321 pytest -v tests/client-sdk/inference/test_embedding.py --embedding-model baai/bge-m3`

| | | |
|---|---|---|
| __init__.py | ||
| dog.png | ||
| test_embedding.py | ||
| test_text_inference.py | ||
| test_vision_inference.py | ||
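
The PR description above mentions three optional embedding parameters. As a rough illustration only, the sketch below shows one way a provider adapter might fold such options into a request payload. Everything in it is an assumption rather than the code in this PR: the function name `build_embedding_request`, the accepted values for `task_type` and `text_truncation`, and the payload keys `input_type`, `dimensions`, and `truncate` are hypothetical placeholders.

```python
from typing import Any, Dict, List, Optional

# Hypothetical mappings from caller-facing values to backend values.
# The real provider may use different names or values.
TASK_TYPE_TO_INPUT_TYPE = {"query": "query", "document": "passage"}
TRUNCATION_TO_TRUNCATE = {"none": "NONE", "start": "START", "end": "END"}


def build_embedding_request(
    model: str,
    contents: List[str],
    task_type: Optional[str] = None,
    output_dimensions: Optional[int] = None,
    text_truncation: Optional[str] = None,
) -> Dict[str, Any]:
    """Build a request payload, adding only the options the caller set."""
    if not contents:
        raise ValueError("contents must not be empty")

    payload: Dict[str, Any] = {"model": model, "input": list(contents)}

    if task_type is not None:
        if task_type not in TASK_TYPE_TO_INPUT_TYPE:
            raise ValueError(f"unsupported task_type: {task_type!r}")
        payload["input_type"] = TASK_TYPE_TO_INPUT_TYPE[task_type]

    if output_dimensions is not None:
        if output_dimensions <= 0:
            raise ValueError("output_dimensions must be positive")
        payload["dimensions"] = output_dimensions

    if text_truncation is not None:
        if text_truncation not in TRUNCATION_TO_TRUNCATE:
            raise ValueError(f"unsupported text_truncation: {text_truncation!r}")
        payload["truncate"] = TRUNCATION_TO_TRUNCATE[text_truncation]

    return payload
```

Leaving unset options out of the payload, instead of sending explicit nulls, lets the backend apply its own defaults. That is a design choice of this sketch, not something the PR description states.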