llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 18:00:36 +00:00

History

Jiayi Ni fa7699d2c3 feat: Add rerank API for NVIDIA Inference Provider (#3329 ) # What does this PR do? Add rerank API for NVIDIA Inference Provider. <!-- If resolving an issue, uncomment and update the line below --> Closes #3278 ## Test Plan Unit test: ``` pytest tests/unit/providers/nvidia/test_rerank_inference.py ``` Integration test: ``` pytest -s -v tests/integration/inference/test_rerank.py --stack-config="inference=nvidia" --rerank-model=nvidia/nvidia/nv-rerankqa-mistral-4b-v3 --env NVIDIA_API_KEY="" --env NVIDIA_BASE_URL="https://integrate.api.nvidia.com" ```		2025-10-30 21:42:09 -07:00
..
recordings	feat(api)!: BREAKING CHANGE: support passing `extra_body` through to providers (#3777 )	2025-10-10 16:21:44 -07:00
__init__.py	fix: remove ruff N999 (#1388 )	2025-03-07 11:14:04 -08:00
dog.png	refactor: tests/unittests -> tests/unit; tests/api -> tests/integration	2025-03-04 09:57:00 -08:00
test_openai_completion.py	fix: relax structured output test assertions to handle whitespace and… (#3997 )	2025-10-30 16:55:23 -07:00
test_openai_embeddings.py	fix(inference): enable routing of models with provider_data alone (#3928 )	2025-10-28 11:16:37 -07:00
test_openai_vision_inference.py	feat(internal): add image_url download feature to OpenAIMixin (#3516 )	2025-09-26 17:32:16 -04:00
test_provider_data_routing.py	fix(inference): enable routing of models with provider_data alone (#3928 )	2025-10-28 11:16:37 -07:00
test_rerank.py	feat: Add rerank API for NVIDIA Inference Provider (#3329 )	2025-10-30 21:42:09 -07:00
test_tools_with_schemas.py	feat(tools)!: substantial clean up of "Tool" related datatypes (#3627 )	2025-10-02 15:12:03 -07:00
test_vision_inference.py	chore(apis): unpublish deprecated /v1/inference apis (#3297 )	2025-09-27 11:20:06 -07:00
vision_test_1.jpg	feat: introduce llama4 support (#1877 )	2025-04-05 11:53:35 -07:00
vision_test_2.jpg	feat: introduce llama4 support (#1877 )	2025-04-05 11:53:35 -07:00
vision_test_3.jpg	feat: introduce llama4 support (#1877 )	2025-04-05 11:53:35 -07:00