llama-stack-mirror/src/llama_stack
Jiayi Ni fa7699d2c3
feat: Add rerank API for NVIDIA Inference Provider (#3329)
# What does this PR do?
Add rerank API for NVIDIA Inference Provider.

<!-- If resolving an issue, uncomment and update the line below -->
Closes #3278 

## Test Plan
Unit test:
```
pytest tests/unit/providers/nvidia/test_rerank_inference.py
```

Integration test: 
```
pytest -s -v tests/integration/inference/test_rerank.py   --stack-config="inference=nvidia"   --rerank-model=nvidia/nvidia/nv-rerankqa-mistral-4b-v3   --env NVIDIA_API_KEY=""   --env NVIDIA_BASE_URL="https://integrate.api.nvidia.com"
```
2025-10-30 21:42:09 -07:00
..
apis fix!: remove chunk_id property from Chunk class (#3954) 2025-10-29 18:59:59 -07:00
cli fix: handle missing external_providers_dir (#3974) 2025-10-30 17:01:31 -07:00
core fix: handle missing external_providers_dir (#3974) 2025-10-30 17:01:31 -07:00
distributions docs: add documentation on how to use custom run yaml in docker (#3949) 2025-10-28 16:05:44 -07:00
models fix(mypy): resolve provider utility and testing type issues (#3935) 2025-10-28 10:37:27 -07:00
providers feat: Add rerank API for NVIDIA Inference Provider (#3329) 2025-10-30 21:42:09 -07:00
strong_typing fix(mypy): part-03 completely resolve meta reference responses impl typing issues (#3951) 2025-10-29 08:07:15 -07:00
testing fix(mypy): add type stubs and fix typing issues (#3938) 2025-10-28 11:00:09 -07:00
ui chore(package): migrate to src/ layout (#3920) 2025-10-27 12:02:21 -07:00
__init__.py chore(package): migrate to src/ layout (#3920) 2025-10-27 12:02:21 -07:00
env.py chore(package): migrate to src/ layout (#3920) 2025-10-27 12:02:21 -07:00
log.py chore(package): migrate to src/ layout (#3920) 2025-10-27 12:02:21 -07:00
schema_utils.py chore(package): migrate to src/ layout (#3920) 2025-10-27 12:02:21 -07:00