forked from phoenix-oss/llama-stack-mirror
# What does this PR do?

* Add a test fixture for tgi
* Fix the logic so that the llama model is passed correctly for chat completion

Fixes #514

## Test Plan

`pytest -k "tgi" llama_stack/providers/tests/inference/test_text_inference.py --env TGI_URL=http://localhost:$INFERENCE_PORT --env TGI_API_TOKEN=$HF_TOKEN`

Name | ||
---|---|---|
bedrock | ||
databricks | ||
fireworks | ||
nvidia | ||
ollama | ||
sample | ||
tgi | ||
together | ||
vllm | ||
__init__.py |