llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-04 04:04:14 +00:00

History

Dinesh Yeduguru de7af28756 Tgi fixture (#519 ) # What does this PR do? * Add a test fixture for tgi * Fixes the logic to correctly pass the llama model for chat completion Fixes #514 ## Test Plan pytest -k "tgi" llama_stack/providers/tests/inference/test_text_inference.py --env TGI_URL=http://localhost:$INFERENCE_PORT --env TGI_API_TOKEN=$HF_TOKEN		2024-11-25 13:17:02 -08:00
..
inline	Fix opentelemetry adapter (#510 )	2024-11-22 18:18:11 -08:00
registry	add NVIDIA NIM inference adapter (#355 )	2024-11-23 15:59:00 -08:00
remote	Tgi fixture (#519 )	2024-11-25 13:17:02 -08:00
tests	Tgi fixture (#519 )	2024-11-25 13:17:02 -08:00
utils	add missing __init__	2024-11-25 09:42:46 -08:00
__init__.py	API Updates (#73 )	2024-09-17 19:51:35 -07:00
datatypes.py	unregister for memory banks and remove update API (#458 )	2024-11-14 17:12:11 -08:00