llama-stack

ehhuang 0b695538af fix: chat completion with more than one choice (#2288)	2025-05-27 15:39:15 -07:00

# What does this PR do?
Fix a bug in openai_compat where choices are not indexed correctly.

## Test Plan
Added a new test.

Rerun the failed inference_store tests:

    llama stack run fireworks --image-type conda
    pytest -s -v tests/integration/ --stack-config http://localhost:8321 -k 'test_inference_store' --text-model meta-llama/Llama-3.3-70B-Instruct --count 10

__init__.py	fix: remove ruff N999 (#1388)	2025-03-07 11:14:04 -08:00
dog.png	refactor: tests/unittests -> tests/unit; tests/api -> tests/integration	2025-03-04 09:57:00 -08:00
test_batch_inference.py	feat: add batch inference API to llama stack inference (#1945)	2025-04-12 11:41:12 -07:00
test_embedding.py	refactor: tests/unittests -> tests/unit; tests/api -> tests/integration	2025-03-04 09:57:00 -08:00
test_openai_completion.py	fix: chat completion with more than one choice (#2288)	2025-05-27 15:39:15 -07:00
test_text_inference.py	fix: llama4 tool use prompt fix (#2103)	2025-05-06 22:18:31 -07:00
test_vision_inference.py	test: verification on provider's OAI endpoints (#1893)	2025-04-07 23:06:28 -07:00
vision_test_1.jpg	feat: introduce llama4 support (#1877)	2025-04-05 11:53:35 -07:00
vision_test_2.jpg	feat: introduce llama4 support (#1877)	2025-04-05 11:53:35 -07:00
vision_test_3.jpg	feat: introduce llama4 support (#1877)	2025-04-05 11:53:35 -07:00