llama-stack-mirror/llama_stack/templates
ehhuang 549812f51e
feat: implement get chat completions APIs (#2200)
# What does this PR do?
* Provides a SQLite implementation of the APIs introduced in
https://github.com/meta-llama/llama-stack/pull/2145.
* Introduces a SqlStore API (llama_stack/providers/utils/sqlstore/api.py)
and its first implementation, backed by SQLite.
* Pagination support will be added in a future PR.
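
To illustrate the idea of a storage interface with a SQLite backend behind it, here is a minimal sketch. It is not the code from `llama_stack/providers/utils/sqlstore/api.py`: the names `RowStore`, `SqliteRowStore`, `create_table`, `insert`, and `fetch_all`, as well as the `chat_completions` table layout, are assumptions made up for this example, and the real API may differ in shape and in being async.

```python
import json
import sqlite3
from typing import Any, Optional, Protocol


class RowStore(Protocol):
    """Hypothetical storage interface (an assumption, not the real SqlStore API)."""

    def create_table(self, table: str, columns: dict) -> None: ...
    def insert(self, table: str, row: dict) -> None: ...
    def fetch_all(self, table: str, where: Optional[dict] = None) -> list: ...


class SqliteRowStore:
    """SQLite-backed implementation of the hypothetical RowStore interface."""

    def __init__(self, path: str = ":memory:") -> None:
        self._conn = sqlite3.connect(path)
        # Return rows that can be converted to dicts keyed by column name.
        self._conn.row_factory = sqlite3.Row

    def create_table(self, table: str, columns: dict) -> None:
        # Table and column names are interpolated directly, so they must come
        # from trusted code, never from user input.
        cols = ", ".join(f"{name} {sql_type}" for name, sql_type in columns.items())
        self._conn.execute(f"CREATE TABLE IF NOT EXISTS {table} ({cols})")
        self._conn.commit()

    def insert(self, table: str, row: dict) -> None:
        names = ", ".join(row)
        placeholders = ", ".join("?" for _ in row)
        # Values are always bound as parameters.
        self._conn.execute(
            f"INSERT INTO {table} ({names}) VALUES ({placeholders})",
            tuple(row.values()),
        )
        self._conn.commit()

    def fetch_all(self, table: str, where: Optional[dict] = None) -> list:
        sql = f"SELECT * FROM {table}"
        params: tuple = ()
        if where:
            sql += " WHERE " + " AND ".join(f"{key} = ?" for key in where)
            params = tuple(where.values())
        return [dict(r) for r in self._conn.execute(sql, params).fetchall()]


def store_completion(store: RowStore, completion: dict) -> None:
    """Persist one chat completion as a JSON payload (illustrative layout)."""
    store.insert(
        "chat_completions",
        {
            "id": completion["id"],
            "model": completion["model"],
            "payload": json.dumps(completion),
        },
    )


if __name__ == "__main__":
    store = SqliteRowStore()
    store.create_table(
        "chat_completions",
        {"id": "TEXT PRIMARY KEY", "model": "TEXT", "payload": "TEXT"},
    )
    store_completion(store, {"id": "cmpl-1", "model": "example-model", "choices": []})
    print(store.fetch_all("chat_completions", where={"id": "cmpl-1"}))
```

Keeping callers dependent only on the interface is what would let another SQL backend be swapped in later without touching the code that lists or retrieves stored completions.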

## Test Plan
Unit test for the SQL store:
<img width="1005" alt="image"
src="https://github.com/user-attachments/assets/9b8b7ec8-632b-4667-8127-5583426b2e29"
/>


Integration test:
```
INFERENCE_MODEL="llama3.2:3b-instruct-fp16" llama stack build --template ollama --image-type conda --run
```
```
LLAMA_STACK_CONFIG=http://localhost:5001 INFERENCE_MODEL="llama3.2:3b-instruct-fp16" python -m pytest -v tests/integration/inference/test_openai_completion.py --text-model "llama3.2:3b-instruct-fp16" -k 'inference_store and openai'
```
2025-05-21 22:21:52 -07:00
bedrock feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
cerebras feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
ci-tests feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
dell feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
experimental-post-training feat: add huggingface post_training impl (#2132) 2025-05-16 14:41:28 -07:00
fireworks feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
groq feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
hf-endpoint feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
hf-serverless feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
llama_api feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
meta-reference-gpu feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
nvidia feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
ollama feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
open-benchmark feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
passthrough feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
remote-vllm feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
sambanova feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
starter feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
tgi feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
together feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
verification feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
vllm-gpu feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
watsonx feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
__init__.py Auto-generate distro yamls + docs (#468) 2024-11-18 14:57:06 -08:00
dependencies.json feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00
template.py feat: implement get chat completions APIs (#2200) 2025-05-21 22:21:52 -07:00