llama-stack-mirror/llama_stack/providers/adapters/inference
Ashwin Bharambe 05e73d12b3 introduce openai_compat with the completions (not chat-completions) API
This keeps the prompt encoding layer in our control (see
`chat_completion_request_to_prompt()` method)
2024-10-08 17:23:42 -07:00
..
bedrock inference registry updates 2024-10-08 17:23:02 -07:00
databricks Introduce model_store, shield_store, memory_bank_store 2024-10-08 17:23:02 -07:00
fireworks introduce openai_compat with the completions (not chat-completions) API 2024-10-08 17:23:42 -07:00
ollama introduce openai_compat with the completions (not chat-completions) API 2024-10-08 17:23:42 -07:00
sample Introduce model_store, shield_store, memory_bank_store 2024-10-08 17:23:02 -07:00
tgi Add inference test 2024-10-08 17:23:02 -07:00
together introduce openai_compat with the completions (not chat-completions) API 2024-10-08 17:23:42 -07:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00