llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-08 19:10:56 +00:00

Ashwin Bharambe 05e73d12b3 introduce openai_compat with the completions (not chat-completions) API. This keeps the prompt encoding layer in our control (see the `chat_completion_request_to_prompt()` method).	2024-10-08 17:23:42 -07:00
bedrock	inference registry updates	2024-10-08 17:23:02 -07:00
databricks	Introduce model_store, shield_store, memory_bank_store	2024-10-08 17:23:02 -07:00
fireworks	introduce openai_compat with the completions (not chat-completions) API	2024-10-08 17:23:42 -07:00
ollama	introduce openai_compat with the completions (not chat-completions) API	2024-10-08 17:23:42 -07:00
sample	Introduce model_store, shield_store, memory_bank_store	2024-10-08 17:23:02 -07:00
tgi	Add inference test	2024-10-08 17:23:02 -07:00
together	introduce openai_compat with the completions (not chat-completions) API	2024-10-08 17:23:42 -07:00
__init__.py	API Updates (#73)	2024-09-17 19:51:35 -07:00