llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-08 19:10:56 +00:00

Author	SHA1	Message	Date
Ashwin Bharambe	ed899a5dec	Convert TGI to work with openai_compat	2024-10-08 17:23:42 -07:00
Ashwin Bharambe	05e73d12b3	introduce openai_compat with the completions (not chat-completions) API This keeps the prompt encoding layer in our control (see `chat_completion_request_to_prompt()` method)	2024-10-08 17:23:42 -07:00
Adrian Cole	01d93be948	Adds markdown-link-check and fixes a broken link (#165 ) Signed-off-by: Adrian Cole <adrian.cole@elastic.co> Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>	2024-10-02 14:26:20 -07:00
Byung Chun Kim	2f096ca509	accepts not model itself. (#153 )	2024-09-29 20:16:50 -07:00
Ashwin Bharambe	0a3999a9a4	Use inference APIs for executing Llama Guard (#121 ) We should use Inference APIs to execute Llama Guard instead of directly needing to use HuggingFace modeling related code. The actual inference consideration is handled by Inference.	2024-09-28 15:40:06 -07:00
Ashwin Bharambe	56aed59eb4	Support for Llama3.2 models and Swift SDK (#98 )	2024-09-25 10:29:58 -07:00