llama-stack-mirror/llama_stack/providers/remote/inference
Varsha Prasad Narsing e35e6eebfe chore: Add OpenAI compatiblity for vLLM embeddings
- Implement OpenAI-compatible embeddings endpoint in vLLM provider
- Support both float and base64 encoding formats
- Add proper error handling and response formatting

Signed-off-by: Varsha Prasad Narsing <varshaprasad96@gmail.com>
2025-06-16 12:23:50 -07:00
..
anthropic chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
bedrock feat: New OpenAI compat embeddings API (#2314) 2025-05-31 22:11:47 -07:00
cerebras feat: New OpenAI compat embeddings API (#2314) 2025-05-31 22:11:47 -07:00
cerebras_openai_compat feat: introduce APIs for retrieving chat completion requests (#2145) 2025-05-18 21:43:19 -07:00
databricks feat: New OpenAI compat embeddings API (#2314) 2025-05-31 22:11:47 -07:00
fireworks feat: Add suffix to openai_completions (#2449) 2025-06-13 16:06:06 -07:00
fireworks_openai_compat feat: introduce APIs for retrieving chat completion requests (#2145) 2025-05-18 21:43:19 -07:00
gemini chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
groq chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
groq_openai_compat feat: introduce APIs for retrieving chat completion requests (#2145) 2025-05-18 21:43:19 -07:00
llama_openai_compat feat: introduce APIs for retrieving chat completion requests (#2145) 2025-05-18 21:43:19 -07:00
nvidia feat: Add suffix to openai_completions (#2449) 2025-06-13 16:06:06 -07:00
ollama feat: Add suffix to openai_completions (#2449) 2025-06-13 16:06:06 -07:00
openai feat: Add suffix to openai_completions (#2449) 2025-06-13 16:06:06 -07:00
passthrough feat: Add suffix to openai_completions (#2449) 2025-06-13 16:06:06 -07:00
runpod feat: New OpenAI compat embeddings API (#2314) 2025-05-31 22:11:47 -07:00
sambanova fix(providers): update sambanova json schema mode (#2306) 2025-05-29 09:54:23 -07:00
sambanova_openai_compat feat: introduce APIs for retrieving chat completion requests (#2145) 2025-05-18 21:43:19 -07:00
tgi feat: New OpenAI compat embeddings API (#2314) 2025-05-31 22:11:47 -07:00
together feat: Add suffix to openai_completions (#2449) 2025-06-13 16:06:06 -07:00
together_openai_compat feat: introduce APIs for retrieving chat completion requests (#2145) 2025-05-18 21:43:19 -07:00
vllm chore: Add OpenAI compatiblity for vLLM embeddings 2025-06-16 12:23:50 -07:00
watsonx feat: Add suffix to openai_completions (#2449) 2025-06-13 16:06:06 -07:00
__init__.py impls -> inline, adapters -> remote (#381) 2024-11-06 14:54:05 -08:00