llama-stack-mirror/llama_stack/providers/remote/inference
Hardik Shah 6ec2ed4196
feat: New OpenAI compat embeddings API (#2314)
# What does this PR do?
Adds a new OpenAI-compatible endpoint for the embeddings API:
`/openai/v1/embeddings`
Adds providers for OpenAI, LiteLLM, and SentenceTransformer.
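
As a rough illustration of how a client might call the new endpoint, here is a minimal sketch using only the Python standard library. It assumes the server from the test plan is listening on `http://localhost:8321`, that the route is reachable at exactly the path given above (a deployment may add its own prefix), and that request and response bodies follow the standard OpenAI embeddings shape (`model` and `input` in, a `data` list of `{index, embedding}` items out). The helper names `build_embeddings_request` and `extract_vectors` are hypothetical and not part of the PR.

```python
import json
import urllib.request

# Assumptions: base URL taken from the test plan; path taken from the PR text.
# A real deployment may mount the route under an additional prefix.
BASE_URL = "http://localhost:8321"
EMBEDDINGS_PATH = "/openai/v1/embeddings"


def build_embeddings_request(model, texts, base_url=BASE_URL):
    """Build (but do not send) a POST request in the OpenAI embeddings format."""
    payload = {"model": model, "input": list(texts)}
    return urllib.request.Request(
        url=base_url.rstrip("/") + EMBEDDINGS_PATH,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_vectors(response_body):
    """Return the embedding vectors ordered by their `index` field."""
    items = sorted(response_body["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


if __name__ == "__main__":
    # Requires a running server; the model name comes from the test plan.
    req = build_embeddings_request("all-MiniLM-L6-v2", ["hello world"])
    with urllib.request.urlopen(req) as resp:
        vectors = extract_vectors(json.load(resp))
    print(len(vectors), len(vectors[0]))
```

Building the request separately from sending it keeps the payload easy to inspect without a live server.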


## Test Plan
```
LLAMA_STACK_CONFIG=http://localhost:8321 pytest -sv tests/integration/inference/test_openai_embeddings.py --embedding-model all-MiniLM-L6-v2,text-embedding-3-small,gemini/text-embedding-004
```
2025-06-01 17:55:12 +05:30

| Name | Last commit | Date |
| --- | --- | --- |
| anthropic | chore: enable pyupgrade fixes (#1806) | 2025-05-01 14:23:50 -07:00 |
| bedrock | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| cerebras | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| cerebras_openai_compat | feat: introduce APIs for retrieving chat completion requests (#2145) | 2025-05-18 21:43:19 -07:00 |
| databricks | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| fireworks | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| fireworks_openai_compat | feat: introduce APIs for retrieving chat completion requests (#2145) | 2025-05-18 21:43:19 -07:00 |
| gemini | chore: enable pyupgrade fixes (#1806) | 2025-05-01 14:23:50 -07:00 |
| groq | chore: enable pyupgrade fixes (#1806) | 2025-05-01 14:23:50 -07:00 |
| groq_openai_compat | feat: introduce APIs for retrieving chat completion requests (#2145) | 2025-05-18 21:43:19 -07:00 |
| llama_openai_compat | feat: introduce APIs for retrieving chat completion requests (#2145) | 2025-05-18 21:43:19 -07:00 |
| nvidia | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| ollama | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| openai | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| passthrough | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| runpod | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| sambanova | fix(providers): update sambanova json schema mode (#2306) | 2025-05-29 09:54:23 -07:00 |
| sambanova_openai_compat | feat: introduce APIs for retrieving chat completion requests (#2145) | 2025-05-18 21:43:19 -07:00 |
| tgi | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| together | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| together_openai_compat | feat: introduce APIs for retrieving chat completion requests (#2145) | 2025-05-18 21:43:19 -07:00 |
| vllm | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| watsonx | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| __init__.py | impls -> inline, adapters -> remote (#381) | 2024-11-06 14:54:05 -08:00 |