llama-stack-mirror/llama_stack/distribution/routers
Hardik Shah 6ec2ed4196
feat: New OpenAI compat embeddings API (#2314)
# What does this PR do?
Adds a new OpenAI-compatible endpoint for the embeddings API:
`/openai/v1/embeddings`
Also adds providers for OpenAI, LiteLLM, and SentenceTransformer.
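
As a rough illustration of what a call to this endpoint might look like, here is a minimal sketch that only builds the HTTP request and does not send it. It assumes the request body follows the OpenAI embeddings shape (`model` and `input` fields) and that the server listens on `http://localhost:8321`, as in the test plan. The helper name `build_embeddings_request` is hypothetical and not part of this PR, and a real deployment may mount the route under an additional prefix.

```python
import json
import urllib.request

# Path as given in the PR description; a server-side prefix, if any, is not covered here.
EMBEDDINGS_PATH = "/openai/v1/embeddings"


def build_embeddings_request(base_url, model, texts):
    """Build (but do not send) a POST request for the embeddings endpoint.

    Hypothetical helper for illustration; the body assumes the OpenAI
    embeddings request shape: {"model": ..., "input": [...]}.
    """
    payload = {"model": model, "input": texts}
    return urllib.request.Request(
        url=base_url.rstrip("/") + EMBEDDINGS_PATH,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Base URL and model name are taken from the test plan.
req = build_embeddings_request(
    "http://localhost:8321", "all-MiniLM-L6-v2", ["hello world"]
)
print(req.get_method(), req.full_url)
```

Against a running server, passing `req` to `urllib.request.urlopen` would send it. The response format is not shown here because the PR description does not specify it.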


## Test Plan
```
LLAMA_STACK_CONFIG=http://localhost:8321 pytest -sv tests/integration/inference/test_openai_embeddings.py --embedding-model all-MiniLM-L6-v2,text-embedding-3-small,gemini/text-embedding-004
```
2025-06-01 17:55:12 +05:30
| File | Last commit | Date |
| --- | --- | --- |
| `__init__.py` | chore: split routing_tables into individual files (#2259) | 2025-05-24 23:15:05 -07:00 |
| `datasets.py` | chore: split routers into individual files (datasets) (#2249) | 2025-05-24 22:11:43 -07:00 |
| `eval_scoring.py` | chore: split routers into individual files (inference, tool, vector_io, eval_scoring) (#2258) | 2025-05-24 22:59:07 -07:00 |
| `inference.py` | feat: New OpenAI compat embeddings API (#2314) | 2025-06-01 17:55:12 +05:30 |
| `safety.py` | chore: split routers into individual files (safety) | 2025-05-24 22:00:32 -07:00 |
| `tool_runtime.py` | fix(tools): do not index tools, only index toolgroups (#2261) | 2025-05-25 13:27:52 -07:00 |
| `vector_io.py` | chore: split routers into individual files (inference, tool, vector_io, eval_scoring) (#2258) | 2025-05-24 22:59:07 -07:00 |