llama-stack-mirror/llama_stack/providers/remote/inference/vllm
Hardik Shah 6ec2ed4196
feat: New OpenAI compat embeddings API (#2314)
# What does this PR do?
Adds a new endpoint that is compatible with OpenAI for embeddings api. 
`/openai/v1/embeddings`
Added providers for OpenAI, LiteLLM and SentenceTransformer. 


## Test Plan
```
LLAMA_STACK_CONFIG=http://localhost:8321 pytest -sv tests/integration/inference/test_openai_embeddings.py --embedding-model all-MiniLM-L6-v2,text-embedding-3-small,gemini/text-embedding-004
```
2025-06-01 17:55:12 +05:30
..
__init__.py Fix precommit check after moving to ruff (#927) 2025-02-02 06:46:45 -08:00
config.py fix: convert boolean string to boolean (#2284) 2025-05-27 13:05:38 -07:00
vllm.py feat: New OpenAI compat embeddings API (#2314) 2025-06-01 17:55:12 +05:30