llama-stack-mirror/llama_stack/providers/utils
Hardik Shah 6ec2ed4196
feat: New OpenAI compat embeddings API (#2314)
# What does this PR do?
Adds a new endpoint that is compatible with OpenAI for embeddings api. 
`/openai/v1/embeddings`
Added providers for OpenAI, LiteLLM and SentenceTransformer. 


## Test Plan
```
LLAMA_STACK_CONFIG=http://localhost:8321 pytest -sv tests/integration/inference/test_openai_embeddings.py --embedding-model all-MiniLM-L6-v2,text-embedding-3-small,gemini/text-embedding-004
```
2025-06-01 17:55:12 +05:30
..
bedrock chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
common chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
datasetio chore(refact): move paginate_records fn outside of datasetio (#2137) 2025-05-12 10:56:14 -07:00
inference feat: New OpenAI compat embeddings API (#2314) 2025-06-01 17:55:12 +05:30
kvstore feat: support postgresql inference store (#2310) 2025-06-01 17:55:11 +05:30
memory feat: Enable ingestion of precomputed embeddings (#2317) 2025-06-01 17:55:11 +05:30
responses feat: add responses input items api (#2239) 2025-05-24 07:05:53 -07:00
scoring chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
sqlstore feat: support postgresql inference store (#2310) 2025-06-01 17:55:11 +05:30
telemetry feat: Propagate W3C trace context headers from clients (#2153) 2025-05-19 18:56:54 -07:00
tools fix: match mcp headers in provider data to Responses API shape (#2263) 2025-05-25 14:33:10 -07:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00
pagination.py chore(refact): move paginate_records fn outside of datasetio (#2137) 2025-05-12 10:56:14 -07:00
scheduler.py chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00