llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-07-18 02:42:31 +00:00

History

Hardik Shah 6ec2ed4196 feat: New OpenAI compat embeddings API (#2314 ) # What does this PR do? Adds a new endpoint that is compatible with OpenAI for embeddings api. `/openai/v1/embeddings` Added providers for OpenAI, LiteLLM and SentenceTransformer. ## Test Plan ``` LLAMA_STACK_CONFIG=http://localhost:8321 pytest -sv tests/integration/inference/test_openai_embeddings.py --embedding-model all-MiniLM-L6-v2,text-embedding-3-small,gemini/text-embedding-004 ```		2025-06-01 17:55:12 +05:30
..
__init__.py	chore: split routing_tables into individual files (#2259 )	2025-05-24 23:15:05 -07:00
datasets.py	chore: split routers into individual files (datasets) (#2249 )	2025-05-24 22:11:43 -07:00
eval_scoring.py	chore: split routers into individual files (inference, tool, vector_io, eval_scoring) (#2258 )	2025-05-24 22:59:07 -07:00
inference.py	feat: New OpenAI compat embeddings API (#2314 )	2025-06-01 17:55:12 +05:30
safety.py	chore: split routers into individual files (safety)	2025-05-24 22:00:32 -07:00
tool_runtime.py	fix(tools): do not index tools, only index toolgroups (#2261 )	2025-05-25 13:27:52 -07:00
vector_io.py	chore: split routers into individual files (inference, tool, vector_io, eval_scoring) (#2258 )	2025-05-24 22:59:07 -07:00