llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-28 06:11:59 +00:00

Hardik Shah 6ec2ed4196 feat: New OpenAI compat embeddings API (#2314)	2025-06-01 17:55:12 +05:30

# What does this PR do?

Adds a new OpenAI-compatible endpoint for the embeddings API: `/openai/v1/embeddings`. Added providers for OpenAI, LiteLLM and SentenceTransformer.

## Test Plan

```
LLAMA_STACK_CONFIG=http://localhost:8321 pytest -sv tests/integration/inference/test_openai_embeddings.py --embedding-model all-MiniLM-L6-v2,text-embedding-3-small,gemini/text-embedding-004
```
__init__.py	fix: remove ruff N999 (#1388)	2025-03-07 11:14:04 -08:00
dog.png	refactor: tests/unittests -> tests/unit; tests/api -> tests/integration	2025-03-04 09:57:00 -08:00
test_batch_inference.py	feat: add batch inference API to llama stack inference (#1945)	2025-04-12 11:41:12 -07:00
test_embedding.py	refactor: tests/unittests -> tests/unit; tests/api -> tests/integration	2025-03-04 09:57:00 -08:00
test_openai_completion.py	feat: support postgresql inference store (#2310)	2025-06-01 17:55:11 +05:30
test_openai_embeddings.py	feat: New OpenAI compat embeddings API (#2314)	2025-06-01 17:55:12 +05:30
test_text_inference.py	fix: llama4 tool use prompt fix (#2103)	2025-05-06 22:18:31 -07:00
test_vision_inference.py	test: verification on provider's OAI endpoints (#1893)	2025-04-07 23:06:28 -07:00
vision_test_1.jpg	feat: introduce llama4 support (#1877)	2025-04-05 11:53:35 -07:00
vision_test_2.jpg	feat: introduce llama4 support (#1877)	2025-04-05 11:53:35 -07:00
vision_test_3.jpg	feat: introduce llama4 support (#1877)	2025-04-05 11:53:35 -07:00
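
The latest commit in this directory adds an OpenAI-compatible embeddings endpoint. As a rough illustration of what a client call might look like, the sketch below builds a POST request using only the Python standard library. It is not taken from the repository: the helper names `build_embeddings_request` and `embed` are hypothetical, the base URL is the one from the test plan, the path is joined exactly as written in the commit message (a deployed server may mount it under a different prefix), and the `model`/`input` request fields and `data[].embedding` response shape are assumed to follow the OpenAI embeddings API.

```python
import json
import urllib.request

# Assumption: server address taken from the commit's test plan.
BASE_URL = "http://localhost:8321"
# Assumption: path exactly as written in the commit message; the real
# route prefix on a running server may differ.
EMBEDDINGS_PATH = "/openai/v1/embeddings"


def build_embeddings_request(model, texts, base_url=BASE_URL):
    """Build (but do not send) a POST request for the embeddings endpoint.

    Hypothetical helper; the payload fields follow the OpenAI embeddings
    request shape ("model" and "input").
    """
    payload = {"model": model, "input": list(texts)}
    return urllib.request.Request(
        url=base_url.rstrip("/") + EMBEDDINGS_PATH,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed(model, texts, base_url=BASE_URL):
    """Send the request and return one embedding vector per input text.

    Assumes an OpenAI-style response: {"data": [{"embedding": [...]}, ...]}.
    Requires a running server at base_url.
    """
    request = build_embeddings_request(model, texts, base_url)
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return [item["embedding"] for item in body["data"]]
```

With a server running locally, `embed("all-MiniLM-L6-v2", ["hello world"])` would be the call to try; the model name is one of those listed in the test plan. Separating request construction from sending keeps the payload easy to inspect without network access.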