Mirror of https://github.com/meta-llama/llama-stack.git, synced 2025-12-28 06:11:59 +00:00
# What does this PR do?

Adds a new OpenAI-compatible embeddings endpoint, `/openai/v1/embeddings`, with providers for OpenAI, LiteLLM, and SentenceTransformer.

## Test Plan

```
LLAMA_STACK_CONFIG=http://localhost:8321 pytest -sv tests/integration/inference/test_openai_embeddings.py --embedding-model all-MiniLM-L6-v2,text-embedding-3-small,gemini/text-embedding-004
```

| Name |
|---|
| `__init__.py` |
| `dog.png` |
| `test_batch_inference.py` |
| `test_embedding.py` |
| `test_openai_completion.py` |
| `test_openai_embeddings.py` |
| `test_text_inference.py` |
| `test_vision_inference.py` |
| `vision_test_1.jpg` |
| `vision_test_2.jpg` |
| `vision_test_3.jpg` |
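
For readers who want to exercise the `/openai/v1/embeddings` endpoint by hand rather than through pytest, the following is a minimal sketch, not the code from this PR. It assumes the request and response follow the public OpenAI embeddings shape (`model` and `input` in the request, a `data` list of `{index, embedding}` objects in the response) and that the server listens on the `http://localhost:8321` address from the test plan. The helper names (`build_embeddings_request`, `extract_vectors`, `fetch_embeddings`) are hypothetical, and a given deployment may mount the route under an additional prefix.

```python
import json
import urllib.request

# Assumption: base URL taken from the test plan; the route comes from the PR
# description. A real server may expose it under an extra version prefix.
BASE_URL = "http://localhost:8321"
ENDPOINT = "/openai/v1/embeddings"


def build_embeddings_request(model, texts, base_url=BASE_URL):
    """Build (but do not send) a POST request in the OpenAI embeddings shape."""
    payload = {"model": model, "input": texts}
    return urllib.request.Request(
        url=base_url.rstrip("/") + ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_vectors(response_body):
    """Return the embedding vectors ordered by their `index` field."""
    items = sorted(response_body["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def fetch_embeddings(model, texts, base_url=BASE_URL):
    """Send the request to a running server and return the vectors."""
    request = build_embeddings_request(model, texts, base_url)
    with urllib.request.urlopen(request) as response:
        return extract_vectors(json.load(response))
```

With a server running, `fetch_embeddings("all-MiniLM-L6-v2", ["hello world"])` should return one vector per input string. Keeping request construction and response parsing separate from the network call lets both be checked without a live server.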