mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-12-28 16:11:59 +00:00
- Implement OpenAI-compatible embeddings endpoint in vLLM provider - Support both float and base64 encoding formats - Add proper error handling and response formatting Signed-off-by: Varsha Prasad Narsing <varshaprasad96@gmail.com> |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| config.py | ||
| vllm.py | ||