llama-stack-mirror/llama_stack/providers/adapters/inference/vllm
2024-10-24 16:02:41 -07:00
__init__.py Add vLLM inference provider for OpenAI compatible vLLM server (#178) 2024-10-20 18:43:25 -07:00
config.py Add vLLM inference provider for OpenAI compatible vLLM server (#178) 2024-10-20 18:43:25 -07:00
vllm.py completion() for tgi (#295) 2024-10-24 16:02:41 -07:00