llama-stack-mirror/llama_stack
Yuan Tang 74e6356b51 Add vLLM inference provider for OpenAI compatible vLLM server (#178)
This PR adds vLLM inference provider for OpenAI compatible vLLM server.
2024-10-21 10:46:45 -07:00
..
apis Make all methods async def again; add completion() for meta-reference (#270) 2024-10-21 10:46:40 -07:00
cli rename 2024-10-18 17:28:26 -07:00
distribution Add vLLM inference provider for OpenAI compatible vLLM server (#178) 2024-10-21 10:46:45 -07:00
providers Add vLLM inference provider for OpenAI compatible vLLM server (#178) 2024-10-21 10:46:45 -07:00
scripts Add a test for CLI, but not fully done so disabled 2024-09-19 13:27:07 -07:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00