Add vLLM inference provider for OpenAI compatible vLLM server (#178)

This PR adds a vLLM inference provider for an OpenAI-compatible vLLM server.
Authored by Yuan Tang on 2024-10-20 21:43:25 -04:00, committed by Xi Yan
parent 391dedd1c0
commit 74e6356b51
6 changed files with 209 additions and 1 deletion
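
For context, an OpenAI-compatible vLLM server exposes the standard `/v1/chat/completions` endpoint, so any OpenAI client can talk to it. A minimal sketch of such a request, assuming a vLLM server is already running locally; the base URL, API key, and model name are placeholders, not part of this PR:

```python
# Sketch: querying a vLLM server through its OpenAI-compatible API.
# The base_url, api_key, and model name are placeholder assumptions.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # where the vLLM server listens
    api_key="not-needed",                 # vLLM does not require a real key by default
)
resp = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",  # whichever model the server is serving
    messages=[{"role": "user", "content": "Hello!"}],
)
print(resp.choices[0].message.content)
```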

@@ -0,0 +1,10 @@
name: remote-vllm
distribution_spec:
  description: Use remote vLLM for running LLM inference
  providers:
    inference: remote::vllm
    memory: meta-reference
    safety: meta-reference
    agents: meta-reference
    telemetry: meta-reference
image_type: docker
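
Conceptually, the `remote::vllm` provider delegates inference to that external server instead of loading model weights in-process, while the other providers (memory, safety, agents, telemetry) stay meta-reference. A simplified sketch of the kind of request such a provider forwards; this is not the PR's actual adapter code, and the URL, model name, and helper are hypothetical, though the endpoint path follows the OpenAI API that vLLM implements:

```python
# Simplified sketch (not this PR's adapter): forwarding a chat request
# from the stack to an OpenAI-compatible vLLM server over HTTP.
import httpx


def chat_completion(base_url: str, model: str, messages: list[dict]) -> str:
    """Send one chat completion request and return the assistant's reply."""
    resp = httpx.post(
        f"{base_url}/v1/chat/completions",  # standard OpenAI chat endpoint
        json={"model": model, "messages": messages},
        timeout=60.0,
    )
    resp.raise_for_status()
    return resp.json()["choices"][0]["message"]["content"]
```

Keeping inference remote also keeps the docker image built from this config lightweight: it ships no model weights of its own and only needs network access to the vLLM server.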