Add vLLM inference provider for OpenAI compatible vLLM server (#178)

This PR adds vLLM inference provider for OpenAI compatible vLLM server.
This commit is contained in:
Yuan Tang 2024-10-20 21:43:25 -04:00 committed by GitHub
parent 59c43736e8
commit a27a2cd2af
No known key found for this signature in database
GPG key ID: B5690EEEBB952194
6 changed files with 209 additions and 1 deletions

View file

@ -7,4 +7,4 @@ distribution_spec:
safety: meta-reference
agents: meta-reference
telemetry: meta-reference
image_type: conda
image_type: conda

View file

@ -0,0 +1,10 @@
name: remote-vllm
distribution_spec:
description: Use remote vLLM for running LLM inference
providers:
inference: remote::vllm
memory: meta-reference
safety: meta-reference
agents: meta-reference
telemetry: meta-reference
image_type: docker