llama-stack-mirror/llama_stack/providers/remote/inference
Ben Browning a5827f7cb3 Nvidia provider support for OpenAI API endpoints
This wires up the openai_completion and openai_chat_completion API
methods for the remote Nvidia inference provider and adds the provider
to the chat-completions portion of the OpenAI test suite.

The hosted Nvidia service doesn't currently serve any Llama models with
a working plain completions endpoint, so for now the test suite only
enables the Nvidia provider for chat completions.

Signed-off-by: Ben Browning <bbrownin@redhat.com>
2025-04-10 13:43:28 -04:00
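
For illustration, here is a minimal sketch of what wiring a provider's
openai_chat_completion method through an OpenAI-compatible client can look
like. This is a hedged example, not the repository's actual provider code:
the class name, base URL, and environment variables are assumptions.

import os

from openai import AsyncOpenAI


class NvidiaOpenAIChatSketch:
    """Hedged sketch: forwards OpenAI-style chat completion requests to an
    OpenAI-compatible endpoint. The class name, base URL, and env vars are
    illustrative assumptions, not the actual provider implementation."""

    def __init__(self) -> None:
        self._client = AsyncOpenAI(
            base_url=os.environ.get("NVIDIA_BASE_URL", "https://integrate.api.nvidia.com/v1"),
            api_key=os.environ.get("NVIDIA_API_KEY", "not-set"),
        )

    async def openai_chat_completion(self, model: str, messages: list[dict], **kwargs):
        # Pass the OpenAI-style request straight through to the remote service.
        return await self._client.chat.completions.create(
            model=model,
            messages=messages,
            **kwargs,
        )
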
Name | Latest commit | Last updated
anthropic | feat(providers): Groq now uses LiteLLM openai-compat (#1303) | 2025-02-27 13:16:50 -08:00
bedrock | Add unsupported OpenAI mixin to all remaining inference providers | 2025-04-09 15:47:02 -04:00
cerebras | Add unsupported OpenAI mixin to all remaining inference providers | 2025-04-09 15:47:02 -04:00
cerebras_openai_compat | test: verification on provider's OAI endpoints (#1893) | 2025-04-07 23:06:28 -07:00
databricks | Add unsupported OpenAI mixin to all remaining inference providers | 2025-04-09 15:47:02 -04:00
fireworks | test: verification on provider's OAI endpoints (#1893) | 2025-04-07 23:06:28 -07:00
fireworks_openai_compat | test: verification on provider's OAI endpoints (#1893) | 2025-04-07 23:06:28 -07:00
gemini | feat(providers): Groq now uses LiteLLM openai-compat (#1303) | 2025-02-27 13:16:50 -08:00
groq | test: verification on provider's OAI endpoints (#1893) | 2025-04-07 23:06:28 -07:00
groq_openai_compat | test: verification on provider's OAI endpoints (#1893) | 2025-04-07 23:06:28 -07:00
nvidia | Nvidia provider support for OpenAI API endpoints | 2025-04-10 13:43:28 -04:00
ollama | Add prompt_logprobs and guided_choice to OpenAI completions | 2025-04-09 15:47:02 -04:00
openai | feat(providers): Groq now uses LiteLLM openai-compat (#1303) | 2025-02-27 13:16:50 -08:00
passthrough | Add prompt_logprobs and guided_choice to OpenAI completions | 2025-04-09 15:47:02 -04:00
runpod | Add unsupported OpenAI mixin to all remaining inference providers | 2025-04-09 15:47:02 -04:00
sambanova | Add unsupported OpenAI mixin to all remaining inference providers | 2025-04-09 15:47:02 -04:00
sambanova_openai_compat | test: verification on provider's OAI endpoints (#1893) | 2025-04-07 23:06:28 -07:00
tgi | Add unsupported OpenAI mixin to all remaining inference providers | 2025-04-09 15:47:02 -04:00
together | Add prompt_logprobs and guided_choice to OpenAI completions | 2025-04-09 15:47:02 -04:00
together_openai_compat | test: verification on provider's OAI endpoints (#1893) | 2025-04-07 23:06:28 -07:00
vllm | vllm prompt_logprobs can also be 0 | 2025-04-09 17:32:03 -04:00
__init__.py | impls -> inline, adapters -> remote (#381) | 2024-11-06 14:54:05 -08:00