[Feat] Add max_completion_tokens param (#5691)

* add max_completion_tokens

* add max_completion_tokens

* add max_completion_tokens support for OpenAI models

* add max_completion_tokens param

* add max_completion_tokens for bedrock converse models

* add test for converse maxTokens

* fix openai o1 param mapping test

* move test optional params

* add max_completion_tokens for anthropic api

* fix conftest

* add max_completion_tokens for vertex ai partner models

* add max_completion_tokens for fireworks ai

* add max_completion_tokens for hf rest api

* add test for param mapping

* add param mapping for vertex, gemini + testing

* predibase is the most unstable and unusable llm api in prod, can't handle our ci/cd

* add max_completion_tokens to openai supported params

* fix fireworks ai param mapping
Ishaan Jaff 2024-09-14 14:57:01 -07:00 committed by GitHub
parent 415a3ede9e
commit 85acdb9193
GPG key ID: B5690EEEBB952194 (no known key found for this signature in database)
31 changed files with 591 additions and 35 deletions


@@ -140,6 +140,7 @@ class OllamaChatConfig:
     ):
         return [
             "max_tokens",
+            "max_completion_tokens",
             "stream",
             "top_p",
             "temperature",
@@ -156,7 +157,7 @@ class OllamaChatConfig:
         self, model: str, non_default_params: dict, optional_params: dict
     ):
         for param, value in non_default_params.items():
-            if param == "max_tokens":
+            if param == "max_tokens" or param == "max_completion_tokens":
                 optional_params["num_predict"] = value
             if param == "stream":
                 optional_params["stream"] = value
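The hunks above map both OpenAI-style token-limit params onto Ollama's `num_predict`. A minimal standalone sketch of that mapping logic (the function name `map_ollama_params` is illustrative, not litellm's exact helper):

```python
def map_ollama_params(non_default_params: dict) -> dict:
    """Sketch of the OpenAI -> Ollama param mapping from the diff above:
    both "max_tokens" and the newer "max_completion_tokens" are renamed
    to Ollama's "num_predict". Illustrative, not litellm's actual code.
    """
    optional_params: dict = {}
    for param, value in non_default_params.items():
        # either token-limit spelling becomes num_predict
        if param in ("max_tokens", "max_completion_tokens"):
            optional_params["num_predict"] = value
        if param == "stream":
            optional_params["stream"] = value
    return optional_params


print(map_ollama_params({"max_completion_tokens": 100}))
# -> {'num_predict': 100}
```

As in the original loop, if a caller passes both `max_tokens` and `max_completion_tokens`, whichever is iterated last overwrites `num_predict`.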