[Feat] Add max_completion_tokens param (#5691)

* add max_completion_tokens

* add max_completion_tokens

* add max_completion_tokens support for OpenAI models

* add max_completion_tokens param

* add max_completion_tokens for bedrock converse models

* add test for converse maxTokens

* fix openai o1 param mapping test

* move test optional params

* add max_completion_tokens for anthropic api

* fix conftest

* add max_completion_tokens for vertex ai partner models

* add max_completion_tokens for fireworks ai

* add max_completion_tokens for hf rest api

* add test for param mapping

* add param mapping for vertex, gemini + testing

* predibase api is too unstable and unreliable in prod to run in our ci/cd

* add max_completion_tokens to openai supported params

* fix fireworks ai param mapping
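
Several of the commits above add "param mapping", i.e. translating the OpenAI-style max_completion_tokens argument into each provider's native output-token limit (for example maxTokens for Bedrock Converse). The Python sketch below only illustrates that idea; the function name map_max_completion_tokens and the provider keys are hypothetical and are not LiteLLM's actual internals.

# Illustrative sketch only: mapping an OpenAI-style max_completion_tokens
# value to a provider-native token-limit field. The function name and the
# provider keys are hypothetical, not LiteLLM's real implementation.
def map_max_completion_tokens(provider: str, max_completion_tokens: int) -> dict:
    if provider == "bedrock_converse":
        # Bedrock Converse expects a camelCase maxTokens field.
        return {"maxTokens": max_completion_tokens}
    if provider in ("anthropic", "vertex_ai", "fireworks_ai"):
        # These APIs expose a native max_tokens field.
        return {"max_tokens": max_completion_tokens}
    # OpenAI chat completions accept max_completion_tokens directly.
    return {"max_completion_tokens": max_completion_tokens}

print(map_max_completion_tokens("bedrock_converse", 10))  # {'maxTokens': 10}
print(map_max_completion_tokens("openai", 10))            # {'max_completion_tokens': 10}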
Author: Ishaan Jaff (committed by GitHub)
Date: 2024-09-14 14:57:01 -07:00
parent 415a3ede9e
commit 85acdb9193
31 changed files with 591 additions and 35 deletions


@@ -1317,11 +1317,12 @@ import openai
 def test_completion_gpt4_turbo():
     litellm.set_verbose = True
     try:
         response = completion(
             model="gpt-4-1106-preview",
             messages=messages,
             max_tokens=10,
+            max_completion_tokens=10,
         )
         print(response)
     except openai.RateLimitError:
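
For context, a minimal end-to-end sketch of calling the new parameter through LiteLLM after this change; the prompt is a placeholder, and it assumes max_completion_tokens is accepted by litellm.completion across the providers listed in the commits above.

import litellm

# max_completion_tokens is passed like any other OpenAI-compatible parameter;
# for non-OpenAI providers LiteLLM translates it to the native token limit.
response = litellm.completion(
    model="gpt-4-1106-preview",
    messages=[{"role": "user", "content": "Say hello in one word."}],
    max_completion_tokens=10,
)
print(response.choices[0].message.content)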