Commit graph

1169 commits

Author SHA1 Message Date
Mikkel Gravgaard
e2213a1a5e
Use DEBUG level for curl command logging
Currently, the INFO level is used, which can cause excessive logging in production.
2024-04-12 11:27:53 +02:00
Krrish Dholakia
7a3821e0f6 fix(gemini.py): log system prompt in verbose output 2024-04-11 23:15:58 -07:00
Krrish Dholakia
c377ba0755 fix(utils.py): vertex ai exception mapping
fixes check which caused all vertex errors to be ratelimit errors
2024-04-11 23:04:21 -07:00
David Manouchehri
649c3bb0dd
(feat) - Add support for JSON mode in Vertex AI 2024-04-12 00:03:29 +00:00
Krish Dholakia
83e7ed94ce
Merge pull request #2942 from BerriAI/litellm_fix_router_loading
Router Async Improvements
2024-04-10 20:16:53 -07:00
Krrish Dholakia
37ac17aebd fix(router.py): fix datetime object 2024-04-10 17:55:24 -07:00
Ishaan Jaff
0c26404cb3 fix - allow base64 cache hits embedding responses 2024-04-10 16:44:40 -07:00
Krrish Dholakia
b2741933dc fix(proxy_cli.py): don't double load the router config
was causing callbacks to be instantiated twice - double counting usage in cache
2024-04-10 13:23:56 -07:00
Ishaan Jaff
caec5374d9
Merge pull request #2893 from unclecode/main
Fix issue #2832: Add protected_namespaces to Config class within utils.py, router.py and completion.py to avoid the warning message.
2024-04-09 08:51:41 -07:00
Krrish Dholakia
48bfc45cb0 fix(utils.py): fix reordering of items for cached embeddings
ensures cached embedding item is returned in correct order
2024-04-08 12:18:24 -07:00
unclecode
1ace192155 Fix issue #2832: Add protected_namespaces to Config class within utils.py, router.py and completion.py to avoid the warning message. 2024-04-08 12:43:17 +08:00
Ishaan Jaff
a5aef6ec00
Merge pull request #2879 from BerriAI/litellm_async_anthropic_api
[Feat] Async Anthropic API 97.5% lower median latency
2024-04-07 09:56:52 -07:00
Krrish Dholakia
460546956d fix(utils.py): fix import 2024-04-06 18:37:38 -07:00
Krrish Dholakia
a410981972 fix(utils.py): fix circular import 2024-04-06 18:29:51 -07:00
Ishaan Jaff
7849c29f70 async anthropic streaming 2024-04-06 17:36:56 -07:00
Krrish Dholakia
ead59b7b38 fix(utils.py): add gemini api base support to 'get_api_base' 2024-04-06 16:08:15 -07:00
Krrish Dholakia
6110d32b1c feat(proxy/utils.py): return api base for request hanging alerts 2024-04-06 15:58:53 -07:00
Krrish Dholakia
7b30e5ae38 fix(utils.py): fix content check in pre-call rules 2024-04-06 09:03:19 -07:00
Krrish Dholakia
30f57e7aa5 fix(utils.py): move info statement to debug 2024-04-05 22:06:46 -07:00
Ishaan Jaff
faa0d38087
Merge pull request #2868 from BerriAI/litellm_add_command_r_on_proxy
Add Azure Command-r-plus on litellm proxy
2024-04-05 15:13:47 -07:00
Ishaan Jaff
2174b240d8
Merge pull request #2861 from BerriAI/litellm_add_azure_command_r_plust
[FEAT] add azure command-r-plus
2024-04-05 15:13:35 -07:00
Ishaan Jaff
9055a071e6 proxy - add azure/command r 2024-04-05 14:35:31 -07:00
Krish Dholakia
a50edef1e6
Merge pull request #2856 from lazyhope/anthropic-tools-use-2024-04-04
Support latest Anthropic Tools Use (2024-04-04)
2024-04-05 14:31:26 -07:00
Ishaan Jaff
6b9c04618e fix use azure_ai/mistral 2024-04-05 10:07:43 -07:00
Ishaan Jaff
5ce80d82d3 fix support azure/mistral models 2024-04-05 09:32:39 -07:00
Krrish Dholakia
f0c4ff6e60 fix(vertex_ai_anthropic.py): support streaming, async completion, async streaming for vertex ai anthropic 2024-04-05 09:27:48 -07:00
Ishaan Jaff
71352b1b36 fix add azure/command-r-plus 2024-04-05 08:53:24 -07:00
Zihao Li
d2cf9d2cf1 Move tool definitions from system prompt to parameter and refactor tool calling parse 2024-04-05 16:01:40 +08:00
Ishaan Jaff
9dc4127576 v0 return cache key in responses 2024-04-04 10:11:18 -07:00
Krrish Dholakia
15e0099948 fix(proxy_server.py): return original model response via response headers - /v1/completions
to help devs with debugging
2024-04-03 13:05:43 -07:00
Krrish Dholakia
919ec86b2b fix(openai.py): switch to using openai sdk for text completion calls 2024-04-02 15:08:12 -07:00
Krrish Dholakia
b07788d2a5 fix(openai.py): return logprobs for text completion calls 2024-04-02 14:05:56 -07:00
Krrish Dholakia
0d949d71ab fix(main.py): support text completion input being a list of strings
addresses - https://github.com/BerriAI/litellm/issues/2792, https://github.com/BerriAI/litellm/issues/2777
2024-04-02 08:50:16 -07:00
Sebastián Estévez
e50e76bbd5
support cohere_chat in get_api_key 2024-04-01 13:24:03 -04:00
Ishaan Jaff
c365de122a check num retries in async wrapper 2024-03-30 19:33:40 -07:00
Ishaan Jaff
bd95626579 (fix) improve async perf 2024-03-30 19:07:04 -07:00
Vincelwt
1b84dfac91
Merge branch 'main' into main 2024-03-30 13:21:53 +09:00
Ishaan Jaff
24570bc075 (docs) grafana / prometheus 2024-03-29 14:25:45 -07:00
Krrish Dholakia
41fb76740e fix(utils.py): exception mapping on 'next()' streaming error 2024-03-29 09:18:41 -07:00
Krrish Dholakia
cd53291b62 fix(utils.py): support bedrock mistral streaming 2024-03-29 07:56:10 -07:00
Krrish Dholakia
5d428ac94c fix(utils.py): don't add chatml tokens to a simple text token count 2024-03-28 13:48:48 -07:00
Krrish Dholakia
2926d5a8eb fix(proxy/utils.py): check cache before alerting user 2024-03-27 20:09:15 -07:00
Krrish Dholakia
9b7383ac67 fix(utils.py): don't run post-call rules on a coroutine function 2024-03-27 13:16:27 -07:00
Krish Dholakia
0ab708e6f1
Merge pull request #2704 from BerriAI/litellm_jwt_auth_improvements_3
fix(handle_jwt.py): enable team-based jwt-auth access
2024-03-26 16:06:56 -07:00
Krrish Dholakia
4281f1545b fix(utils.py): check if item in list is pydantic object or dict before dereferencing 2024-03-26 14:39:16 -07:00
Krrish Dholakia
4d7f4550e2 test(test_batch_completions.py): handle anthropic overloaded error 2024-03-26 13:55:03 -07:00
Krrish Dholakia
3a82ff2ef2 fix(utils.py): don't send subsequent chunks if last chunk sent
prevents multiple empty finish chunks from being sent
2024-03-26 13:49:42 -07:00
Ishaan Jaff
da503eab18
Merge branch 'main' into litellm_remove_litellm_telemetry 2024-03-26 11:35:02 -07:00
Ishaan Jaff
6b4b05b58f (fix) remove litellm.telemetry 2024-03-26 11:21:09 -07:00
Krrish Dholakia
584d187e0e fix(utils.py): check if message is pydantic object or dict before dereferencing 2024-03-26 09:47:44 -07:00