Commit graph

1177 commits

Author SHA1 Message Date
Krrish Dholakia
63b6165ea5 fix(utils.py): fix timeout error - don't pass in httpx.request 2024-04-15 10:50:23 -07:00
Krish Dholakia
cfd2bc030f Merge pull request #3028 from BerriAI/litellm_anthropic_text_completion_fix
fix(anthropic_text.py): add support for async text completion calls
2024-04-15 09:26:28 -07:00
Krrish Dholakia
1cd0551a1e fix(anthropic_text.py): add support for async text completion calls 2024-04-15 08:15:00 -07:00
Ishaan Jaff
3c8150914f groq - add tool calling support 2024-04-15 08:09:27 -07:00
Krrish Dholakia
866259f95f feat(prometheus_services.py): monitor health of proxy adjacent services (redis / postgres / etc.) 2024-04-13 18:15:02 -07:00
Ishaan Jaff
7d2215a809 Merge pull request #2991 from BerriAI/litellm_fix_text_completion_caching
[Feat] Support + Test caching for TextCompletion
2024-04-12 20:08:01 -07:00
Ishaan Jaff
41ec025b5c fix - support text completion caching 2024-04-12 12:34:28 -07:00
Krish Dholakia
6dbe2bef9a Merge pull request #2984 from Dev-Khant/slack-msg-truncation
truncate long slack msg
2024-04-12 08:30:08 -07:00
Dev Khant
18eae1facf truncate long slack msg 2024-04-12 17:22:14 +05:30
Krrish Dholakia
ec72202d56 fix(gemini.py): log system prompt in verbose output 2024-04-11 23:15:58 -07:00
Krrish Dholakia
4c0ba026a7 fix(utils.py): vertex ai exception mapping
fixes check which caused all vertex errors to be ratelimit errors
2024-04-11 23:04:21 -07:00
David Manouchehri
cc71ca3166 (feat) - Add support for JSON mode in Vertex AI 2024-04-12 00:03:29 +00:00
Krish Dholakia
e48cc9f1e4 Merge pull request #2942 from BerriAI/litellm_fix_router_loading
Router Async Improvements
2024-04-10 20:16:53 -07:00
Krrish Dholakia
8f06c2d8c4 fix(router.py): fix datetime object 2024-04-10 17:55:24 -07:00
Ishaan Jaff
686810ec00 fix - allow base64 cache hits embedding responses 2024-04-10 16:44:40 -07:00
Krrish Dholakia
06a0ca1e80 fix(proxy_cli.py): don't double load the router config
was causing callbacks to be instantiated twice - double couting usage in cache
2024-04-10 13:23:56 -07:00
Ishaan Jaff
3083326c33 Merge pull request #2893 from unclecode/main
Fix issue #2832: Add protected_namespaces to Config class within utils.py, router.py and completion.py to avoid the warning message.
2024-04-09 08:51:41 -07:00
Krrish Dholakia
075c96a408 fix(utils.py): fix reordering of items for cached embeddings
ensures cached embedding item is returned in correct order
2024-04-08 12:18:24 -07:00
unclecode
311e801ab4 Fix issue #2832: Add protected_namespaces to Config class within utils.py, router.py and completion.py to avoid the warning message. 2024-04-08 12:43:17 +08:00
Ishaan Jaff
d1d3d932ca Merge pull request #2879 from BerriAI/litellm_async_anthropic_api
[Feat] Async Anthropic API 97.5% lower median latency
2024-04-07 09:56:52 -07:00
Krrish Dholakia
fd67dc7556 fix(utils.py): fix import 2024-04-06 18:37:38 -07:00
Krrish Dholakia
179cede5a4 fix(utils.py): fix circular import 2024-04-06 18:29:51 -07:00
Ishaan Jaff
e3c066dcd2 async anthropic streaming 2024-04-06 17:36:56 -07:00
Krrish Dholakia
b145d620e0 fix(utils.py): add gemini api base support to 'get_api_base' 2024-04-06 16:08:15 -07:00
Krrish Dholakia
0dad78b53c feat(proxy/utils.py): return api base for request hanging alerts 2024-04-06 15:58:53 -07:00
Krrish Dholakia
474afae9d0 fix(utils.py): fix content check in pre-call rules 2024-04-06 09:03:19 -07:00
Krrish Dholakia
94957f7cfa fix(utils.py): move info statement to debug 2024-04-05 22:06:46 -07:00
Ishaan Jaff
72fddabf84 Merge pull request #2868 from BerriAI/litellm_add_command_r_on_proxy
Add Azure Command-r-plus on litellm proxy
2024-04-05 15:13:47 -07:00
Ishaan Jaff
f65828db26 Merge pull request #2861 from BerriAI/litellm_add_azure_command_r_plust
[FEAT] add azure command-r-plus
2024-04-05 15:13:35 -07:00
Ishaan Jaff
5c1a662caa proxy - add azure/command r 2024-04-05 14:35:31 -07:00
Krish Dholakia
10951ca2f0 Merge pull request #2856 from lazyhope/anthropic-tools-use-2024-04-04
Support latest Anthropic Tools Use (2024-04-04)
2024-04-05 14:31:26 -07:00
Ishaan Jaff
3ff3947fc0 fix use azure_ai/mistral 2024-04-05 10:07:43 -07:00
Ishaan Jaff
c74d43fcbc fix support azure/mistral models 2024-04-05 09:32:39 -07:00
Krrish Dholakia
a8d0aace83 fix(vertex_ai_anthropic.py): support streaming, async completion, async streaming for vertex ai anthropic 2024-04-05 09:27:48 -07:00
Ishaan Jaff
296f10e2c9 fix add azure/command-r-plus 2024-04-05 08:53:24 -07:00
Zihao Li
4d0975bf4e Move tool definitions from system prompt to parameter and refactor tool calling parse 2024-04-05 16:01:40 +08:00
Ishaan Jaff
d0977d61ff v0 return cache key in responses 2024-04-04 10:11:18 -07:00
Krrish Dholakia
f3fcfe3007 fix(proxy_server.py): return original model response via response headers - /v1/completions
to help devs with debugging
2024-04-03 13:05:43 -07:00
Krrish Dholakia
de3b314620 fix(openai.py): switch to using openai sdk for text completion calls 2024-04-02 15:08:12 -07:00
Krrish Dholakia
ed46af19ec fix(openai.py): return logprobs for text completion calls 2024-04-02 14:05:56 -07:00
Krrish Dholakia
67f62aa53e fix(main.py): support text completion input being a list of strings
addresses - https://github.com/BerriAI/litellm/issues/2792, https://github.com/BerriAI/litellm/issues/2777
2024-04-02 08:50:16 -07:00
Sebastián Estévez
a4c65e6e61 support cohere_chat in get_api_key 2024-04-01 13:24:03 -04:00
Ishaan Jaff
90038c1c86 check num retries in async wrapper 2024-03-30 19:33:40 -07:00
Ishaan Jaff
5cc2086dc1 (fix) improve async perf 2024-03-30 19:07:04 -07:00
Vincelwt
35c279eff8 Merge branch 'main' into main 2024-03-30 13:21:53 +09:00
Ishaan Jaff
41f9622f29 (docs) grafana / prometheus 2024-03-29 14:25:45 -07:00
Krrish Dholakia
ae3510c0d2 fix(utils.py): exception mapping on 'next()' streaming error 2024-03-29 09:18:41 -07:00
Krrish Dholakia
006c20a10f fix(utils.py): support bedrock mistral streaming 2024-03-29 07:56:10 -07:00
Krrish Dholakia
d98e040f36 fix(utils.py): don't add chatml tokens to a simple text token count 2024-03-28 13:48:48 -07:00
Krrish Dholakia
7fe02405e0 fix(proxy/utils.py): check cache before alerting user 2024-03-27 20:09:15 -07:00