Dev Khant
|
18eae1facf
|
truncate long slack msg
|
2024-04-12 17:22:14 +05:30 |
|
Mikkel Gravgaard
|
1d18bf2888
|
Use DEBUG level for curl command logging
Currently, the INFO level is used, which can cause excessive logging in production.
|
2024-04-12 11:27:53 +02:00 |
|
Krrish Dholakia
|
ec72202d56
|
fix(gemini.py): log system prompt in verbose output
|
2024-04-11 23:15:58 -07:00 |
|
Krrish Dholakia
|
4c0ba026a7
|
fix(utils.py): vertex ai exception mapping
fixes check which caused all vertex errors to be ratelimit errors
|
2024-04-11 23:04:21 -07:00 |
|
David Manouchehri
|
cc71ca3166
|
(feat) - Add support for JSON mode in Vertex AI
|
2024-04-12 00:03:29 +00:00 |
|
Krish Dholakia
|
e48cc9f1e4
|
Merge pull request #2942 from BerriAI/litellm_fix_router_loading
Router Async Improvements
|
2024-04-10 20:16:53 -07:00 |
|
Krrish Dholakia
|
8f06c2d8c4
|
fix(router.py): fix datetime object
|
2024-04-10 17:55:24 -07:00 |
|
Ishaan Jaff
|
686810ec00
|
fix - allow base64 cache hits embedding responses
|
2024-04-10 16:44:40 -07:00 |
|
Krrish Dholakia
|
06a0ca1e80
|
fix(proxy_cli.py): don't double load the router config
was causing callbacks to be instantiated twice - double counting usage in cache
|
2024-04-10 13:23:56 -07:00 |
|
Ishaan Jaff
|
3083326c33
|
Merge pull request #2893 from unclecode/main
Fix issue #2832: Add protected_namespaces to Config class within utils.py, router.py and completion.py to avoid the warning message.
|
2024-04-09 08:51:41 -07:00 |
|
Krrish Dholakia
|
075c96a408
|
fix(utils.py): fix reordering of items for cached embeddings
ensures cached embedding item is returned in correct order
|
2024-04-08 12:18:24 -07:00 |
|
unclecode
|
311e801ab4
|
Fix issue #2832: Add protected_namespaces to Config class within utils.py, router.py and completion.py to avoid the warning message.
|
2024-04-08 12:43:17 +08:00 |
|
Ishaan Jaff
|
d1d3d932ca
|
Merge pull request #2879 from BerriAI/litellm_async_anthropic_api
[Feat] Async Anthropic API 97.5% lower median latency
|
2024-04-07 09:56:52 -07:00 |
|
Krrish Dholakia
|
fd67dc7556
|
fix(utils.py): fix import
|
2024-04-06 18:37:38 -07:00 |
|
Krrish Dholakia
|
179cede5a4
|
fix(utils.py): fix circular import
|
2024-04-06 18:29:51 -07:00 |
|
Ishaan Jaff
|
e3c066dcd2
|
async anthropic streaming
|
2024-04-06 17:36:56 -07:00 |
|
Krrish Dholakia
|
b145d620e0
|
fix(utils.py): add gemini api base support to 'get_api_base'
|
2024-04-06 16:08:15 -07:00 |
|
Krrish Dholakia
|
0dad78b53c
|
feat(proxy/utils.py): return api base for request hanging alerts
|
2024-04-06 15:58:53 -07:00 |
|
Krrish Dholakia
|
474afae9d0
|
fix(utils.py): fix content check in pre-call rules
|
2024-04-06 09:03:19 -07:00 |
|
Krrish Dholakia
|
94957f7cfa
|
fix(utils.py): move info statement to debug
|
2024-04-05 22:06:46 -07:00 |
|
Ishaan Jaff
|
72fddabf84
|
Merge pull request #2868 from BerriAI/litellm_add_command_r_on_proxy
Add Azure Command-r-plus on litellm proxy
|
2024-04-05 15:13:47 -07:00 |
|
Ishaan Jaff
|
f65828db26
|
Merge pull request #2861 from BerriAI/litellm_add_azure_command_r_plust
[FEAT] add azure command-r-plus
|
2024-04-05 15:13:35 -07:00 |
|
Ishaan Jaff
|
5c1a662caa
|
proxy - add azure/command r
|
2024-04-05 14:35:31 -07:00 |
|
Krish Dholakia
|
10951ca2f0
|
Merge pull request #2856 from lazyhope/anthropic-tools-use-2024-04-04
Support latest Anthropic Tools Use (2024-04-04)
|
2024-04-05 14:31:26 -07:00 |
|
Ishaan Jaff
|
3ff3947fc0
|
fix use azure_ai/mistral
|
2024-04-05 10:07:43 -07:00 |
|
Ishaan Jaff
|
c74d43fcbc
|
fix support azure/mistral models
|
2024-04-05 09:32:39 -07:00 |
|
Krrish Dholakia
|
a8d0aace83
|
fix(vertex_ai_anthropic.py): support streaming, async completion, async streaming for vertex ai anthropic
|
2024-04-05 09:27:48 -07:00 |
|
Ishaan Jaff
|
296f10e2c9
|
fix add azure/command-r-plus
|
2024-04-05 08:53:24 -07:00 |
|
Zihao Li
|
4d0975bf4e
|
Move tool definitions from system prompt to parameter and refactor tool calling parse
|
2024-04-05 16:01:40 +08:00 |
|
Nandesh Guru
|
39360f1605
|
Greenscale Integration
Adding logger for Greenscale
|
2024-04-04 15:38:51 -07:00 |
|
Ishaan Jaff
|
d0977d61ff
|
v0 return cache key in responses
|
2024-04-04 10:11:18 -07:00 |
|
Krrish Dholakia
|
f3fcfe3007
|
fix(proxy_server.py): return original model response via response headers - /v1/completions
to help devs with debugging
|
2024-04-03 13:05:43 -07:00 |
|
Krrish Dholakia
|
de3b314620
|
fix(openai.py): switch to using openai sdk for text completion calls
|
2024-04-02 15:08:12 -07:00 |
|
Krrish Dholakia
|
ed46af19ec
|
fix(openai.py): return logprobs for text completion calls
|
2024-04-02 14:05:56 -07:00 |
|
Krrish Dholakia
|
67f62aa53e
|
fix(main.py): support text completion input being a list of strings
addresses - https://github.com/BerriAI/litellm/issues/2792, https://github.com/BerriAI/litellm/issues/2777
|
2024-04-02 08:50:16 -07:00 |
|
Sebastián Estévez
|
a4c65e6e61
|
support cohere_chat in get_api_key
|
2024-04-01 13:24:03 -04:00 |
|
Ishaan Jaff
|
90038c1c86
|
check num retries in async wrapper
|
2024-03-30 19:33:40 -07:00 |
|
Ishaan Jaff
|
5cc2086dc1
|
(fix) improve async perf
|
2024-03-30 19:07:04 -07:00 |
|
Vincelwt
|
35c279eff8
|
Merge branch 'main' into main
|
2024-03-30 13:21:53 +09:00 |
|
Ishaan Jaff
|
41f9622f29
|
(docs) grafana / prometheus
|
2024-03-29 14:25:45 -07:00 |
|
Krrish Dholakia
|
ae3510c0d2
|
fix(utils.py): exception mapping on 'next()' streaming error
|
2024-03-29 09:18:41 -07:00 |
|
Krrish Dholakia
|
006c20a10f
|
fix(utils.py): support bedrock mistral streaming
|
2024-03-29 07:56:10 -07:00 |
|
Krrish Dholakia
|
d98e040f36
|
fix(utils.py): don't add chatml tokens to a simple text token count
|
2024-03-28 13:48:48 -07:00 |
|
Krrish Dholakia
|
7fe02405e0
|
fix(proxy/utils.py): check cache before alerting user
|
2024-03-27 20:09:15 -07:00 |
|
Krrish Dholakia
|
62e557404c
|
fix(utils.py): don't run post-call rules on a coroutine function
|
2024-03-27 13:16:27 -07:00 |
|
Krish Dholakia
|
d51f12ca44
|
Merge pull request #2704 from BerriAI/litellm_jwt_auth_improvements_3
fix(handle_jwt.py): enable team-based jwt-auth access
|
2024-03-26 16:06:56 -07:00 |
|
Krrish Dholakia
|
4028f935a5
|
fix(utils.py): check if item in list is pydantic object or dict before dereferencing
|
2024-03-26 14:39:16 -07:00 |
|
Krrish Dholakia
|
c21e954c6f
|
test(test_batch_completions.py): handle anthropic overloaded error
|
2024-03-26 13:55:03 -07:00 |
|
Krrish Dholakia
|
05fddcb06b
|
fix(utils.py): don't send subsequent chunks if last chunk sent
prevents multiple empty finish chunks from being sent
|
2024-03-26 13:49:42 -07:00 |
|
Ishaan Jaff
|
f5f7e344c7
|
Merge branch 'main' into litellm_remove_litellm_telemetry
|
2024-03-26 11:35:02 -07:00 |
|