Commit graph

2362 commits

Author SHA1 Message Date
Krrish Dholakia
866259f95f feat(prometheus_services.py): monitor health of proxy adjacent services (redis / postgres / etc.) 2024-04-13 18:15:02 -07:00
Ishaan Jaff
abac496706 (ci/cd) run again 2024-04-12 20:48:26 -07:00
Ishaan Jaff
966356721b fix - test caching atext_completion 2024-04-12 20:37:56 -07:00
Ishaan Jaff
204573f3f2 test - atext_completion + caching 2024-04-12 12:32:21 -07:00
Krrish Dholakia
9d9426c291 test(test_jwt.py): add testing to make sure user api key auth returns the expected values 2024-04-11 20:18:55 -07:00
Krrish Dholakia
0863c10b0b fix(router.py): make sure pre call rpm check runs even when model not in model cost map 2024-04-11 09:27:46 -07:00
Krish Dholakia
e48cc9f1e4 Merge pull request #2942 from BerriAI/litellm_fix_router_loading
Router Async Improvements
2024-04-10 20:16:53 -07:00
Ishaan Jaff
4058327b43 test -base64 cache hits 2024-04-10 16:46:56 -07:00
Krrish Dholakia
f5206d592a fix(router.py): generate consistent model id's
having the same id for a deployment, lets redis usage caching work across multiple instances
2024-04-10 15:23:57 -07:00
Krrish Dholakia
67b3a05e1a test(test_completion.py): handle api instability 2024-04-09 21:58:48 -07:00
Krrish Dholakia
f0d61fd6dd test(test_completion.py): change model 2024-04-09 21:38:17 -07:00
Krish Dholakia
8334e3cd6e Merge pull request #2924 from BerriAI/litellm_text_completion_response
fix(main.py): handle translating text completion openai to chat completion for async requests
2024-04-09 21:13:30 -07:00
Krrish Dholakia
ce9ea28e42 test(test_key_generate_prisma.py): add better unit testing for spend logs on proxy server 2024-04-09 17:58:18 -07:00
Krrish Dholakia
70fd803a6e fix(main.py): handle translating text completion openai to chat completion for async requests
also adds testing for this, to prevent future regressions
2024-04-09 16:47:49 -07:00
Krrish Dholakia
dab219d4d7 test(test_key_generate_prisma.py): fix check for raising error if key generate max budget > key upperbound 2024-04-09 13:07:57 -07:00
Krrish Dholakia
d98c5a1797 test(test_llm_guard.py): fix test 2024-04-08 21:55:35 -07:00
Krrish Dholakia
aa4acaf06b fix(llm_guard.py): enable request-specific llm guard flag 2024-04-08 21:15:33 -07:00
Krrish Dholakia
075c96a408 fix(utils.py): fix reordering of items for cached embeddings
ensures cached embedding item is returned in correct order
2024-04-08 12:18:24 -07:00
Ishaan Jaff
d1d3d932ca Merge pull request #2879 from BerriAI/litellm_async_anthropic_api
[Feat] Async Anthropic API 97.5% lower median latency
2024-04-07 09:56:52 -07:00
Krrish Dholakia
ad409d67f3 test(test_alerting.py): remove duplicate test 2024-04-06 19:33:01 -07:00
Ishaan Jaff
9b8555e482 undo adding next-asyncio 2024-04-06 19:28:51 -07:00
Krish Dholakia
1060244a7f Merge pull request #2880 from BerriAI/litellm_api_base_alerting
feat(proxy/utils.py): return api base for request hanging alerts
2024-04-06 19:17:18 -07:00
Krrish Dholakia
4bb070c0e4 test(test_custom_callback_input.py): fix assert 2024-04-06 18:48:33 -07:00
Krrish Dholakia
0d067fb65d test(test_streaming.py): fix vertex project name 2024-04-06 18:45:17 -07:00
Ishaan Jaff
c37383192f fix - test streaming 2024-04-06 18:28:07 -07:00
Ishaan Jaff
aca692282a (ci/cd) run again 2024-04-06 18:26:52 -07:00
Ishaan Jaff
ccab3cdfb5 Revert "fix test streaming"
This reverts commit f10aba1007.
2024-04-06 18:24:00 -07:00
Ishaan Jaff
9e92b9b8ad fix test streaming 2024-04-06 18:23:10 -07:00
Ishaan Jaff
436133b4b3 ci/cd run again 2024-04-06 18:07:41 -07:00
Krrish Dholakia
a236cd22fc test(test_amazing_vertex_completion.py): fix project name 2024-04-06 17:59:49 -07:00
Ishaan Jaff
3978a17ff9 test - async claude streaming 2024-04-06 17:55:26 -07:00
Krrish Dholakia
0dad78b53c feat(proxy/utils.py): return api base for request hanging alerts 2024-04-06 15:58:53 -07:00
Ishaan Jaff
d23d6068ff Merge pull request #2877 from BerriAI/litellm_fix_text_completion
[Feat] Text-Completion-OpenAI - Re-use OpenAI Client
2024-04-06 12:15:52 -07:00
Krrish Dholakia
37c0c883a8 ci(vertex_key.json): switch gcloud accounts 2024-04-06 12:14:46 -07:00
Ishaan Jaff
ff0b021949 test - setting org for openai text completion 2024-04-06 11:53:26 -07:00
Ishaan Jaff
d8788bbe9b test - router re-use openai client 2024-04-06 11:33:17 -07:00
Krish Dholakia
b8d285d120 Merge pull request #2798 from CLARKBENHAM/main
add test for rate limits - Router isn't coroutine safe
2024-04-06 08:47:40 -07:00
Ishaan Jaff
6757b42ce9 (test) completion 2024-04-05 21:03:04 -07:00
Ishaan Jaff
8436b611b4 (ci/cd) run again 2024-04-05 20:44:51 -07:00
Ishaan Jaff
aa0c01dcf3 fix gemini test 2024-04-05 20:38:28 -07:00
Ishaan Jaff
a27e488248 (ci/cd) run again 2024-04-05 17:26:02 -07:00
Ishaan Jaff
d4ca3f38c7 fix testing yaml 2024-04-05 16:17:53 -07:00
Ishaan Jaff
72fddabf84 Merge pull request #2868 from BerriAI/litellm_add_command_r_on_proxy
Add Azure Command-r-plus on litellm proxy
2024-04-05 15:13:47 -07:00
Ishaan Jaff
f65828db26 Merge pull request #2861 from BerriAI/litellm_add_azure_command_r_plust
[FEAT] add azure command-r-plus
2024-04-05 15:13:35 -07:00
Ishaan Jaff
5c1a662caa proxy - add azure/command r 2024-04-05 14:35:31 -07:00
Krrish Dholakia
6b7d253aa2 test(test_amazing_vertex_completion.py): handle vertex ai rate limit errors 2024-04-05 14:04:29 -07:00
Krrish Dholakia
7a0143a653 fix(test_amazing_vertex_completion.py): handle vertex ai rate limit error 2024-04-05 14:03:38 -07:00
Ishaan Jaff
bfaa2bd3df catch gemini 429 errors 2024-04-05 14:02:01 -07:00
Ishaan Jaff
5e369473d3 fix gemini rate limit errors 2024-04-05 14:01:32 -07:00
Krrish Dholakia
00c0a0c209 fix(vertex_ai_anthropic.py): fix import 2024-04-05 13:38:05 -07:00