Commit graph

2596 commits

Author SHA1 Message Date
Krrish Dholakia
b6cd200676 fix(llm_guard.py): enable request-specific llm guard flag 2024-04-08 21:15:33 -07:00
Krrish Dholakia
48bfc45cb0 fix(utils.py): fix reordering of items for cached embeddings
ensures cached embedding item is returned in correct order
2024-04-08 12:18:24 -07:00
Ishaan Jaff
a5aef6ec00
Merge pull request #2879 from BerriAI/litellm_async_anthropic_api
[Feat] Async Anthropic API 97.5% lower median latency
2024-04-07 09:56:52 -07:00
Krrish Dholakia
6773341a08 test(test_alerting.py): remove duplicate test 2024-04-06 19:33:01 -07:00
Ishaan Jaff
d51e853b60 undo adding next-asyncio 2024-04-06 19:28:51 -07:00
Krish Dholakia
83f608dc5d
Merge pull request #2880 from BerriAI/litellm_api_base_alerting
feat(proxy/utils.py): return api base for request hanging alerts
2024-04-06 19:17:18 -07:00
Krrish Dholakia
6234982fd9 test(test_custom_callback_input.py): fix assert 2024-04-06 18:48:33 -07:00
Krrish Dholakia
5f910c1811 test(test_streaming.py): fix vertex project name 2024-04-06 18:45:17 -07:00
Ishaan Jaff
f08486448c fix - test streaming 2024-04-06 18:28:07 -07:00
Ishaan Jaff
2622f0351b (ci/cd) run again 2024-04-06 18:26:52 -07:00
Ishaan Jaff
4512510d37 Revert "fix test streaming"
This reverts commit f10aba1007.
2024-04-06 18:24:00 -07:00
Ishaan Jaff
f10aba1007 fix test streaming 2024-04-06 18:23:10 -07:00
Ishaan Jaff
9be6b7ec7c ci/cd run again 2024-04-06 18:07:41 -07:00
Krrish Dholakia
a6f987e065 test(test_amazing_vertex_completion.py): fix project name 2024-04-06 17:59:49 -07:00
Ishaan Jaff
548b2b6861 test - async claude streaming 2024-04-06 17:55:26 -07:00
Krrish Dholakia
6110d32b1c feat(proxy/utils.py): return api base for request hanging alerts 2024-04-06 15:58:53 -07:00
Ishaan Jaff
a2c63075ef
Merge pull request #2877 from BerriAI/litellm_fix_text_completion
[Feat] Text-Completion-OpenAI - Re-use OpenAI Client
2024-04-06 12:15:52 -07:00
Krrish Dholakia
b49e47b634 ci(vertex_key.json): switch gcloud accounts 2024-04-06 12:14:46 -07:00
Ishaan Jaff
3b622556be test - setting org for openai text completion 2024-04-06 11:53:26 -07:00
Ishaan Jaff
a55f3cdace test - router re-use openai client 2024-04-06 11:33:17 -07:00
Krish Dholakia
9119858f4a
Merge pull request #2798 from CLARKBENHAM/main
add test for rate limits - Router isn't coroutine safe
2024-04-06 08:47:40 -07:00
Ishaan Jaff
3d298fc549 (test) completion 2024-04-05 21:03:04 -07:00
Ishaan Jaff
83580dc00a (ci/cd) run again 2024-04-05 20:44:51 -07:00
Ishaan Jaff
122dff007b fix gemini test 2024-04-05 20:38:28 -07:00
Ishaan Jaff
7fc416b636 (ci/cd) run again 2024-04-05 17:26:02 -07:00
Ishaan Jaff
fdadeabe79 fix testing yaml 2024-04-05 16:17:53 -07:00
Ishaan Jaff
faa0d38087
Merge pull request #2868 from BerriAI/litellm_add_command_r_on_proxy
Add Azure Command-r-plus on litellm proxy
2024-04-05 15:13:47 -07:00
Ishaan Jaff
2174b240d8
Merge pull request #2861 from BerriAI/litellm_add_azure_command_r_plust
[FEAT] add azure command-r-plus
2024-04-05 15:13:35 -07:00
Ishaan Jaff
9055a071e6 proxy - add azure/command r 2024-04-05 14:35:31 -07:00
Krrish Dholakia
9d7aaa91e0 test(test_amazing_vertex_completion.py): handle vertex ai rate limit errors 2024-04-05 14:04:29 -07:00
Krrish Dholakia
f7537f2cdf fix(test_amazing_vertex_completion.py): handle vertex ai rate limit error 2024-04-05 14:03:38 -07:00
Ishaan Jaff
3dbb547af0 catch gemini 429 errors 2024-04-05 14:02:01 -07:00
Ishaan Jaff
34a9b8648d fix gemini rate limit errors 2024-04-05 14:01:32 -07:00
Krrish Dholakia
f662dac5c9 fix(vertex_ai_anthropic.py): fix import 2024-04-05 13:38:05 -07:00
Ishaan Jaff
3b2c34d587 ci/cd - skip 429 errors from google gemini 2024-04-05 13:36:29 -07:00
Krrish Dholakia
695eb129ad fix(router.py): fix client init for streaming timeouts 2024-04-05 12:30:15 -07:00
Krrish Dholakia
f0c4ff6e60 fix(vertex_ai_anthropic.py): support streaming, async completion, async streaming for vertex ai anthropic 2024-04-05 09:27:48 -07:00
Ishaan Jaff
cfe358abaa simplify calling azure/commmand-r-plus 2024-04-05 09:18:11 -07:00
Ishaan Jaff
1ba1c84d83 skip vertex test for this PR 2024-04-05 09:06:15 -07:00
Ishaan Jaff
5d196ff300 test - azure/command-r-plus 2024-04-05 08:56:05 -07:00
Krish Dholakia
eb34306099
Merge pull request #2665 from BerriAI/litellm_claude_vertex_ai
[WIP] feat(vertex_ai_anthropic.py): Add support for claude 3 on vertex ai
2024-04-05 07:06:04 -07:00
Krish Dholakia
4ce8227e70
Merge pull request #2841 from Manouchehri/nuke-gemini-1.5-pro-vision
Fix: Remove non-existent gemini-1.5-pro-vision model.
2024-04-05 07:03:38 -07:00
Ishaan Jaff
7f50b80917 (ci/cd) aws suspended account 2024-04-04 17:43:16 -07:00
Krrish Dholakia
2472311a3f test(test_caching.py): skip test - aws suspended account
will need to recreate these objects on a new aws account
2024-04-04 15:07:19 -07:00
Krrish Dholakia
2236f283fe fix(router.py): handle id being passed in as int 2024-04-04 14:23:10 -07:00
Krrish Dholakia
2e40ab959d test(test_router.py): fix casting 2024-04-04 13:54:16 -07:00
Krrish Dholakia
c372c873a0 test(test_router.py): fix test to check cast 2024-04-04 13:32:50 -07:00
Krrish Dholakia
b9030be792 test(test_router.py): fix test to check type 2024-04-04 11:45:12 -07:00
David Manouchehri
6044045b91
Fix: Remove non-existent gemini-1.5-pro-vision model.
The gemini-1.5-pro model handles both text and vision.
2024-04-04 17:33:08 +00:00
Krrish Dholakia
c3e3ca9722 test(test_secret_manager.py): skip test - aws account suspended 2024-04-04 10:10:21 -07:00