Commit graph

2623 commits

Author SHA1 Message Date
Chen Wang
ebc889d77a Use max_input_token for trim_messages 2024-04-16 13:36:25 +01:00
Krrish Dholakia
40d9167cae test(test_amazing_vertex_completion.py): fix vertex anthropic streaming text 2024-04-15 14:37:22 -07:00
Krrish Dholakia
1ec7118e1f fix(vertex_ai_anthropic.py): set vertex_credentials for vertex ai anthropic calls
allows setting vertex credentials as a json string for vertex ai anthropic calls
2024-04-15 14:16:28 -07:00
Krrish Dholakia
50081479f9 fix(main.py): accept vertex service account credentials as json string
allows us to dynamically set vertex ai credentials
2024-04-15 13:28:59 -07:00
Krish Dholakia
72b54eaad7 Merge pull request #3028 from BerriAI/litellm_anthropic_text_completion_fix
fix(anthropic_text.py): add support for async text completion calls
2024-04-15 09:26:28 -07:00
Ishaan Jaff
c8832489d6 test groq tool calling 2024-04-15 08:41:53 -07:00
Ishaan Jaff
ea575ef62d fix test groq function call 2024-04-15 08:40:39 -07:00
Krrish Dholakia
26286a54b8 fix(anthropic_text.py): add support for async text completion calls 2024-04-15 08:15:00 -07:00
Ishaan Jaff
017127a704 test - groq tool calling 2024-04-15 08:13:05 -07:00
Stefan Dobrev
1cfea955aa Fix Anthropic system message handling 2024-04-14 16:23:59 +03:00
Krrish Dholakia
bef24cd4ab fix(caching.py): don't decode a string 2024-04-13 18:48:03 -07:00
Krrish Dholakia
4e81acf2c6 feat(prometheus_services.py): monitor health of proxy adjacent services (redis / postgres / etc.) 2024-04-13 18:15:02 -07:00
Ishaan Jaff
5856ec03c6 (ci/cd) run again 2024-04-12 20:48:26 -07:00
Ishaan Jaff
f617f5ebb5 fix - test caching atext_completion 2024-04-12 20:37:56 -07:00
Ishaan Jaff
11cd1ec6cf test - atext_completion + caching 2024-04-12 12:32:21 -07:00
Krrish Dholakia
77097f8e7d test(test_jwt.py): add testing to make sure user api key auth returns the expected values 2024-04-11 20:18:55 -07:00
Krrish Dholakia
84d43484c6 fix(router.py): make sure pre call rpm check runs even when model not in model cost map 2024-04-11 09:27:46 -07:00
Krish Dholakia
83e7ed94ce Merge pull request #2942 from BerriAI/litellm_fix_router_loading
Router Async Improvements
2024-04-10 20:16:53 -07:00
Ishaan Jaff
8bc02b34c2 test -base64 cache hits 2024-04-10 16:46:56 -07:00
Krrish Dholakia
a47a719caa fix(router.py): generate consistent model id's
having the same id for a deployment, lets redis usage caching work across multiple instances
2024-04-10 15:23:57 -07:00
Krrish Dholakia
a311788f0d test(test_completion.py): handle api instability 2024-04-09 21:58:48 -07:00
Krrish Dholakia
a6b004f10b test(test_completion.py): change model 2024-04-09 21:38:17 -07:00
Krish Dholakia
2fa47cf416 Merge pull request #2924 from BerriAI/litellm_text_completion_response
fix(main.py): handle translating text completion openai to chat completion for async requests
2024-04-09 21:13:30 -07:00
Krrish Dholakia
8857c9b978 test(test_key_generate_prisma.py): add better unit testing for spend logs on proxy server 2024-04-09 17:58:18 -07:00
Krrish Dholakia
855e7ed9d2 fix(main.py): handle translating text completion openai to chat completion for async requests
also adds testing for this, to prevent future regressions
2024-04-09 16:47:49 -07:00
Krrish Dholakia
4495f6c058 test(test_key_generate_prisma.py): fix check for raising error if key generate max budget > key upperbound 2024-04-09 13:07:57 -07:00
Krrish Dholakia
ef4c85522c test(test_llm_guard.py): fix test 2024-04-08 21:55:35 -07:00
Krrish Dholakia
b6cd200676 fix(llm_guard.py): enable request-specific llm guard flag 2024-04-08 21:15:33 -07:00
Krrish Dholakia
48bfc45cb0 fix(utils.py): fix reordering of items for cached embeddings
ensures cached embedding item is returned in correct order
2024-04-08 12:18:24 -07:00
Ishaan Jaff
a5aef6ec00 Merge pull request #2879 from BerriAI/litellm_async_anthropic_api
[Feat] Async Anthropic API 97.5% lower median latency
2024-04-07 09:56:52 -07:00
Krrish Dholakia
6773341a08 test(test_alerting.py): remove duplicate test 2024-04-06 19:33:01 -07:00
Ishaan Jaff
d51e853b60 undo adding next-asyncio 2024-04-06 19:28:51 -07:00
Krish Dholakia
83f608dc5d Merge pull request #2880 from BerriAI/litellm_api_base_alerting
feat(proxy/utils.py): return api base for request hanging alerts
2024-04-06 19:17:18 -07:00
Krrish Dholakia
6234982fd9 test(test_custom_callback_input.py): fix assert 2024-04-06 18:48:33 -07:00
Krrish Dholakia
5f910c1811 test(test_streaming.py): fix vertex project name 2024-04-06 18:45:17 -07:00
Ishaan Jaff
f08486448c fix - test streaming 2024-04-06 18:28:07 -07:00
Ishaan Jaff
2622f0351b (ci/cd) run again 2024-04-06 18:26:52 -07:00
Ishaan Jaff
4512510d37 Revert "fix test streaming"
This reverts commit f10aba1007.
2024-04-06 18:24:00 -07:00
Ishaan Jaff
f10aba1007 fix test streaming 2024-04-06 18:23:10 -07:00
Ishaan Jaff
9be6b7ec7c ci/cd run again 2024-04-06 18:07:41 -07:00
Krrish Dholakia
a6f987e065 test(test_amazing_vertex_completion.py): fix project name 2024-04-06 17:59:49 -07:00
Ishaan Jaff
548b2b6861 test - async claude streaming 2024-04-06 17:55:26 -07:00
Krrish Dholakia
6110d32b1c feat(proxy/utils.py): return api base for request hanging alerts 2024-04-06 15:58:53 -07:00
Ishaan Jaff
a2c63075ef Merge pull request #2877 from BerriAI/litellm_fix_text_completion
[Feat] Text-Completion-OpenAI - Re-use OpenAI Client
2024-04-06 12:15:52 -07:00
Krrish Dholakia
b49e47b634 ci(vertex_key.json): switch gcloud accounts 2024-04-06 12:14:46 -07:00
Ishaan Jaff
3b622556be test - setting org for openai text completion 2024-04-06 11:53:26 -07:00
Ishaan Jaff
a55f3cdace test - router re-use openai client 2024-04-06 11:33:17 -07:00
Krish Dholakia
9119858f4a Merge pull request #2798 from CLARKBENHAM/main
add test for rate limits - Router isn't coroutine safe
2024-04-06 08:47:40 -07:00
Ishaan Jaff
3d298fc549 (test) completion 2024-04-05 21:03:04 -07:00
Ishaan Jaff
83580dc00a (ci/cd) run again 2024-04-05 20:44:51 -07:00