Krrish Dholakia
|
866259f95f
|
feat(prometheus_services.py): monitor health of proxy adjacent services (redis / postgres / etc.)
|
2024-04-13 18:15:02 -07:00 |
|
Ishaan Jaff
|
abac496706
|
(ci/cd) run again
|
2024-04-12 20:48:26 -07:00 |
|
Ishaan Jaff
|
966356721b
|
fix - test caching atext_completion
|
2024-04-12 20:37:56 -07:00 |
|
Ishaan Jaff
|
204573f3f2
|
test - atext_completion + caching
|
2024-04-12 12:32:21 -07:00 |
|
Krrish Dholakia
|
9d9426c291
|
test(test_jwt.py): add testing to make sure user api key auth returns the expected values
|
2024-04-11 20:18:55 -07:00 |
|
Krrish Dholakia
|
0863c10b0b
|
fix(router.py): make sure pre call rpm check runs even when model not in model cost map
|
2024-04-11 09:27:46 -07:00 |
|
Krish Dholakia
|
e48cc9f1e4
|
Merge pull request #2942 from BerriAI/litellm_fix_router_loading
Router Async Improvements
|
2024-04-10 20:16:53 -07:00 |
|
Ishaan Jaff
|
4058327b43
|
test -base64 cache hits
|
2024-04-10 16:46:56 -07:00 |
|
Krrish Dholakia
|
f5206d592a
|
fix(router.py): generate consistent model id's
having the same id for a deployment, lets redis usage caching work across multiple instances
|
2024-04-10 15:23:57 -07:00 |
|
Krrish Dholakia
|
67b3a05e1a
|
test(test_completion.py): handle api instability
|
2024-04-09 21:58:48 -07:00 |
|
Krrish Dholakia
|
f0d61fd6dd
|
test(test_completion.py): change model
|
2024-04-09 21:38:17 -07:00 |
|
Krish Dholakia
|
8334e3cd6e
|
Merge pull request #2924 from BerriAI/litellm_text_completion_response
fix(main.py): handle translating text completion openai to chat completion for async requests
|
2024-04-09 21:13:30 -07:00 |
|
Krrish Dholakia
|
ce9ea28e42
|
test(test_key_generate_prisma.py): add better unit testing for spend logs on proxy server
|
2024-04-09 17:58:18 -07:00 |
|
Krrish Dholakia
|
70fd803a6e
|
fix(main.py): handle translating text completion openai to chat completion for async requests
also adds testing for this, to prevent future regressions
|
2024-04-09 16:47:49 -07:00 |
|
Krrish Dholakia
|
dab219d4d7
|
test(test_key_generate_prisma.py): fix check for raising error if key generate max budget > key upperbound
|
2024-04-09 13:07:57 -07:00 |
|
Krrish Dholakia
|
d98c5a1797
|
test(test_llm_guard.py): fix test
|
2024-04-08 21:55:35 -07:00 |
|
Krrish Dholakia
|
aa4acaf06b
|
fix(llm_guard.py): enable request-specific llm guard flag
|
2024-04-08 21:15:33 -07:00 |
|
Krrish Dholakia
|
075c96a408
|
fix(utils.py): fix reordering of items for cached embeddings
ensures cached embedding item is returned in correct order
|
2024-04-08 12:18:24 -07:00 |
|
Ishaan Jaff
|
d1d3d932ca
|
Merge pull request #2879 from BerriAI/litellm_async_anthropic_api
[Feat] Async Anthropic API 97.5% lower median latency
|
2024-04-07 09:56:52 -07:00 |
|
Krrish Dholakia
|
ad409d67f3
|
test(test_alerting.py): remove duplicate test
|
2024-04-06 19:33:01 -07:00 |
|
Ishaan Jaff
|
9b8555e482
|
undo adding next-asyncio
|
2024-04-06 19:28:51 -07:00 |
|
Krish Dholakia
|
1060244a7f
|
Merge pull request #2880 from BerriAI/litellm_api_base_alerting
feat(proxy/utils.py): return api base for request hanging alerts
|
2024-04-06 19:17:18 -07:00 |
|
Krrish Dholakia
|
4bb070c0e4
|
test(test_custom_callback_input.py): fix assert
|
2024-04-06 18:48:33 -07:00 |
|
Krrish Dholakia
|
0d067fb65d
|
test(test_streaming.py): fix vertex project name
|
2024-04-06 18:45:17 -07:00 |
|
Ishaan Jaff
|
c37383192f
|
fix - test streaming
|
2024-04-06 18:28:07 -07:00 |
|
Ishaan Jaff
|
aca692282a
|
(ci/cd) run again
|
2024-04-06 18:26:52 -07:00 |
|
Ishaan Jaff
|
ccab3cdfb5
|
Revert "fix test streaming"
This reverts commit f10aba1007 .
|
2024-04-06 18:24:00 -07:00 |
|
Ishaan Jaff
|
9e92b9b8ad
|
fix test streaming
|
2024-04-06 18:23:10 -07:00 |
|
Ishaan Jaff
|
436133b4b3
|
ci/cd run again
|
2024-04-06 18:07:41 -07:00 |
|
Krrish Dholakia
|
a236cd22fc
|
test(test_amazing_vertex_completion.py): fix project name
|
2024-04-06 17:59:49 -07:00 |
|
Ishaan Jaff
|
3978a17ff9
|
test - async claude streaming
|
2024-04-06 17:55:26 -07:00 |
|
Krrish Dholakia
|
0dad78b53c
|
feat(proxy/utils.py): return api base for request hanging alerts
|
2024-04-06 15:58:53 -07:00 |
|
Ishaan Jaff
|
d23d6068ff
|
Merge pull request #2877 from BerriAI/litellm_fix_text_completion
[Feat] Text-Completion-OpenAI - Re-use OpenAI Client
|
2024-04-06 12:15:52 -07:00 |
|
Krrish Dholakia
|
37c0c883a8
|
ci(vertex_key.json): switch gcloud accounts
|
2024-04-06 12:14:46 -07:00 |
|
Ishaan Jaff
|
ff0b021949
|
test - setting org for openai text completion
|
2024-04-06 11:53:26 -07:00 |
|
Ishaan Jaff
|
d8788bbe9b
|
test - router re-use openai client
|
2024-04-06 11:33:17 -07:00 |
|
Krish Dholakia
|
b8d285d120
|
Merge pull request #2798 from CLARKBENHAM/main
add test for rate limits - Router isn't coroutine safe
|
2024-04-06 08:47:40 -07:00 |
|
Ishaan Jaff
|
6757b42ce9
|
(test) completion
|
2024-04-05 21:03:04 -07:00 |
|
Ishaan Jaff
|
8436b611b4
|
(ci/cd) run again
|
2024-04-05 20:44:51 -07:00 |
|
Ishaan Jaff
|
aa0c01dcf3
|
fix gemini test
|
2024-04-05 20:38:28 -07:00 |
|
Ishaan Jaff
|
a27e488248
|
(ci/cd) run again
|
2024-04-05 17:26:02 -07:00 |
|
Ishaan Jaff
|
d4ca3f38c7
|
fix testing yaml
|
2024-04-05 16:17:53 -07:00 |
|
Ishaan Jaff
|
72fddabf84
|
Merge pull request #2868 from BerriAI/litellm_add_command_r_on_proxy
Add Azure Command-r-plus on litellm proxy
|
2024-04-05 15:13:47 -07:00 |
|
Ishaan Jaff
|
f65828db26
|
Merge pull request #2861 from BerriAI/litellm_add_azure_command_r_plust
[FEAT] add azure command-r-plus
|
2024-04-05 15:13:35 -07:00 |
|
Ishaan Jaff
|
5c1a662caa
|
proxy - add azure/command r
|
2024-04-05 14:35:31 -07:00 |
|
Krrish Dholakia
|
6b7d253aa2
|
test(test_amazing_vertex_completion.py): handle vertex ai rate limit errors
|
2024-04-05 14:04:29 -07:00 |
|
Krrish Dholakia
|
7a0143a653
|
fix(test_amazing_vertex_completion.py): handle vertex ai rate limit error
|
2024-04-05 14:03:38 -07:00 |
|
Ishaan Jaff
|
bfaa2bd3df
|
catch gemini 429 errors
|
2024-04-05 14:02:01 -07:00 |
|
Ishaan Jaff
|
5e369473d3
|
fix gemini rate limit errors
|
2024-04-05 14:01:32 -07:00 |
|
Krrish Dholakia
|
00c0a0c209
|
fix(vertex_ai_anthropic.py): fix import
|
2024-04-05 13:38:05 -07:00 |
|