2628 commits

Author SHA1 Message Date
Krrish Dholakia
91bbef4bcd fix(predibase.py): support json schema on predibase 2024-06-25 16:03:47 -07:00
Krrish Dholakia
1e51b8894f fix(utils.py): predibase exception mapping - map 424 as a badrequest error 2024-06-25 13:47:38 -07:00
Krrish Dholakia
715153b27b fix(router.py): improve error message returned for fallbacks 2024-06-25 11:27:20 -07:00
Krrish Dholakia
c46b229202 feat(router.py): support mock testing content policy + context window fallbacks 2024-06-25 10:58:19 -07:00
Ishaan Jaff
3bce8a2395 Merge pull request #4364 from BerriAI/litellm_use_tracemalloc_mem_usage
[Feat] add endpoint to debug memory util
2024-06-25 10:32:52 -07:00
Ishaan Jaff
168c7af9da Merge pull request #4395 from BerriAI/litelm_use_spend_calc_aliases
Fix /spend/calculate use model_group_alias when set
2024-06-24 21:15:05 -07:00
Ishaan Jaff
759bcbd4db /spend/calculate use model aliases on this endpoint 2024-06-24 21:14:00 -07:00
Krish Dholakia
f1735ce5e1 Merge pull request #4390 from BerriAI/litellm_langfuse_router_logging
Log rejected router requests to langfuse
2024-06-24 21:13:45 -07:00
Ishaan Jaff
114115a7a1 Merge pull request #4389 from BerriAI/litellm_allow_user_to_define_public_routes
[Feat-Enterprise] - Allow setting custom public routes
2024-06-24 20:23:35 -07:00
Krish Dholakia
e97f9c530c Merge pull request #4392 from BerriAI/litellm_gemini_content_policy_errors
fix(vertex_httpx.py): cover gemini content violation (on prompt)
2024-06-24 20:00:06 -07:00
Ishaan Jaff
90ad55416b fix importing litellm 2024-06-24 19:58:53 -07:00
Ishaan Jaff
cf71e6a255 example cofnig with public routes 2024-06-24 19:51:23 -07:00
Ishaan Jaff
7ea4c7b328 add helper to check route_in_additonal_public_routes 2024-06-24 19:50:35 -07:00
Ishaan Jaff
9acab3dac5 test - spend/calculate endpoints 2024-06-24 19:32:52 -07:00
Krrish Dholakia
62ff12c0b6 fix(vertex_httpx.py): cover gemini content violation (on prompt) 2024-06-24 19:13:56 -07:00
Ishaan Jaff
ff543e3e45 feat - refactor /spend/calculate 2024-06-24 19:05:34 -07:00
Krrish Dholakia
94830ec281 fix(router.py): log rejected router requests to langfuse
Fixes issue where rejected requests weren't being logged
2024-06-24 17:52:01 -07:00
Ishaan Jaff
e0039396fb example config with public routes 2024-06-24 17:46:38 -07:00
Ishaan Jaff
7ffd9ffd0e feat - allow user to define public routes 2024-06-24 17:45:13 -07:00
Krrish Dholakia
ddd32e5341 fix(router.py): use user-defined model_input_tokens for pre-call filter checks 2024-06-24 17:25:26 -07:00
Ishaan Jaff
26790352db Merge pull request #4382 from BerriAI/litellm_ui_fix_login_bug
[Fix] - Admin UI login bug
2024-06-24 08:19:37 -07:00
Ishaan Jaff
f59804d04b fix ui login bug 2024-06-24 07:54:08 -07:00
Krish Dholakia
39c2fe511c Merge branch 'main' into litellm_azure_content_filter_fallbacks 2024-06-22 21:28:29 -07:00
Krrish Dholakia
4055381cdb fix(add-exception-mapping-+-langfuse-exception-logging-for-streaming-exceptions): add exception mapping + langfuse exception logging for streaming exceptions
Fixes https://github.com/BerriAI/litellm/issues/4338
2024-06-22 21:26:15 -07:00
Krish Dholakia
63d0defa6d Merge branch 'main' into litellm_dynamic_tpm_limits 2024-06-22 19:14:59 -07:00
Krrish Dholakia
f9ce6472d7 fix(router.py): check if azure returns 'content_filter' response + fallback available -> fallback
Exception maps azure content filter response exceptions
2024-06-22 19:10:15 -07:00
Ishaan Jaff
016e19270a feat - add debug_utils 2024-06-22 16:12:42 -07:00
Krrish Dholakia
1e4f8744e6 docs(team_budgets.md): fix script
/
2024-06-22 15:42:05 -07:00
Krrish Dholakia
1b2243059c docs(team_budgets.md): update docs with script for testing dynamic rate limiting 2024-06-22 15:26:58 -07:00
Krrish Dholakia
8843b0dc77 feat(dynamic_rate_limiter.py): working e2e 2024-06-22 14:41:22 -07:00
Ishaan Jaff
8786cd1e54 fix /audio/speech 2024-06-22 13:45:34 -07:00
Ishaan Jaff
6310f284fe ui - new build 2024-06-22 09:13:34 -07:00
Ishaan Jaff
0960f8cfd5 use hash of master key encode the jwt 2024-06-22 08:58:31 -07:00
Ishaan Jaff
7065c9838f ui - use cookies to return JWTs 2024-06-22 08:50:26 -07:00
Krrish Dholakia
8f95381276 refactor: instrument 'dynamic_rate_limiting' callback on proxy 2024-06-22 00:32:29 -07:00
Krish Dholakia
186fc867a4 Merge pull request #4344 from BerriAI/litellm_refactor_langfuse_slack_trace_url
refactor(litellm_logging.py): refactors how slack_alerting generates langfuse trace url
2024-06-21 23:37:38 -07:00
Krrish Dholakia
6a7982fa40 feat(dynamic_rate_limiter.py): passing base case 2024-06-21 22:46:46 -07:00
Ishaan Jaff
7468f08ec6 fix get_logging_payload 2024-06-21 20:42:28 -07:00
Krrish Dholakia
0430807178 feat(dynamic_rate_limiter.py): update cache with active project 2024-06-21 20:25:40 -07:00
Krrish Dholakia
89dba82be9 feat(dynamic_rate_limiter.py): initial commit for dynamic rate limiting
Closes https://github.com/BerriAI/litellm/issues/4124
2024-06-21 18:41:31 -07:00
Ishaan Jaff
0e8b28021c ui - new build 2024-06-21 18:12:12 -07:00
Ishaan Jaff
aa3f2b3cf9 fix cost tracking by tags 2024-06-21 16:49:57 -07:00
Krrish Dholakia
c7b06c42b7 refactor(litellm_logging.py): refactors how slack_alerting generates langfuse trace url
gets the url from logging object
2024-06-21 16:12:25 -07:00
Ishaan Jaff
f4db69a275 fix - /moderation don't require a model 2024-06-21 16:00:43 -07:00
Ishaan Jaff
9931a0d082 Merge pull request #4340 from BerriAI/litellm_show_cache_hit_stats
[Feat] Admin UI - Show Cache hit stats
2024-06-21 15:34:11 -07:00
Krrish Dholakia
174b345766 fix(proxy/utils.py): fix add langfuse trace id to alert
Fixing the import after refactor
2024-06-21 14:55:09 -07:00
Ishaan Jaff
8ab5916a0f show correct key aliases on ui 2024-06-21 14:36:38 -07:00
Krrish Dholakia
fb98dd70ce fix(proxy/utils.py): fix bool on check 2024-06-21 14:29:38 -07:00
Ishaan Jaff
7cba8f6de5 backend - new endpoint to show cache hit stats 2024-06-21 13:08:54 -07:00
Ishaan Jaff
5dd5cc7d87 fix refactor - add batches endpoints proxy server.py 2024-06-21 07:55:53 -07:00