Commit graph

1809 commits

Author SHA1 Message Date
Ishaan Jaff
b9f83f48ba
Merge pull request #4364 from BerriAI/litellm_use_tracemalloc_mem_usage
[Feat]  add endpoint to debug memory util
2024-06-25 10:32:52 -07:00
Ishaan Jaff
dd048cf342
Merge pull request #4382 from BerriAI/litellm_ui_fix_login_bug
[Fix] - Admin UI login bug
2024-06-24 08:19:37 -07:00
Ishaan Jaff
acbcfef5af fix ui login bug 2024-06-24 07:54:08 -07:00
Krrish Dholakia
cea630022e fix(add-exception-mapping-+-langfuse-exception-logging-for-streaming-exceptions): add exception mapping + langfuse exception logging for streaming exceptions
Fixes https://github.com/BerriAI/litellm/issues/4338
2024-06-22 21:26:15 -07:00
Krish Dholakia
961e7ac95d
Merge branch 'main' into litellm_dynamic_tpm_limits 2024-06-22 19:14:59 -07:00
Ishaan Jaff
3105eaebc3 feat - add debug_utils 2024-06-22 16:12:42 -07:00
Krrish Dholakia
a31a05d45d feat(dynamic_rate_limiter.py): working e2e 2024-06-22 14:41:22 -07:00
Ishaan Jaff
c86efa55a5 use hash of master key encode the jwt 2024-06-22 08:58:31 -07:00
Ishaan Jaff
03d1d9229d ui - use cookies to return JWTs 2024-06-22 08:50:26 -07:00
Krrish Dholakia
532f24bfb7 refactor: instrument 'dynamic_rate_limiting' callback on proxy 2024-06-22 00:32:29 -07:00
Ishaan Jaff
29c2155796 fix cost tracking by tags 2024-06-21 16:49:57 -07:00
Ishaan Jaff
eb8a9b2654 fix - /moderation don't require a model 2024-06-21 16:00:43 -07:00
Ishaan Jaff
10e88a939d backend - new endpoint to show cache hit stats 2024-06-21 13:08:54 -07:00
Ishaan Jaff
12f4fb3a42 fix refactor - add batches endpoints proxy server.py 2024-06-21 07:55:53 -07:00
Ishaan Jaff
558c00931e
Merge pull request #4322 from BerriAI/litellm_bug_fix_spend_tracking
[Fix]  user field and user_api_key_* is sometimes omitted randomly
2024-06-20 21:10:13 -07:00
Ishaan Jaff
c4b34946ed fix update_cache function 2024-06-20 17:27:25 -07:00
Ishaan Jaff
0003de41c1 fix don't store bad api key in user api key cache 2024-06-20 16:55:12 -07:00
Krrish Dholakia
16889b8478 feat(router.py): allow user to call specific deployment via id
Allows easier health checks for specific deployments by just passing in model id
2024-06-19 13:02:46 -07:00
Krish Dholakia
3a3b3667ee
Merge branch 'main' into litellm_aws_kms_fixes 2024-06-19 09:30:54 -07:00
Krrish Dholakia
164dccbbd4 fix(router.py): support adding models across multiple orgs, with 1 model definition 2024-06-18 19:57:10 -07:00
Krrish Dholakia
6558abf845 fix(proxy_server.py): track team spend for cached team object
fixes issue where team budgets for jwt tokens weren't asserted
2024-06-18 17:10:12 -07:00
John HU
3d16bd6eb8
fix: strip llm credential before returning 2024-06-17 12:14:40 -07:00
Ishaan Jaff
b718e7ec79 fix- refactor proxy place internal user endpoints in sep file 2024-06-15 16:39:06 -07:00
Ishaan Jaff
f84941bdc0
Merge pull request #4217 from BerriAI/litellm_refactor_proxy_server
[Refactor-Proxy] Make proxy_server.py < 10K lines (move management, key, endpoints to their own files)
2024-06-15 16:27:31 -07:00
Ishaan Jaff
ae884f741e fix - non sso ui sign up flow 2024-06-15 15:26:44 -07:00
Ishaan Jaff
0f8237e263 fix - refactor key endpoints 2024-06-15 14:13:32 -07:00
Ishaan Jaff
349724a630 use directories for importing new endpoints 2024-06-15 13:35:21 -07:00
Ishaan Jaff
938cca57d4 refactor health endpoints 2024-06-15 12:39:04 -07:00
Ishaan Jaff
1b271aa95c fix swagger 2024-06-15 12:22:16 -07:00
Ishaan Jaff
b4057dc022 refactor spend endpoints 2024-06-15 12:17:33 -07:00
Ishaan Jaff
23e5f36522 fix test 2024-06-15 11:46:15 -07:00
Ishaan Jaff
f2570fdf00 feat - refactor team endpoints 2024-06-15 11:40:36 -07:00
Ishaan Jaff
da3ae00bd6 fix - proxy refactor user_api_key_auth 2024-06-15 10:33:58 -07:00
Ishaan Jaff
09b1d52ea1 fix send emails to team members 2024-06-14 20:11:16 -07:00
Ishaan Jaff
75d5609705 send team_alias on team budget alerts 2024-06-14 17:00:43 -07:00
Krish Dholakia
e61cd2e1e2
Merge branch 'main' into litellm_redis_cache_usage 2024-06-13 22:07:21 -07:00
Ishaan Jaff
b986aa2846 update swagger for /team endpoints 2024-06-13 20:55:50 -07:00
Ishaan Jaff
b4db497e23 fix /team/update 2024-06-13 20:55:50 -07:00
Ishaan Jaff
8f77cfc013 fix bug when updating team 2024-06-13 20:55:49 -07:00
Ishaan Jaff
21ed23296f fix - clean up swagger spend endpoints 2024-06-13 20:55:49 -07:00
Krrish Dholakia
345094a49d fix(utils.py): check if model info is for model with correct provider
Fixes issue where incorrect pricing was used for custom llm provider
2024-06-13 15:54:24 -07:00
Ishaan Jaff
eeb0e7d2be fix "/global/spend/report", 2024-06-13 08:51:49 -07:00
Ishaan Jaff
8fd162136f feat - litellm_model_cost_map 2024-06-12 20:03:46 -07:00
Ishaan Jaff
8987f16507 fix - sso pass correct role for user 2024-06-12 19:37:58 -07:00
Ishaan Jaff
d380697723 fix - comment on /user/new 2024-06-12 13:16:39 -07:00
Krrish Dholakia
76c9b715f2 fix(parallel_request_limiter.py): use redis cache, if available for rate limiting across instances
Fixes https://github.com/BerriAI/litellm/issues/4148
2024-06-12 10:35:48 -07:00
Ishaan Jaff
6d2134ad2a
Merge pull request #4138 from BerriAI/litellm_log_management_endpoint_logs_otel
[FEAT]  log management endpoint logs to otel
2024-06-11 19:00:13 -07:00
Krrish Dholakia
23f7d06c76 refactor(proxy_server.py): cleanup sensitive key debug log 2024-06-11 18:45:12 -07:00
Krish Dholakia
75b0d7821b
Merge pull request #4139 from BerriAI/litellm_fix_budget_exceeded_error_code
fix(proxy_server.py): use consistent 400-status code error code for exceeded budget errors
2024-06-11 18:36:58 -07:00
Ishaan Jaff
b10ac0d1f3 fix linting error 2024-06-11 18:14:06 -07:00