Commit graph

2687 commits

Author SHA1 Message Date
Ishaan Jaff
29c2155796 fix cost tracking by tags 2024-06-21 16:49:57 -07:00
Krrish Dholakia
2584120012 refactor(litellm_logging.py): refactors how slack_alerting generates langfuse trace url
gets the url from logging object
2024-06-21 16:12:25 -07:00
Ishaan Jaff
eb8a9b2654 fix - /moderation don't require a model 2024-06-21 16:00:43 -07:00
Ishaan Jaff
cc11cedf02 Merge pull request #4340 from BerriAI/litellm_show_cache_hit_stats
[Feat] Admin UI - Show Cache hit stats
2024-06-21 15:34:11 -07:00
Krrish Dholakia
941574a921 fix(proxy/utils.py): fix add langfuse trace id to alert
Fixing the import after refactor
2024-06-21 14:55:09 -07:00
Ishaan Jaff
35c07306df show correct key aliases on ui 2024-06-21 14:36:38 -07:00
Krrish Dholakia
5912decd19 fix(proxy/utils.py): fix bool on check 2024-06-21 14:29:38 -07:00
Ishaan Jaff
10e88a939d backend - new endpoint to show cache hit stats 2024-06-21 13:08:54 -07:00
Ishaan Jaff
12f4fb3a42 fix refactor - add batches endpoints proxy server.py 2024-06-21 07:55:53 -07:00
Ishaan Jaff
558c00931e Merge pull request #4322 from BerriAI/litellm_bug_fix_spend_tracking
[Fix] user field and user_api_key_* is sometimes omitted randomly
2024-06-20 21:10:13 -07:00
Krish Dholakia
f86290584a Merge pull request #4290 from BerriAI/litellm_specific_deployment
feat(router.py): allow user to call specific deployment via id
2024-06-20 20:36:13 -07:00
Krish Dholakia
790d7057cb Merge pull request #4323 from BerriAI/litellm_fix_duration_in_s
fix(key_management_endpoints.py): use common _duration_in_seconds function
2024-06-20 20:35:59 -07:00
Ishaan Jaff
aa3c14fa46 make sure linting runs proxy_cli.py 2024-06-20 20:20:08 -07:00
Chris Van Pelt
306c2b425d Update proxy_cli.py
Fixed indentation so we don't get an `UnboundLocalError`. Fixes #4324
2024-06-20 17:48:16 -07:00
Ishaan Jaff
c4b34946ed fix update_cache function 2024-06-20 17:27:25 -07:00
Krrish Dholakia
511c0dc4d7 fix(key_management_endpoints.py): use common _duration_in_seconds function 2024-06-20 17:02:31 -07:00
Ishaan Jaff
0003de41c1 fix don't store bad api key in user api key cache 2024-06-20 16:55:12 -07:00
Krrish Dholakia
5729eb5168 fix(user_api_key_auth.py): ensure user has access to fallback models
for client side fallbacks, checks if user has access to fallback models
2024-06-20 16:02:19 -07:00
Krrish Dholakia
14da2d5ade fix(_logging.py): fix timestamp format for json logs
Outputs timestamp in ISO 8601 format to work with fluentbit
2024-06-20 15:20:21 -07:00
Krrish Dholakia
248ee488f0 fix(proxy_cli.py): fix double counting json logs 2024-06-20 15:15:23 -07:00
Krrish Dholakia
df3afa652f fix(proxy/utils.py): fix linting error 2024-06-20 14:13:38 -07:00
Krrish Dholakia
682ec33aa0 fix(litellm_logging.py): initialize global variables
Fixes https://github.com/BerriAI/litellm/issues/4281
2024-06-19 18:39:45 -07:00
Krrish Dholakia
16889b8478 feat(router.py): allow user to call specific deployment via id
Allows easier health checks for specific deployments by just passing in model id
2024-06-19 13:02:46 -07:00
Ishaan Jaff
93c5625dc6 Merge pull request #4286 from BerriAI/litellm_support_options_health_endpoints
feat - support CURL OPTIONS for `/health/readiness` endpoint
2024-06-19 12:25:32 -07:00
Ishaan Jaff
19f0f66dbc add options for /health/readiness and liveliness 2024-06-19 12:13:35 -07:00
Ishaan Jaff
4007c6c6e8 add health/readiness OPTIONS 2024-06-19 12:00:48 -07:00
Ishaan Jaff
8aa1387170 feat - support OPTIONS for health endpoints 2024-06-19 09:55:38 -07:00
Krish Dholakia
3a3b3667ee Merge branch 'main' into litellm_aws_kms_fixes 2024-06-19 09:30:54 -07:00
Krrish Dholakia
164dccbbd4 fix(router.py): support adding models across multiple orgs, with 1 model definition 2024-06-18 19:57:10 -07:00
Krrish Dholakia
6558abf845 fix(proxy_server.py): track team spend for cached team object
fixes issue where team budgets for jwt tokens weren't asserted
2024-06-18 17:10:12 -07:00
Ishaan Jaff
5e2af8236a fix - thread create endpoints 2024-06-18 07:54:47 -07:00
Ishaan Jaff
116fab526d Merge pull request #4254 from BerriAI/litellm_refactor_logfire
[Fix] Refactor Logfire to use LiteLLM OTEL Class
2024-06-17 19:49:36 -07:00
Krrish Dholakia
4cddd1eb31 fix(_logging.py): add loglevel and timestamp to json logs
Fixes https://github.com/BerriAI/litellm/issues/4248
2024-06-17 18:57:20 -07:00
Krrish Dholakia
469fdc0fc6 fix(proxy/_types.py): allow internal user to access sso routes 2024-06-17 18:34:37 -07:00
Ishaan Jaff
18f1a42c5a fix - refactor logfire 2024-06-17 17:21:43 -07:00
John HU
3d16bd6eb8 fix: strip llm credential before returning 2024-06-17 12:14:40 -07:00
Krrish Dholakia
2623bb260f fix(litellm_logging.py): fix lago callback logic 2024-06-17 09:10:19 -07:00
Ishaan Jaff
b718e7ec79 fix- refactor proxy place internal user endpoints in sep file 2024-06-15 16:39:06 -07:00
Ishaan Jaff
f84941bdc0 Merge pull request #4217 from BerriAI/litellm_refactor_proxy_server
[Refactor-Proxy] Make proxy_server.py < 10K lines (move management, key, endpoints to their own files)
2024-06-15 16:27:31 -07:00
Ishaan Jaff
dc72545396 doc - add comment explaining user_api_key_auth file 2024-06-15 16:08:17 -07:00
Ishaan Jaff
5746a6b9bf Merge pull request #4220 from BerriAI/litellm_admin_ui_sign_up_flow
fix - non sso ui sign up flow
2024-06-15 15:55:21 -07:00
Ishaan Jaff
70e3295ead ui - new build 2024-06-15 15:53:46 -07:00
Ishaan Jaff
ae884f741e fix - non sso ui sign up flow 2024-06-15 15:26:44 -07:00
Krish Dholakia
f2b3770869 Merge pull request #4216 from BerriAI/litellm_refactor_logging
refactor(utils.py): Cut down utils.py to <10k lines.
2024-06-15 15:19:42 -07:00
Ishaan Jaff
0f8237e263 fix - refactor key endpoints 2024-06-15 14:13:32 -07:00
Ishaan Jaff
349724a630 use directories for importing new endpoints 2024-06-15 13:35:21 -07:00
Ishaan Jaff
938cca57d4 refactor health endpoints 2024-06-15 12:39:04 -07:00
Ishaan Jaff
1b271aa95c fix swagger 2024-06-15 12:22:16 -07:00
Ishaan Jaff
b4057dc022 refactor spend endpoints 2024-06-15 12:17:33 -07:00
Krrish Dholakia
af88e55d46 fix(utils.py): move 'set_callbacks' to litellm_logging.py 2024-06-15 12:02:30 -07:00