Ishaan Jaff
7409dcd222
(fix) doc string
2024-03-26 09:25:44 -07:00
Ishaan Jaff
b8af946fb9
(feat) /cache/flushall
2024-03-26 09:18:58 -07:00
Ishaan Jaff
151b717ae2
(feat) support cache flush on redis
2024-03-26 09:12:30 -07:00
Krish Dholakia
f15ba10170
Merge pull request #2687 from BerriAI/litellm_jwt_auth_fixes_2
...
Litellm jwt auth fixes
2024-03-25 13:27:19 -07:00
Krrish Dholakia
93959ab5aa
fix(handle_jwt.py): allow setting proxy admin role string for jwt auth
2024-03-25 12:20:14 -07:00
Ishaan Jaff
734a51c049
(fix) stop using f strings in verbose logger
2024-03-25 10:55:30 -07:00
Ishaan Jaff
5d121a9f3c
(fix) stop using f strings with logger
2024-03-25 10:47:18 -07:00
Ishaan Jaff
dad4bd58bc
(feat) stop eagerly evaluating fstring
2024-03-25 09:01:42 -07:00
Krrish Dholakia
c81c9c2583
fix(proxy_server.py): fix model info check
2024-03-23 15:59:17 -07:00
Krrish Dholakia
d06b9a5a47
fix(proxy_server.py): enable jwt-auth for users
...
allow a user to auth into the proxy via jwt's and call allowed routes
2024-03-22 17:08:10 -07:00
Krrish Dholakia
33964233a5
fix(proxy_server.py): allow user to disable swagger ui docs via env
...
user can disable swagger ui docs by setting 'NO_DOCS="True"' in their env
2024-03-21 17:15:18 -07:00
Krish Dholakia
33a433eb0a
Merge branch 'main' into litellm_llm_api_prompt_injection_check
2024-03-21 09:57:10 -07:00
Ishaan Jaff
bcd62034ed
Merge pull request #2563 from eltociear/patch-2
...
Update proxy_server.py
2024-03-21 07:29:33 -07:00
Krrish Dholakia
d91f9a9f50
feat(proxy_server.py): enable llm api based prompt injection checks
...
run user calls through an llm api to check for prompt injection attacks. This happens in parallel to th
e actual llm call using `async_moderation_hook`
2024-03-20 22:43:42 -07:00
Krish Dholakia
007d439017
Merge pull request #2606 from BerriAI/litellm_jwt_auth_updates
...
fix(handle_jwt.py): track spend for user using jwt auth
2024-03-20 19:40:17 -07:00
Krrish Dholakia
f24d3ffdb6
fix(proxy_server.py): fix import
2024-03-20 19:15:06 -07:00
Krrish Dholakia
8bb00c4ae8
fix(caching.py): enable async setting of cache for dual cache
2024-03-20 18:42:34 -07:00
Krrish Dholakia
90e17b5422
fix(handle_jwt.py): track spend for user using jwt auth
2024-03-20 10:55:52 -07:00
Ishaan Jaff
4ed551dc52
(feat) better debugging for /cache/ping
2024-03-20 08:30:11 -07:00
Ishaan Jaff
2256ece5a9
(feat) litellm cache ping
2024-03-20 08:24:13 -07:00
Ishaan Jaff
8f750b71eb
(fix) caching - don't require cache password
2024-03-19 20:50:16 -07:00
Krrish Dholakia
f25b03326b
fix(proxy_server.py): allow user to disable scheduled reset budget task
2024-03-19 20:36:22 -07:00
Krrish Dholakia
2dfdc8dd69
Revert "Merge pull request #2593 from BerriAI/litellm_reset_budget_fix"
...
This reverts commit afd363129f
, reversing
changes made to c94bc94ad5
.
2024-03-19 20:25:41 -07:00
Krish Dholakia
afd363129f
Merge pull request #2593 from BerriAI/litellm_reset_budget_fix
...
fix(proxy/utils.py): fix reset budget logic
2024-03-19 20:17:03 -07:00
Krrish Dholakia
37795c0d92
fix(proxy_server.py): add more debug logs
2024-03-19 19:59:43 -07:00
Krrish Dholakia
f6de3a0359
fix: better debug logs
2024-03-19 19:28:26 -07:00
Ishaan Jaff
c94bc94ad5
Merge pull request #2591 from BerriAI/litellm_metrics_endpoint
...
[Feat] /metrics endpoint for Prometheus, Grafana
2024-03-19 18:08:22 -07:00
Ishaan Jaff
aa1c480452
(feat) using prom litellm
2024-03-19 15:49:23 -07:00
Krrish Dholakia
302bab6f1f
feat(handle_jwt.py): support authenticating admins into the proxy via jwt's
2024-03-19 15:00:27 -07:00
Ishaan Jaff
4b7e102187
(v0) prometheus metric
2024-03-19 14:48:38 -07:00
Krrish Dholakia
7c74a0e6e2
fix(proxy_server.py): expose disable_spend_logs
flag in config general settings
...
Writing each spend log adds +300ms latency
https://github.com/BerriAI/litellm/issues/1714#issuecomment-1924727281
2024-03-19 12:08:37 -07:00
Krish Dholakia
c4dbd0407e
Merge pull request #2561 from BerriAI/litellm_batch_writing_db
...
fix(proxy/utils.py): move to batch writing db updates
2024-03-18 21:50:47 -07:00
Krrish Dholakia
7eaddaef10
refactor(proxy_server.py): re-add custom db client logic - prevent regressions
2024-03-18 21:16:28 -07:00
Ishaan Jaff
51d658e878
(fix) if litellm-proxy-budget set use it
2024-03-18 20:31:23 -07:00
Krrish Dholakia
f588bff69b
fix(proxy_server.py): fix spend log update
2024-03-18 20:26:28 -07:00
Ishaan Jaff
87dd3f1235
(fix) show global spend on UI
2024-03-18 18:15:08 -07:00
Krrish Dholakia
8fefe625d9
fix(proxy/utils.py): batch writing updates to db
2024-03-18 16:47:02 -07:00
Krrish Dholakia
f0434350f1
fix(proxy_server.py): don't override cache params on proxy config if set
2024-03-18 12:11:30 -07:00
Krrish Dholakia
8619499853
fix(proxy_server.py): ignore cache if value is false
2024-03-18 11:15:32 -07:00
Ikko Eltociear Ashimine
0b561764b0
Update proxy_server.py
...
intialize -> initialize
2024-03-18 01:28:12 +09:00
Krrish Dholakia
077b9c6234
fix(proxy/utils.py): move to batch writing db updates
2024-03-16 22:32:00 -07:00
Ishaan Jaff
4e33aac997
Merge pull request #2560 from BerriAI/litellm_view_team_based_spend
...
Admin UI - view team based spend
2024-03-16 19:38:17 -07:00
Ishaan Jaff
0d349d389a
(feat) new ui build
2024-03-16 19:35:00 -07:00
Ishaan Jaff
902337e28a
(feat) view team based spend
2024-03-16 19:06:16 -07:00
Krish Dholakia
e55a8c3570
Merge pull request #2556 from BerriAI/litellm_aws_secret_manager_support
...
fix(utils.py): initial commit for aws secret manager support
2024-03-16 18:41:58 -07:00
Krrish Dholakia
bc66ef9d5c
fix(utils.py): fix aws secret manager + support key_management_settings
...
fixes the aws secret manager implementation and allows the user to set which keys they want to check thr
ough it
2024-03-16 16:47:50 -07:00
Ishaan Jaff
ac6c69ff89
Merge pull request #2559 from BerriAI/litellm_show_spend_correctly
...
(fix) admin ui - order spend by date
2024-03-16 16:43:13 -07:00
Ishaan Jaff
e3582d86c6
Merge pull request #2557 from BerriAI/litellm_clean_up_health_readiness
...
(fix) /health/readiness return success callback names as (str)
2024-03-16 16:38:01 -07:00
Ishaan Jaff
2355e9dc51
(fix) admin ui - order spend by date
2024-03-16 16:15:10 -07:00
Ishaan Jaff
cb4c36b7f6
(fix) /health readiness return callback names
2024-03-16 15:56:07 -07:00