Krrish Dholakia
|
1ff0129a94
|
fix(vertex_httpx.py): cover gemini content violation (on prompt)
|
2024-06-24 19:13:56 -07:00 |
|
Ishaan Jaff
|
dd048cf342
|
Merge pull request #4382 from BerriAI/litellm_ui_fix_login_bug
[Fix] - Admin UI login bug
|
2024-06-24 08:19:37 -07:00 |
|
Ishaan Jaff
|
acbcfef5af
|
fix ui login bug
|
2024-06-24 07:54:08 -07:00 |
|
Krish Dholakia
|
0454c0781a
|
Merge branch 'main' into litellm_azure_content_filter_fallbacks
|
2024-06-22 21:28:29 -07:00 |
|
Krrish Dholakia
|
cea630022e
|
fix(add-exception-mapping-+-langfuse-exception-logging-for-streaming-exceptions): add exception mapping + langfuse exception logging for streaming exceptions
Fixes https://github.com/BerriAI/litellm/issues/4338
|
2024-06-22 21:26:15 -07:00 |
|
Krish Dholakia
|
961e7ac95d
|
Merge branch 'main' into litellm_dynamic_tpm_limits
|
2024-06-22 19:14:59 -07:00 |
|
Krrish Dholakia
|
2c7a80d08d
|
fix(router.py): check if azure returns 'content_filter' response + fallback available -> fallback
Exception maps azure content filter response exceptions
|
2024-06-22 19:10:15 -07:00 |
|
Krrish Dholakia
|
bae7377128
|
docs(team_budgets.md): fix script
/
|
2024-06-22 15:42:05 -07:00 |
|
Krrish Dholakia
|
fc4e900a23
|
docs(team_budgets.md): update docs with script for testing dynamic rate limiting
|
2024-06-22 15:26:58 -07:00 |
|
Krrish Dholakia
|
a31a05d45d
|
feat(dynamic_rate_limiter.py): working e2e
|
2024-06-22 14:41:22 -07:00 |
|
Ishaan Jaff
|
ab38a90be0
|
fix /audio/speech
|
2024-06-22 13:45:34 -07:00 |
|
Ishaan Jaff
|
c4f6b903b3
|
ui - new build
|
2024-06-22 09:13:34 -07:00 |
|
Ishaan Jaff
|
c86efa55a5
|
use hash of master key encode the jwt
|
2024-06-22 08:58:31 -07:00 |
|
Ishaan Jaff
|
03d1d9229d
|
ui - use cookies to return JWTs
|
2024-06-22 08:50:26 -07:00 |
|
Krrish Dholakia
|
532f24bfb7
|
refactor: instrument 'dynamic_rate_limiting' callback on proxy
|
2024-06-22 00:32:29 -07:00 |
|
Krish Dholakia
|
f192b76ed4
|
Merge pull request #4344 from BerriAI/litellm_refactor_langfuse_slack_trace_url
refactor(litellm_logging.py): refactors how slack_alerting generates langfuse trace url
|
2024-06-21 23:37:38 -07:00 |
|
Krrish Dholakia
|
068e8dff5b
|
feat(dynamic_rate_limiter.py): passing base case
|
2024-06-21 22:46:46 -07:00 |
|
Ishaan Jaff
|
b4d95e740e
|
fix get_logging_payload
|
2024-06-21 20:42:28 -07:00 |
|
Krrish Dholakia
|
a028600932
|
feat(dynamic_rate_limiter.py): update cache with active project
|
2024-06-21 20:25:40 -07:00 |
|
Krrish Dholakia
|
2545da777b
|
feat(dynamic_rate_limiter.py): initial commit for dynamic rate limiting
Closes https://github.com/BerriAI/litellm/issues/4124
|
2024-06-21 18:41:31 -07:00 |
|
Ishaan Jaff
|
662cb5a5ec
|
ui - new build
|
2024-06-21 18:12:12 -07:00 |
|
Ishaan Jaff
|
29c2155796
|
fix cost tracking by tags
|
2024-06-21 16:49:57 -07:00 |
|
Krrish Dholakia
|
2584120012
|
refactor(litellm_logging.py): refactors how slack_alerting generates langfuse trace url
gets the url from logging object
|
2024-06-21 16:12:25 -07:00 |
|
Ishaan Jaff
|
eb8a9b2654
|
fix - /moderation don't require a model
|
2024-06-21 16:00:43 -07:00 |
|
Ishaan Jaff
|
cc11cedf02
|
Merge pull request #4340 from BerriAI/litellm_show_cache_hit_stats
[Feat] Admin UI - Show Cache hit stats
|
2024-06-21 15:34:11 -07:00 |
|
Krrish Dholakia
|
941574a921
|
fix(proxy/utils.py): fix add langfuse trace id to alert
Fixing the import after refactor
|
2024-06-21 14:55:09 -07:00 |
|
Ishaan Jaff
|
35c07306df
|
show correct key aliases on ui
|
2024-06-21 14:36:38 -07:00 |
|
Krrish Dholakia
|
5912decd19
|
fix(proxy/utils.py): fix bool on check
|
2024-06-21 14:29:38 -07:00 |
|
Ishaan Jaff
|
10e88a939d
|
backend - new endpoint to show cache hit stats
|
2024-06-21 13:08:54 -07:00 |
|
Ishaan Jaff
|
12f4fb3a42
|
fix refactor - add batches endpoints proxy server.py
|
2024-06-21 07:55:53 -07:00 |
|
Ishaan Jaff
|
558c00931e
|
Merge pull request #4322 from BerriAI/litellm_bug_fix_spend_tracking
[Fix] user field and user_api_key_* is sometimes omitted randomly
|
2024-06-20 21:10:13 -07:00 |
|
Krish Dholakia
|
f86290584a
|
Merge pull request #4290 from BerriAI/litellm_specific_deployment
feat(router.py): allow user to call specific deployment via id
|
2024-06-20 20:36:13 -07:00 |
|
Krish Dholakia
|
790d7057cb
|
Merge pull request #4323 from BerriAI/litellm_fix_duration_in_s
fix(key_management_endpoints.py): use common _duration_in_seconds function
|
2024-06-20 20:35:59 -07:00 |
|
Ishaan Jaff
|
aa3c14fa46
|
make sure linting runs proxy_cli.py
|
2024-06-20 20:20:08 -07:00 |
|
Chris Van Pelt
|
306c2b425d
|
Update proxy_cli.py
Fixed indentation to so we don't get an `UnboundLocalError`. Fixes #4324
|
2024-06-20 17:48:16 -07:00 |
|
Ishaan Jaff
|
c4b34946ed
|
fix update_cache function
|
2024-06-20 17:27:25 -07:00 |
|
Krrish Dholakia
|
511c0dc4d7
|
fix(key_management_endpoints.py): use common _duration_in_seconds function
|
2024-06-20 17:02:31 -07:00 |
|
Ishaan Jaff
|
0003de41c1
|
fix don't store bad api key in user api key cache
|
2024-06-20 16:55:12 -07:00 |
|
Krrish Dholakia
|
5729eb5168
|
fix(user_api_key_auth.py): ensure user has access to fallback models
for client side fallbacks, checks if user has access to fallback models
|
2024-06-20 16:02:19 -07:00 |
|
Krrish Dholakia
|
14da2d5ade
|
fix(_logging.py): fix timestamp format for json logs
Outputs timestamp in ISO 8601 format to work with fluentbit
|
2024-06-20 15:20:21 -07:00 |
|
Krrish Dholakia
|
248ee488f0
|
fix(proxy_cli.py): fix double counting json logs
|
2024-06-20 15:15:23 -07:00 |
|
Krrish Dholakia
|
df3afa652f
|
fix(proxy/utils.py): fix linting error
|
2024-06-20 14:13:38 -07:00 |
|
Krrish Dholakia
|
682ec33aa0
|
fix(litellm_logging.py): initialize global variables
Fixes https://github.com/BerriAI/litellm/issues/4281
|
2024-06-19 18:39:45 -07:00 |
|
Krrish Dholakia
|
16889b8478
|
feat(router.py): allow user to call specific deployment via id
Allows easier health checks for specific deployments by just passing in model id
|
2024-06-19 13:02:46 -07:00 |
|
Ishaan Jaff
|
93c5625dc6
|
Merge pull request #4286 from BerriAI/litellm_support_options_health_endpoints
feat - support CURL OPTIONS for `/health/readiness` endpoint
|
2024-06-19 12:25:32 -07:00 |
|
Ishaan Jaff
|
19f0f66dbc
|
add options for /health/readiness and liveliness
|
2024-06-19 12:13:35 -07:00 |
|
Ishaan Jaff
|
4007c6c6e8
|
add health/readiness OPTIONS
|
2024-06-19 12:00:48 -07:00 |
|
Ishaan Jaff
|
8aa1387170
|
feat - support OPTIONS for health endpoints
|
2024-06-19 09:55:38 -07:00 |
|
Krish Dholakia
|
3a3b3667ee
|
Merge branch 'main' into litellm_aws_kms_fixes
|
2024-06-19 09:30:54 -07:00 |
|
Krrish Dholakia
|
164dccbbd4
|
fix(router.py): support adding models across multiple orgs, with 1 model definition
|
2024-06-18 19:57:10 -07:00 |
|