Krish Dholakia
713d762411
LiteLLM Minor Fixes and Improvements (09/13/2024) (#5689)
* refactor: cleanup unused variables + fix pyright errors
* feat(health_check.py): Closes https://github.com/BerriAI/litellm/issues/5686
* fix(o1_reasoning.py): add stricter check for o-1 reasoning model
* refactor(mistral/): make it easier to see mistral transformation logic
* fix(openai.py): fix openai o-1 model param mapping
Fixes https://github.com/BerriAI/litellm/issues/5685
* feat(main.py): infer finetuned gemini model from base model
Fixes https://github.com/BerriAI/litellm/issues/5678
* docs(vertex.md): update docs to call finetuned gemini models
* feat(proxy_server.py): allow admin to hide proxy model aliases
Closes https://github.com/BerriAI/litellm/issues/5692
* docs(load_balancing.md): add docs on hiding alias models from proxy config
* fix(base.py): don't raise notimplemented error
* fix(user_api_key_auth.py): fix model max budget check
* fix(router.py): fix elif
* fix(user_api_key_auth.py): don't set team_id to empty str
* fix(team_endpoints.py): fix response type
* test(test_completion.py): handle predibase error
* test(test_proxy_server.py): fix test
* fix(o1_transformation.py): fix max_completion_token mapping
* test(test_image_generation.py): mark flaky test
2024-09-14 10:02:55 -07:00
Krish Dholakia
91c918fd70
LiteLLM Minor Fixes and Improvements (09/12/2024) (#5658)
* fix(factory.py): handle tool call content as list
Fixes https://github.com/BerriAI/litellm/issues/5652
* fix(factory.py): enforce stronger typing
* fix(router.py): return model alias in /v1/model/info and /v1/model_group/info
* fix(user_api_key_auth.py): move noisy warning message to debug
cleanup logs
* fix(types.py): cleanup pydantic v2 deprecated param
Fixes https://github.com/BerriAI/litellm/issues/5649
* docs(gemini.md): show how to pass inline data to gemini api
Fixes https://github.com/BerriAI/litellm/issues/5674
2024-09-12 23:04:06 -07:00
Krish Dholakia
dec53961f7
LiteLLM Minor Fixes and Improvements (11/09/2024) (#5634)
* fix(caching.py): set ttl for async_increment cache
fixes issue where ttl for redis client was not being set on increment_cache
Fixes https://github.com/BerriAI/litellm/issues/5609
* fix(caching.py): fix increment cache w/ ttl for sync increment cache on redis
Fixes https://github.com/BerriAI/litellm/issues/5609
* fix(router.py): support adding retry policy + allowed fails policy via config.yaml
* fix(router.py): don't cooldown single deployments
No point, as there's no other deployment to loadbalance with.
* fix(user_api_key_auth.py): support setting allowed email domains on jwt tokens
Closes https://github.com/BerriAI/litellm/issues/5605
* docs(token_auth.md): add user upsert + allowed email domain to jwt auth docs
* fix(litellm_pre_call_utils.py): fix dynamic key logging when team id is set
Fixes issue where key logging would not be set if team metadata was not none
* fix(secret_managers/main.py): load environment variables correctly
Fixes issue where os.environ/ was not being loaded correctly
* test(test_router.py): fix test
* feat(spend_tracking_utils.py): support logging additional usage params - e.g. prompt caching values for deepseek
* test: fix tests
* test: fix test
* test: fix test
* test: fix test
* test: fix test
2024-09-11 22:36:06 -07:00
Krish Dholakia
6f354ecac6
fix(pass_through_endpoints): support bedrock agents via pass through (#5527)
2024-09-04 22:22:22 -07:00
Krish Dholakia
6fdee99632
LiteLLM Minor fixes + improvements (08/04/2024) (#5505)
* Minor IAM AWS OIDC Improvements (#5246)
* AWS IAM: Temporary tokens are valid across all regions after being issued, so it is wasteful to request one for each region.
* AWS IAM: Include an inline policy, to help reduce misuse of overly permissive IAM roles.
* (test_bedrock_completion.py): Ensure we are testing cross AWS region OIDC flow.
* fix(router.py): log rejected requests
Fixes https://github.com/BerriAI/litellm/issues/5498
* refactor: don't use verbose_logger.exception, if exception is raised
User might already have handling for this. But alerting systems in prod will raise this as an unhandled error.
* fix(datadog.py): support setting datadog source as an env var
Fixes https://github.com/BerriAI/litellm/issues/5508
* docs(logging.md): add dd_source to datadog docs
* fix(proxy_server.py): expose `/customer/list` endpoint for showing all customers
* (bedrock): Fix usage with Cloudflare AI Gateway, and proxies in general. (#5509)
* feat(anthropic.py): support 'cache_control' param for content when it is a string
* Revert "(bedrock): Fix usage with Cloudflare AI Gateway, and proxies in gener…" (#5519)
This reverts commit 3fac0349c2.
* refactor: ci/cd run again
---------
Co-authored-by: David Manouchehri <david.manouchehri@ai.moda>
2024-09-04 22:16:55 -07:00
Ishaan Jaff
bfb0aceeae
add check for admin only routes
2024-09-03 15:03:32 -07:00
Ishaan Jaff
cf66ca89b9
allow setting allowed routes on proxy
2024-09-03 13:59:31 -07:00
Ishaan Jaff
c30fd9a775
fix auth checks for provider routes
2024-08-29 16:40:46 -07:00
Ishaan Jaff
a27cf9960b
Merge pull request #5420 from BerriAI/litellm_add_oauth2_mapping
[Feat-Proxy] Add hook for oauth2 proxy headers
2024-08-28 16:34:26 -07:00
Ishaan Jaff
f2c0e4d31e
add hook for oauth2 proxy
2024-08-28 15:51:20 -07:00
Ishaan Jaff
51d20628cd
add checks for safe request body
2024-08-28 14:07:53 -07:00
Ishaan Jaff
d4db993130
add /rerank test
2024-08-27 17:50:37 -07:00
Krish Dholakia
a583b95d85
Merge pull request #5308 from BerriAI/litellm_team_admin_permissions
feat(user_api_key_auth.py): allow team admin to add new members to team
2024-08-21 14:21:22 -07:00
Krrish Dholakia
e32a68c94b
refactor(team_endpoints.py): refactor auth checks for team member endpoints to allow ui team admin to manage it
2024-08-20 16:57:18 -07:00
Krrish Dholakia
64affd0d6b
feat(user_api_key_auth.py): allow team admin to add new members to team
2024-08-20 14:01:12 -07:00
Krish Dholakia
e49e454929
Merge pull request #5287 from BerriAI/litellm_fix_response_cost_cal
fix(cost_calculator.py): only override base model if custom pricing is set
2024-08-20 11:42:48 -07:00
Ishaan Jaff
6d0f96f437
fix _get_request_ip_address
2024-08-19 21:08:27 -07:00
Krrish Dholakia
b26f717c70
fix(user_api_key_auth.py): fix client_ip
2024-08-19 21:01:54 -07:00
Krrish Dholakia
5e8a2ced04
fix(user_api_key_auth.py): log requester ip address to logs on request rejection
Closes https://github.com/BerriAI/litellm/issues/5220
2024-08-19 11:03:58 -07:00
Krish Dholakia
0916197c9d
Merge pull request #5244 from BerriAI/litellm_better_error_logging_sentry
refactor: replace .error() with .exception() logging for better debugging on sentry
2024-08-16 19:16:20 -07:00
Ishaan Jaff
55df861291
docs oauth 2.0 enterprise feature
2024-08-16 14:00:24 -07:00
Ishaan Jaff
9a9710b8a1
add debugging for oauth2.0
2024-08-16 13:40:32 -07:00
Krrish Dholakia
2874b94fb1
refactor: replace .error() with .exception() logging for better debugging on sentry
2024-08-16 09:22:47 -07:00
Krish Dholakia
6b14dbc885
Merge branch 'main' into litellm_fix_pass_through_endpoints
2024-08-14 14:59:38 -07:00
Krrish Dholakia
9387662eab
fix(user_api_key_auth.py): more precisely expand scope to handle 'basic' tokens
2024-08-13 22:00:33 -07:00
Ishaan Jaff
acb31c0acd
return detailed error message on check_valid_ip
2024-08-13 21:29:21 -07:00
Krrish Dholakia
8b4b378562
feat(user_api_key_auth.py): support calling langfuse with litellm user_api_key_auth
2024-08-13 20:26:24 -07:00
Ishaan Jaff
7d439865e7
add test for test_check_valid_ip_sent_with_x_forwarded_for
2024-08-13 15:54:53 -07:00
Ishaan Jaff
e62aa0d1c2
use _check_valid_ip
2024-08-13 15:41:01 -07:00
Ishaan Jaff
eb18d0a26b
check use_x_forwarded_for
2024-08-13 15:18:24 -07:00
Krrish Dholakia
e57add26b5
fix(user_api_key_auth.py): move warning to debug log
2024-08-12 10:35:44 -07:00
Krrish Dholakia
3d4672ba53
fix(user_api_key_auth.py): Fixes https://github.com/BerriAI/litellm/issues/5111
2024-08-08 10:30:15 -07:00
Krrish Dholakia
86a3dba1bf
refactor(user_api_key_auth.py): refactor to replace user_id_information list with pydantic user_obj
Allows using the 'get_user_object' function in user_api_key_auth, keeping it consistent across jwt-auth and key-auth
2024-08-07 15:33:55 -07:00
Krrish Dholakia
228da08b81
fix: fix tests
2024-08-07 15:02:04 -07:00
Krrish Dholakia
57a730095c
fix(user_api_key_auth.py): respect team budgets over user budget, if key belongs to team
Closes https://github.com/BerriAI/litellm/issues/5097
2024-08-07 14:32:27 -07:00
Krrish Dholakia
4fd583af43
fix(user_api_key_auth.py): fix _get_user_role
2024-08-05 20:54:27 -07:00
Krrish Dholakia
4ba576724c
test: improve debugging for test
2024-08-05 19:41:08 -07:00
Ishaan Jaff
7c7c8fe102
use get_request_route
2024-08-05 10:13:47 -07:00
Krrish Dholakia
8fea55ffd7
fix(user_api_key_auth.py): handle older user_role's
Fixes issue where older user_role's (e.g. app_user) weren't being recognized. + Adds testing for it
2024-08-05 08:57:06 -07:00
Krrish Dholakia
996763af1e
fix(user_api_key_auth.py): fix linting errors
2024-08-02 16:34:32 -07:00
Krrish Dholakia
cbad9e7502
build(ui): allow admin_viewer to view teams tab
Allows admin viewer role to see available teams on proxy ui
2024-08-02 16:19:40 -07:00
Krrish Dholakia
f76a4bab49
fix(user_api_key_cache): fix check to not raise error if team object is missing
2024-07-30 18:25:04 -07:00
Krrish Dholakia
1915adfc2d
fix(auth_checks.py): fix redis usage for team cached objects
2024-07-30 17:30:00 -07:00
Ishaan Jaff
93f3143dc5
Merge pull request #4916 from BerriAI/litellm_fix_ui_login
Feat UI - allow using custom header for litellm api key
2024-07-29 17:08:53 -07:00
Ishaan Jaff
59b91c35d7
better debugging for custom headers
2024-07-29 16:59:15 -07:00
Ishaan Jaff
a18f5bd5c8
security - check max request size
2024-07-27 16:08:41 -07:00
Krish Dholakia
0cb872cbea
Merge pull request #4810 from BerriAI/litellm_team_modify_guardrails
feat(auth_checks.py): Allow admin to disable team from turning on/off guardrails
2024-07-22 22:32:24 -07:00
Ishaan Jaff
5f238f2857
check is_llm_api_route
2024-07-22 14:43:30 -07:00
Krrish Dholakia
a351b7cc3e
feat(auth_checks.py): Allow admin to disable team from turning on/off guardrails.
2024-07-20 18:39:05 -07:00
Krrish Dholakia
b92af48854
fix(user_api_key_auth.py): update team values in token cache if refreshed more recently
2024-07-19 17:35:59 -07:00