Commit graph

2077 commits

Author SHA1 Message Date
Ishaan Jaff
a844423e00 fix - use csv list for batch completions 2024-05-11 14:24:48 -07:00
Ishaan Jaff
b71f35de72 Merge pull request #3585 from BerriAI/litellm_router_batch_comp
[Litellm Proxy + litellm.Router] - Pass the same message/prompt to N models
2024-05-11 13:51:45 -07:00
Krrish Dholakia
926b86af87 feat(bedrock_httpx.py): moves to using httpx client for bedrock cohere calls 2024-05-11 13:43:08 -07:00
Ishaan Jaff
19111d2d63 edit dev config.yaml 2024-05-11 13:24:59 -07:00
Ishaan Jaff
c49033a687 support batch /chat/completions on proxy 2024-05-11 13:24:25 -07:00
Krish Dholakia
7f64c61275 Merge pull request #3582 from BerriAI/litellm_explicit_region_name_setting
feat(router.py): allow setting model_region in litellm_params
2024-05-11 11:36:22 -07:00
Krrish Dholakia
2ed155b4d4 feat(router.py): allow setting model_region in litellm_params
Closes https://github.com/BerriAI/litellm/issues/3580
2024-05-11 10:18:08 -07:00
Krish Dholakia
30117317a4 Merge branch 'main' into feat/add-azure-content-filter 2024-05-11 09:30:38 -07:00
Krrish Dholakia
bed16d5495 fix(vertex_ai.py): fix list tool call responses
Closes https://github.com/BerriAI/litellm/issues/3147
2024-05-10 20:05:58 -07:00
Ishaan Jaff
82344db621 fix triton params 2024-05-10 19:14:48 -07:00
Ishaan Jaff
d4eabefac1 Merge pull request #3573 from BerriAI/litellm_team_based_failure_callback
[Feat] Use Team based callbacks with litellm.failure_callbacks
2024-05-10 17:53:15 -07:00
Ishaan Jaff
3bee7279bf fix langfuse team based logging tests 2024-05-10 16:39:49 -07:00
Ishaan Jaff
b0777de041 fix - using failure callbacks with team based logging 2024-05-10 16:18:13 -07:00
Ishaan Jaff
3b2e295337 fix - proxy_server.py 2024-05-10 13:47:35 -07:00
Krish Dholakia
460481a02d Merge pull request #3378 from duckboy81/patch-1
Expand access for other jwt algorithms
2024-05-10 10:07:36 -07:00
Nick Wong
a1821428d1 added changes from upstream
Merge branch 'main' into fix/error-on-get-user-role
2024-05-09 16:14:14 -07:00
Nick Wong
ea0856408b removed extra default dict return, which causes error if user_role is a string 2024-05-09 16:13:26 -07:00
Krrish Dholakia
8bb710b9c1 feat(proxy_server.py): expose new /team/list endpoint
Closes https://github.com/BerriAI/litellm/issues/3523
2024-05-09 13:21:00 -07:00
Krrish Dholakia
29ef2261f9 fix(auth_checks.py): fix 'get_end_user_object'
await cache get
2024-05-09 13:05:56 -07:00
Krish Dholakia
64ca2fde53 Merge branch 'main' into litellm_region_based_routing 2024-05-08 22:19:51 -07:00
Krrish Dholakia
0ea8222508 feat(router.py): enable filtering model group by 'allowed_model_region' 2024-05-08 22:10:17 -07:00
Krrish Dholakia
7c99808529 feat(proxy_server.py): add CRUD endpoints for 'end_user' management
allow admin to specify region + default models for end users
2024-05-08 18:50:36 -07:00
Ishaan Jaff
3e9bbc989f ui - new build 2024-05-08 18:45:54 -07:00
Ishaan Jaff
eeece6e90b Merge branch 'main' into ui_show_spend_end_user 2024-05-08 18:29:25 -07:00
Ishaan Jaff
41fd17dc3d feat - get price by end_user 2024-05-08 18:19:27 -07:00
Ishaan Jaff
0260d1a925 fix - startTime, endTime in GlobalEndUsersSpend 2024-05-08 17:05:09 -07:00
Ishaan Jaff
bfa693e829 fix /global/spend/end_users 2024-05-08 17:03:38 -07:00
Krrish Dholakia
5f93cae3ff feat(proxy_server.py): return litellm version in response headers 2024-05-08 16:00:08 -07:00
Krish Dholakia
269d6cb16f Merge pull request #3437 from msabramo/add-engines-model-chat-completions-endpoint
Add `/engines/{model}/chat/completions` endpoint
2024-05-08 14:30:39 -07:00
Krrish Dholakia
24ffb1e601 fix(main.py): fix together ai text completion call 2024-05-08 09:10:45 -07:00
Krish Dholakia
1eb75273cf Merge branch 'main' into litellm_ui_fixes_6 2024-05-07 22:01:04 -07:00
Krrish Dholakia
ae442f895b feat(ui/model_dashboard.tsx): show if model is config or db model 2024-05-07 21:34:18 -07:00
Krrish Dholakia
0f12dd660d feat(model_dashboard.tsx): allow adding output cost per token on ui 2024-05-07 21:09:49 -07:00
Krrish Dholakia
b9ec7acb08 feat(model_dashboard.tsx): allow user to edit input cost per token for model on ui
also contains fixes for `/model/update`
2024-05-07 20:57:21 -07:00
Krrish Dholakia
1882ee1c4c feat(ui/model_dashboard.tsx): show if model is config or db model 2024-05-07 18:29:14 -07:00
Krish Dholakia
8b87500620 Merge pull request #3506 from BerriAI/litellm_reintegrate_langfuse_url_slack_alert
feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
2024-05-07 15:03:29 -07:00
Krrish Dholakia
17a7d4f602 fix(proxy_server.py): return budget duration in user response object 2024-05-07 13:47:32 -07:00
Krrish Dholakia
f503929918 fix(proxy_server.py): fix /v1/models bug where it would return empty list
handle 'all-team-models' being set for a given key
2024-05-07 13:43:15 -07:00
Jean-Luc Duckworth
26db16e61d Fixed typo. test_jwt.py tests pass 2024-05-07 16:08:36 -04:00
Krrish Dholakia
e5c57cdddb feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
this ensures langfuse trace url returned in llm api exception err
2024-05-07 12:58:49 -07:00
Jean-Luc Duckworth
a04281a478 Expanding jwt access to other RS and PS algos. Updated to resolve merge conflicts. 2024-05-07 15:45:07 -04:00
Sara Ghaemi
8706dd220f made audience optional and updated docs 2024-05-07 11:37:04 -04:00
Sara Ghaemi
91c3e9a0fc Merge branch 'BerriAI:main' into main 2024-05-07 11:23:00 -04:00
Sara Ghaemi
2022a7febb Updated JWT handler to support PEM public key 2024-05-07 11:22:17 -04:00
Ishaan Jaff
8608795fc5 fix don't let slack alert block /model/new 2024-05-06 20:47:29 -07:00
Ishaan Jaff
43cd3f05d2 fix - /model/new 2024-05-06 20:45:17 -07:00
Krish Dholakia
185581d5a9 Merge branch 'main' into litellm_slack_daily_reports 2024-05-06 19:31:20 -07:00
Krrish Dholakia
3cada96f22 refactor(proxy_server.py): show ttl's on a top-level enum
Addresses - https://github.com/BerriAI/litellm/issues/2649#issuecomment-2097203372
2024-05-06 18:43:42 -07:00
Krrish Dholakia
2170e1723d feat(proxy_server.py): schedule slack daily report if enabled
if user enabled daily_reports, send them a slack report every 12 hours
2024-05-06 18:25:48 -07:00
Ishaan Jaff
c80707cbcc ui - new build 2024-05-06 18:06:25 -07:00