Ishaan Jaff
|
a844423e00
|
fix - use csv list for batch completions
|
2024-05-11 14:24:48 -07:00 |
|
Ishaan Jaff
|
b71f35de72
|
Merge pull request #3585 from BerriAI/litellm_router_batch_comp
[Litellm Proxy + litellm.Router] - Pass the same message/prompt to N models
|
2024-05-11 13:51:45 -07:00 |
|
Krrish Dholakia
|
926b86af87
|
feat(bedrock_httpx.py): moves to using httpx client for bedrock cohere calls
|
2024-05-11 13:43:08 -07:00 |
|
Ishaan Jaff
|
19111d2d63
|
edit dev config.yaml
|
2024-05-11 13:24:59 -07:00 |
|
Ishaan Jaff
|
c49033a687
|
support batch /chat/completions on proxy
|
2024-05-11 13:24:25 -07:00 |
|
Krish Dholakia
|
7f64c61275
|
Merge pull request #3582 from BerriAI/litellm_explicit_region_name_setting
feat(router.py): allow setting model_region in litellm_params
|
2024-05-11 11:36:22 -07:00 |
|
Krrish Dholakia
|
2ed155b4d4
|
feat(router.py): allow setting model_region in litellm_params
Closes https://github.com/BerriAI/litellm/issues/3580
|
2024-05-11 10:18:08 -07:00 |
|
Krish Dholakia
|
30117317a4
|
Merge branch 'main' into feat/add-azure-content-filter
|
2024-05-11 09:30:38 -07:00 |
|
Krrish Dholakia
|
bed16d5495
|
fix(vertex_ai.py): fix list tool call responses
Closes https://github.com/BerriAI/litellm/issues/3147
|
2024-05-10 20:05:58 -07:00 |
|
Ishaan Jaff
|
82344db621
|
fix triton params
|
2024-05-10 19:14:48 -07:00 |
|
Ishaan Jaff
|
d4eabefac1
|
Merge pull request #3573 from BerriAI/litellm_team_based_failure_callback
[Feat] Use Team based callbacks with litellm.failure_callbacks
|
2024-05-10 17:53:15 -07:00 |
|
Ishaan Jaff
|
3bee7279bf
|
fix langfuse team based logging tests
|
2024-05-10 16:39:49 -07:00 |
|
Ishaan Jaff
|
b0777de041
|
fix - using failure callbacks with team based logging
|
2024-05-10 16:18:13 -07:00 |
|
Ishaan Jaff
|
3b2e295337
|
fix - proxy_server.py
|
2024-05-10 13:47:35 -07:00 |
|
Krish Dholakia
|
460481a02d
|
Merge pull request #3378 from duckboy81/patch-1
Expand access for other jwt algorithms
|
2024-05-10 10:07:36 -07:00 |
|
Nick Wong
|
a1821428d1
|
added changes from upstream
Merge branch 'main' into fix/error-on-get-user-role
|
2024-05-09 16:14:14 -07:00 |
|
Nick Wong
|
ea0856408b
|
removed extra default dict return, which causes error if user_role is a string
|
2024-05-09 16:13:26 -07:00 |
|
Krrish Dholakia
|
8bb710b9c1
|
feat(proxy_server.py): expose new /team/list endpoint
Closes https://github.com/BerriAI/litellm/issues/3523
|
2024-05-09 13:21:00 -07:00 |
|
Krrish Dholakia
|
29ef2261f9
|
fix(auth_checks.py): fix 'get_end_user_object'
await cache get
|
2024-05-09 13:05:56 -07:00 |
|
Krish Dholakia
|
64ca2fde53
|
Merge branch 'main' into litellm_region_based_routing
|
2024-05-08 22:19:51 -07:00 |
|
Krrish Dholakia
|
0ea8222508
|
feat(router.py): enable filtering model group by 'allowed_model_region'
|
2024-05-08 22:10:17 -07:00 |
|
Krrish Dholakia
|
7c99808529
|
feat(proxy_server.py): add CRUD endpoints for 'end_user' management
allow admin to specify region + default models for end users
|
2024-05-08 18:50:36 -07:00 |
|
Ishaan Jaff
|
3e9bbc989f
|
ui - new build
|
2024-05-08 18:45:54 -07:00 |
|
Ishaan Jaff
|
eeece6e90b
|
Merge branch 'main' into ui_show_spend_end_user
|
2024-05-08 18:29:25 -07:00 |
|
Ishaan Jaff
|
41fd17dc3d
|
feat - get price by end_user
|
2024-05-08 18:19:27 -07:00 |
|
Ishaan Jaff
|
0260d1a925
|
fix - startTime, endTime in GlobalEndUsersSpend
|
2024-05-08 17:05:09 -07:00 |
|
Ishaan Jaff
|
bfa693e829
|
fix /global/spend/end_users
|
2024-05-08 17:03:38 -07:00 |
|
Krrish Dholakia
|
5f93cae3ff
|
feat(proxy_server.py): return litellm version in response headers
|
2024-05-08 16:00:08 -07:00 |
|
Krish Dholakia
|
269d6cb16f
|
Merge pull request #3437 from msabramo/add-engines-model-chat-completions-endpoint
Add `/engines/{model}/chat/completions` endpoint
|
2024-05-08 14:30:39 -07:00 |
|
Krrish Dholakia
|
24ffb1e601
|
fix(main.py): fix together ai text completion call
|
2024-05-08 09:10:45 -07:00 |
|
Krish Dholakia
|
1eb75273cf
|
Merge branch 'main' into litellm_ui_fixes_6
|
2024-05-07 22:01:04 -07:00 |
|
Krrish Dholakia
|
ae442f895b
|
feat(ui/model_dashboard.tsx): show if model is config or db model
|
2024-05-07 21:34:18 -07:00 |
|
Krrish Dholakia
|
0f12dd660d
|
feat(model_dashboard.tsx): allow adding output cost per token on ui
|
2024-05-07 21:09:49 -07:00 |
|
Krrish Dholakia
|
b9ec7acb08
|
feat(model_dashboard.tsx): allow user to edit input cost per token for model on ui
also contains fixes for `/model/update`
|
2024-05-07 20:57:21 -07:00 |
|
Krrish Dholakia
|
1882ee1c4c
|
feat(ui/model_dashboard.tsx): show if model is config or db model
|
2024-05-07 18:29:14 -07:00 |
|
Krish Dholakia
|
8b87500620
|
Merge pull request #3506 from BerriAI/litellm_reintegrate_langfuse_url_slack_alert
feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
|
2024-05-07 15:03:29 -07:00 |
|
Krrish Dholakia
|
17a7d4f602
|
fix(proxy_server.py): return budget duration in user response object
|
2024-05-07 13:47:32 -07:00 |
|
Krrish Dholakia
|
f503929918
|
fix(proxy_server.py): fix /v1/models bug where it would return empty list
handle 'all-team-models' being set for a given key
|
2024-05-07 13:43:15 -07:00 |
|
Jean-Luc Duckworth
|
26db16e61d
|
Fixed typo. test_jwt.py tests pass
|
2024-05-07 16:08:36 -04:00 |
|
Krrish Dholakia
|
e5c57cdddb
|
feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
this ensures langfuse trace url returned in llm api exception err
|
2024-05-07 12:58:49 -07:00 |
|
Jean-Luc Duckworth
|
a04281a478
|
Expanding jwt access to other RS and PS algos. Updated to resolve merge conflicts.
|
2024-05-07 15:45:07 -04:00 |
|
Sara Ghaemi
|
8706dd220f
|
made audience optional and updated docs
|
2024-05-07 11:37:04 -04:00 |
|
Sara Ghaemi
|
91c3e9a0fc
|
Merge branch 'BerriAI:main' into main
|
2024-05-07 11:23:00 -04:00 |
|
Sara Ghaemi
|
2022a7febb
|
Updated JWT handler to support PEM public key
|
2024-05-07 11:22:17 -04:00 |
|
Ishaan Jaff
|
8608795fc5
|
fix don't let slack alert block /model/new
|
2024-05-06 20:47:29 -07:00 |
|
Ishaan Jaff
|
43cd3f05d2
|
fix - /model/new
|
2024-05-06 20:45:17 -07:00 |
|
Krish Dholakia
|
185581d5a9
|
Merge branch 'main' into litellm_slack_daily_reports
|
2024-05-06 19:31:20 -07:00 |
|
Krrish Dholakia
|
3cada96f22
|
refactor(proxy_server.py): show ttl's on a top-level enum
Addresses - https://github.com/BerriAI/litellm/issues/2649#issuecomment-2097203372
|
2024-05-06 18:43:42 -07:00 |
|
Krrish Dholakia
|
2170e1723d
|
feat(proxy_server.py): schedule slack daily report if enabled
if user enabled daily_reports, send them a slack report every 12 hours
|
2024-05-06 18:25:48 -07:00 |
|
Ishaan Jaff
|
c80707cbcc
|
ui - new build
|
2024-05-06 18:06:25 -07:00 |
|