Krrish Dholakia
db666b01e5
feat(proxy_server.py): add CRUD endpoints for 'end_user' management
...
allow admin to specify region + default models for end users
2024-05-08 18:50:36 -07:00
Ishaan Jaff
dea4a081c7
ui - new build
2024-05-08 18:45:54 -07:00
Ishaan Jaff
6d955ef457
Merge branch 'main' into ui_show_spend_end_user
2024-05-08 18:29:25 -07:00
Ishaan Jaff
a38d9e35fd
feat - get price by end_user
2024-05-08 18:19:27 -07:00
Ishaan Jaff
b99a6717af
fix - startTime, endTime in GlobalEndUsersSpend
2024-05-08 17:05:09 -07:00
Ishaan Jaff
1eea4d1c90
fix /global/spend/end_users
2024-05-08 17:03:38 -07:00
Krrish Dholakia
6575143460
feat(proxy_server.py): return litellm version in response headers
2024-05-08 16:00:08 -07:00
Krish Dholakia
91bb7cd261
Merge pull request #3437 from msabramo/add-engines-model-chat-completions-endpoint
...
Add `/engines/{model}/chat/completions` endpoint
2024-05-08 14:30:39 -07:00
Krrish Dholakia
a854824c02
fix(main.py): fix together ai text completion call
2024-05-08 09:10:45 -07:00
Krish Dholakia
0e709fdc21
Merge branch 'main' into litellm_ui_fixes_6
2024-05-07 22:01:04 -07:00
Krrish Dholakia
fbcda918de
feat(ui/model_dashboard.tsx): show if model is config or db model
2024-05-07 21:34:18 -07:00
Krrish Dholakia
dc8239ecc6
feat(model_dashboard.tsx): allow adding output cost per token on ui
2024-05-07 21:09:49 -07:00
Krrish Dholakia
5a16bec6a1
feat(model_dashboard.tsx): allow user to edit input cost per token for model on ui
...
also contains fixes for `/model/update`
2024-05-07 20:57:21 -07:00
Krrish Dholakia
312249ca44
feat(ui/model_dashboard.tsx): show if model is config or db model
2024-05-07 18:29:14 -07:00
Krish Dholakia
2aaaa5e1b4
Merge pull request #3506 from BerriAI/litellm_reintegrate_langfuse_url_slack_alert
...
feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
2024-05-07 15:03:29 -07:00
Krrish Dholakia
f210318bf1
fix(proxy_server.py): return budget duration in user response object
2024-05-07 13:47:32 -07:00
Krrish Dholakia
f2766fddbf
fix(proxy_server.py): fix /v1/models
bug where it would return empty list
...
handle 'all-team-models' being set for a given key
2024-05-07 13:43:15 -07:00
Jean-Luc Duckworth
d60aa8282e
Fixed typo. test_jwt.py tests pass
2024-05-07 16:08:36 -04:00
Krrish Dholakia
872470ff1f
feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
...
this ensures langfuse trace url returned in llm api exception err
2024-05-07 12:58:49 -07:00
Jean-Luc Duckworth
d5767e9403
Expanding jwt access to other RS and PS algos. Updated to resolve merge conflicts.
2024-05-07 15:45:07 -04:00
Sara Ghaemi
66b2b5fab9
made audience optional and updated docs
2024-05-07 11:37:04 -04:00
Sara Ghaemi
0ca5af8ae8
Merge branch 'BerriAI:main' into main
2024-05-07 11:23:00 -04:00
Sara Ghaemi
1580f993fe
Updated JWT handler to support PEM public key
2024-05-07 11:22:17 -04:00
Ishaan Jaff
bfef424b39
fix don't let slack alert block /model/new
2024-05-06 20:47:29 -07:00
Ishaan Jaff
eb84c69ec6
fix - /model/new
2024-05-06 20:45:17 -07:00
Krish Dholakia
aa62d891a0
Merge branch 'main' into litellm_slack_daily_reports
2024-05-06 19:31:20 -07:00
Krrish Dholakia
26c0ed0f2d
refactor(proxy_server.py): show ttl's on a top-level enum
...
Addresses - https://github.com/BerriAI/litellm/issues/2649#issuecomment-2097203372
2024-05-06 18:43:42 -07:00
Krrish Dholakia
6b9b4f05ba
feat(proxy_server.py): schedule slack daily report if enabled
...
if user enabled daily_reports, send them a slack report every 12 hours
2024-05-06 18:25:48 -07:00
Ishaan Jaff
791ec2e26a
ui - new build
2024-05-06 18:06:25 -07:00
Krrish Dholakia
718f423d7d
feat(slack_alerting.py): support sending daily reports on deployments
...
allow admin to easily know slow + failing deployments
Closes https://github.com/BerriAI/litellm/issues/3483
2024-05-06 17:18:42 -07:00
Ishaan Jaff
c600371e6e
feat - send alert on adding new model
2024-05-06 15:45:07 -07:00
Ishaan Jaff
562ef2d2e1
fix - add better debugging on num_callbacks test
2024-05-06 13:42:20 -07:00
Ishaan Jaff
59dac1bc7a
ui - new build
2024-05-04 10:55:39 -07:00
Lunik
1639a51f24
🔊 fix: Correctly use verbose logging
...
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-05-04 11:04:23 +02:00
Lunik
8783fd4895
✨ feat: Use 8 severity levels for azure content safety
...
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-05-04 10:45:39 +02:00
Lunik
cb178723ca
📝 doc: Azure content safety Proxy usage
...
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-05-04 10:39:43 +02:00
Ishaan Jaff
fccdb92c6b
fix - select startTime and endTime on UI
2024-05-03 21:20:19 -07:00
Ishaan Jaff
01a11ccced
ui - new build
2024-05-03 20:40:58 -07:00
Marc Abramowitz
eb433bde86
Add route: "/engines/{model:path}/chat/completions"
...
Without this, it results in:
```pytb
Traceback (most recent call last):
File "/Users/abramowi/Code/OpenSource/litellm/litellm/proxy/proxy_server.py", line 3836, in completion
raise HTTPException(
fastapi.exceptions.HTTPException: 400: {'error': 'completion: Invalid model name passed in model=gpt-3.5-turbo/chat'}
```
2024-05-03 18:02:29 -07:00
Ishaan Jaff
e7034ea53d
feat - filter exceptions by model group
2024-05-03 16:54:24 -07:00
Ishaan Jaff
3dd1e8dfe7
Merge pull request #3427 from BerriAI/litellm_test_alert_size
...
[Test] - Ensure only 1 slack callback + Size of of all callbacks do not grow
2024-05-03 16:27:16 -07:00
Krish Dholakia
1b35a75245
Merge pull request #3430 from BerriAI/litellm_return_api_base
...
feat(proxy_server.py): return api base in response headers
2024-05-03 16:25:21 -07:00
Krrish Dholakia
5b39f8e282
feat(proxy_server.py): return api base in response headers
...
Closes https://github.com/BerriAI/litellm/issues/2631
2024-05-03 15:27:32 -07:00
Ishaan Jaff
ab27866b6a
fix test slack alerting len
2024-05-03 14:58:11 -07:00
Ishaan Jaff
3997ea6442
fix - return num callbacks in /active/callbacks
2024-05-03 14:24:01 -07:00
Ishaan Jaff
e99edaf4e1
Merge pull request #3426 from BerriAI/litellm_set_db_exceptions_on_ui
...
UI - set DB Exceptions webhook_url on UI
2024-05-03 14:05:37 -07:00
Ishaan Jaff
776f541f6c
fix bug where slack would get inserting several times
2024-05-03 14:04:38 -07:00
Lunik
9ba9b3891f
⚡ ️ perf: Remove test violation on each stream chunk
...
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-05-03 20:51:40 +02:00
Lunik
e7405f105c
✅ ci: Add tests
...
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-05-03 20:50:37 +02:00
Ishaan Jaff
23d334fe60
proxy - return num callbacks on /health/readiness
2024-05-03 09:14:32 -07:00