Ishaan Jaff
e5488a94dd
fix - add better debugging on num_callbacks test
2024-05-06 13:42:20 -07:00
Ishaan Jaff
5ffe9a8008
fix - select startTime and endTime on UI
2024-05-03 21:20:19 -07:00
Marc Abramowitz
15ba2977ef
Add route: "/engines/{model:path}/chat/completions"
...
Without this, it results in:
```pytb
Traceback (most recent call last):
  File "/Users/abramowi/Code/OpenSource/litellm/litellm/proxy/proxy_server.py", line 3836, in completion
    raise HTTPException(
fastapi.exceptions.HTTPException: 400: {'error': 'completion: Invalid model name passed in model=gpt-3.5-turbo/chat'}
```
2024-05-03 18:02:29 -07:00
Ishaan Jaff
ca054f95de
feat - filter exceptions by model group
2024-05-03 16:54:24 -07:00
Ishaan Jaff
a0136af9ec
Merge pull request #3427 from BerriAI/litellm_test_alert_size
...
[Test] - Ensure only 1 slack callback + Size of all callbacks does not grow
2024-05-03 16:27:16 -07:00
Krish Dholakia
53993c612b
Merge pull request #3430 from BerriAI/litellm_return_api_base
...
feat(proxy_server.py): return api base in response headers
2024-05-03 16:25:21 -07:00
Krrish Dholakia
aca36d8b79
feat(proxy_server.py): return api base in response headers
...
Closes https://github.com/BerriAI/litellm/issues/2631
2024-05-03 15:27:32 -07:00
Ishaan Jaff
af9875f358
fix test slack alerting len
2024-05-03 14:58:11 -07:00
Ishaan Jaff
5b97d56e41
fix - return num callbacks in /active/callbacks
2024-05-03 14:24:01 -07:00
Ishaan Jaff
2c62353ce0
Merge pull request #3426 from BerriAI/litellm_set_db_exceptions_on_ui
...
UI - set DB Exceptions webhook_url on UI
2024-05-03 14:05:37 -07:00
Ishaan Jaff
b334b60c41
fix bug where slack would get inserted several times
2024-05-03 14:04:38 -07:00
Ishaan Jaff
051d4daa08
proxy - return num callbacks on /health/readiness
2024-05-03 09:14:32 -07:00
Marc Abramowitz
24ddba804f
Disambiguate invalid model name errors
...
because that error can be thrown in several different places, so
knowing the function it's being thrown from can be very useful for debugging.
2024-05-02 15:02:54 -07:00
Lunik
3ca174bc57
✨ feat: Add Azure Content-Safety Proxy hooks
...
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-05-02 23:21:08 +02:00
Krish Dholakia
1b5efc47e1
Merge pull request #3375 from msabramo/GH-3372
...
Fix route `/openai/deployments/{model}/chat/completions` not working properly
2024-05-02 13:00:25 -07:00
Krish Dholakia
dff4649509
Merge branch 'main' into litellm_openmeter_integration
2024-05-01 21:19:29 -07:00
Krrish Dholakia
0b1382aef4
build(ui): enable adding openmeter via proxy ui
2024-05-01 21:16:23 -07:00
Ishaan Jaff
5ce76a5b0d
feat - show slow count and total count
2024-05-01 17:18:14 -07:00
Ishaan Jaff
1721ab64aa
feat - return slow responses on admin UI
2024-05-01 17:16:33 -07:00
Ishaan Jaff
cb5073da62
fix - dont show cache hits on model latency tracker
2024-05-01 16:51:15 -07:00
Ishaan Jaff
5b70130049
fix latency tracking tool tip
2024-05-01 16:47:30 -07:00
Ishaan Jaff
cf2dd97f0b
clean up model latency metrics
2024-05-01 08:27:01 -07:00
Ishaan Jaff
1a2045d63a
ui - show tokens / sec
2024-04-30 22:44:28 -07:00
Ishaan Jaff
ee5ebd8f4f
fix - viewing model metrics
2024-04-30 18:26:14 -07:00
Ishaan Jaff
69d8f7b89b
fix - track litellm_model_name in LiteLLM_ErrorLogs
2024-04-30 17:31:40 -07:00
Ishaan Jaff
5756adc178
backend - show model latency per token
2024-04-30 17:23:36 -07:00
Ishaan Jaff
1c9aa7f0f1
fix passing starttime and endtime to model/exceptions
2024-04-30 16:53:53 -07:00
Ishaan Jaff
672d1c67fc
fix - backend return exceptions
2024-04-30 15:41:16 -07:00
Marc Abramowitz
eac4593ab9
Move chat_completions before completions
...
so that the `chat_completions` route is defined before the `completions` route.
This is necessary because the `chat_completions` route is more
specific than the `completions` route, and the order of route definitions
matters in FastAPI.
Without this, doing a request to
`/openai/deployments/{model_in_url}/chat/completions` might trigger
`completions` being called (with `model` set to `{model_in_url}/chat`) instead of
`chat_completions` getting called, which is the correct function.
Fixes: GH-3372
2024-04-30 15:07:10 -07:00
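The route-ordering problem described in the commit above can be illustrated with a small standard-library sketch of first-match routing. This is not FastAPI's router; `compile_route` and `dispatch` are hypothetical helpers, and the two route templates are assumed to use a `{model:path}` parameter (one that may contain slashes), which is consistent with the `model=gpt-3.5-turbo/chat` error reported elsewhere in this log.

```python
import re


def compile_route(template):
    """Turn a route template into a regex (hypothetical helper, not FastAPI).

    "{name}" matches one path segment; "{name:path}" may span slashes.
    """
    pattern = ""
    pos = 0
    for m in re.finditer(r"\{(\w+)(:path)?\}", template):
        pattern += re.escape(template[pos:m.start()])
        segment = ".+" if m.group(2) else "[^/]+"
        pattern += f"(?P<{m.group(1)}>{segment})"
        pos = m.end()
    pattern += re.escape(template[pos:])
    return re.compile("^" + pattern + "$")


def dispatch(routes, path):
    """Return (handler_name, params) for the first route that matches."""
    for template, handler_name in routes:
        match = compile_route(template).match(path)
        if match:
            return handler_name, match.groupdict()
    return None, {}


# Assumed templates, for illustration only.
COMPLETIONS = ("/openai/deployments/{model:path}/completions", "completion")
CHAT = ("/openai/deployments/{model:path}/chat/completions", "chat_completion")

request_path = "/openai/deployments/gpt-3.5-turbo/chat/completions"

# Generic route registered first: it swallows "/chat" into the model name.
before_fix = dispatch([COMPLETIONS, CHAT], request_path)

# More specific route registered first: the chat handler is selected.
after_fix = dispatch([CHAT, COMPLETIONS], request_path)

print(before_fix)
print(after_fix)
```

With the generic route first, the handler is `completion` and `model` is `gpt-3.5-turbo/chat`; with the specific route first, the handler is `chat_completion` and `model` is `gpt-3.5-turbo`.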
Ishaan Jaff
1776f5aea4
stash /model/metrics/exceptions endpoints
2024-04-30 14:19:23 -07:00
Ishaan Jaff
ebe842d404
log startTime and EndTime for exceptions
2024-04-30 13:34:14 -07:00
Ishaan Jaff
ab2b78cd88
feat log request kwargs in error logs
2024-04-30 13:28:26 -07:00
Ishaan Jaff
918fff424f
fix log model_group
2024-04-30 13:11:09 -07:00
Ishaan Jaff
7f9b9d0173
fix - log api_base in errors
2024-04-30 13:02:42 -07:00
Ishaan Jaff
96cbdcb349
fix - working exception writing
2024-04-30 12:48:17 -07:00
Krrish Dholakia
a3fea74826
fix(proxy_server.py): return more detailed auth error message.
2024-04-29 07:24:19 -07:00
Krrish Dholakia
ae04330348
fix(proxy_server.py): fix setting offset-aware datetime
2024-04-25 21:18:32 -07:00
Krrish Dholakia
462724feef
fix(proxy/utils.py): log rejected proxy requests to langfuse
2024-04-25 19:26:27 -07:00
Ishaan Jaff
9f0284affa
fixes for testing alerting
2024-04-25 16:33:55 -07:00
Ishaan Jaff
a6132ab5d4
backend - update slack alert_to_webhook_url_map
2024-04-25 13:47:52 -07:00
Ishaan Jaff
2190b039fd
pass alert type on alerting handle
2024-04-25 13:05:34 -07:00
Krrish Dholakia
f260050e4d
fix(proxy_server.py): fix update router
2024-04-24 23:01:21 -07:00
Krrish Dholakia
9577548060
fix(proxy_server.py): fix updating non-router settings for proxy config
2024-04-24 22:50:04 -07:00
Krrish Dholakia
1988ce3247
feat(router.py): support mock testing fallbacks flag
...
allow user to test if fallbacks work as expected with a `mock_testing_fallbacks = True` flag set during a call
2024-04-24 20:13:10 -07:00
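A rough sketch of what a mock-fallback test flag like the one in the commit above could look like. This is not LiteLLM's router code: `call_with_fallbacks`, `call_fn`, and the model names are hypothetical, and only the flag name `mock_testing_fallbacks` comes from the commit message. The idea is that setting the flag forces the primary model to fail so the fallback path can be exercised without a real outage.

```python
def call_with_fallbacks(model_group, fallbacks, call_fn, mock_testing_fallbacks=False):
    """Try the primary model group, then each fallback in order.

    Hypothetical helper: when mock_testing_fallbacks is True, the primary
    call is replaced by a simulated failure so the fallbacks are used.
    """
    candidates = [model_group] + list(fallbacks)
    last_error = None
    for index, candidate in enumerate(candidates):
        try:
            if mock_testing_fallbacks and index == 0:
                # Simulated failure; no request is sent to the primary.
                raise RuntimeError(f"mock failure for {candidate}")
            return candidate, call_fn(candidate)
        except Exception as exc:
            last_error = exc
    # Every candidate failed: surface the most recent error.
    raise last_error


def fake_completion(model):
    """Stand-in for a real completion call (assumption for the example)."""
    return f"ok:{model}"


normal = call_with_fallbacks("gpt-4", ["gpt-3.5-turbo"], fake_completion)
mocked = call_with_fallbacks(
    "gpt-4", ["gpt-3.5-turbo"], fake_completion, mock_testing_fallbacks=True
)
print(normal)
print(mocked)
```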
Krrish Dholakia
f4bd85a489
fix(proxy_server.py): fix /config/update
...
allows updating router config via UI and having the change be propagated across all proxy instances by persisting config changes to the db
2024-04-24 16:42:42 -07:00
Ishaan Jaff
f7053ac85f
Merge pull request #3277 from BerriAI/litellm_update_deployments
...
[UI] V0 - Edit Model tpm, rpm, api_base
2024-04-24 14:03:00 -07:00
Ishaan Jaff
c19f28b7d2
/model/update endpoint
2024-04-24 10:39:20 -07:00
Krrish Dholakia
840bb5a18d
fix(proxy_server.py): add new flag for disable sharing master key on ui
2024-04-24 10:06:01 -07:00
Ishaan Jaff
acf08e6d00
backend allow filtering by model_group
2024-04-23 22:03:20 -07:00
Ishaan Jaff
bd18a8909e
ui - filter by time and deployments
2024-04-23 20:42:15 -07:00