Commit graph

3582 commits

Author SHA1 Message Date
ishaan-jaff
7179d0cb48 (fix) s3 logger use r2 links 2024-01-25 18:36:20 -08:00
Krrish Dholakia
e948b39e3a test(test_streaming.py): fix test to handle none chunk 2024-01-25 18:34:13 -08:00
Krrish Dholakia
39aec43b86 test(main.py): adding more logging 2024-01-25 18:15:24 -08:00
Krrish Dholakia
ac131b3824 fix(utils.py): completion_cost support for image gen models 2024-01-25 18:08:18 -08:00
Ishaan Jaff
2b1177eaa9
Merge pull request #1625 from BerriAI/litellm_improve_admin_ui
[UI] Admin UI improvements
2024-01-25 16:48:02 -08:00
Krrish Dholakia
1ae22ea16d refactor: trigger new bump 2024-01-25 16:40:23 -08:00
Krrish Dholakia
014f83c847 fix(main.py): allow vertex ai project and location to be set in completion() call 2024-01-25 16:40:23 -08:00
ishaan-jaff
e00f46a6e9 (test) gpt-4-0125-preview 2024-01-25 16:40:23 -08:00
ishaan-jaff
c10bda3d30 (chore) cleanup testing file 2024-01-25 16:40:23 -08:00
ishaan-jaff
7e1b9158fe (test) embedding models 2024-01-25 16:40:23 -08:00
ishaan-jaff
5264a3eb53 (fix) do nothing if alerting is not switched on 2024-01-25 16:40:23 -08:00
ishaan-jaff
1c9b02ad99 (fix) alerting debug statements 2024-01-25 16:40:23 -08:00
ishaan-jaff
b8f917624f (docs) config.yaml 2024-01-25 16:40:23 -08:00
ishaan-jaff
3a1c8f453f (docs) track max_budget on proxy config.yaml 2024-01-25 16:40:23 -08:00
ishaan-jaff
229e4920df (fix) better alert message on budgets 2024-01-25 16:40:23 -08:00
ishaan-jaff
81c528f6ce (fix) raise correct error when proxy crossed budget 2024-01-25 16:40:23 -08:00
ishaan-jaff
56f49a87ba (fix) raise exception budget_duration is set and max_budget is Not 2024-01-25 16:40:23 -08:00
ishaan-jaff
7a2a7e047f (feat) slack alerting budgets 2024-01-25 16:40:23 -08:00
ishaan-jaff
55115a75b0 (feat) alerts proxy budgets 2024-01-25 16:40:23 -08:00
ishaan-jaff
565531fe9e v0 basic structure 2024-01-25 16:40:23 -08:00
ishaan-jaff
e56721d6c3 (feat) view spend/logs by user_id, view spend/user by user 2024-01-25 16:12:28 -08:00
Krrish Dholakia
76468cc351 refactor: trigger new bump 2024-01-25 16:06:01 -08:00
Krrish Dholakia
72275ad8cb fix(main.py): fix logging event loop for async logging but sync streaming 2024-01-25 15:59:53 -08:00
Krrish Dholakia
fbab810189 fix(main.py): allow vertex ai project and location to be set in completion() call 2024-01-25 15:00:51 -08:00
Krrish Dholakia
bbe6a92eb9 fix(main.py): fix order of assembly for streaming chunks 2024-01-25 14:51:08 -08:00
ishaan-jaff
e2d9e40886 (test) gpt-4-0125-preview 2024-01-25 14:42:10 -08:00
ishaan-jaff
72790f44da (chore) cleanup testing file 2024-01-25 14:36:11 -08:00
ishaan-jaff
6cbde02cab (test) embedding models 2024-01-25 14:30:49 -08:00
Krrish Dholakia
09ec6d6458 fix(utils.py): fix sagemaker async logging for sync streaming
https://github.com/BerriAI/litellm/issues/1592
2024-01-25 12:49:45 -08:00
Ishaan Jaff
06a5dbfb5e
Merge pull request #1615 from BerriAI/litellm_alerts_budget_tracking
[Feat] Alerts for Proxy Budgets
2024-01-25 12:00:18 -08:00
ishaan-jaff
ca12e70369 (fix) do nothing if alerting is not switched on 2024-01-25 11:58:55 -08:00
ishaan-jaff
e80d32dcdd (fix) alerting debug statements 2024-01-25 11:56:52 -08:00
ishaan-jaff
6dc9be4d43 (docs) config.yaml 2024-01-25 11:41:35 -08:00
ishaan-jaff
6fb3f8f239 (docs) track max_budget on proxy config.yaml 2024-01-25 11:40:56 -08:00
ishaan-jaff
b3f91844cb (fix) better alert message on budgets 2024-01-25 11:40:20 -08:00
ishaan-jaff
450b0a0ad1 (fix) raise correct error when proxy crossed budget 2024-01-25 11:39:57 -08:00
ishaan-jaff
126b87e3fa (fix) raise exception budget_duration is set and max_budget is Not 2024-01-25 11:32:05 -08:00
ishaan-jaff
3ef2afb0e4 (feat) slack alerting budgets 2024-01-25 11:18:06 -08:00
ishaan-jaff
1ab713c76c (feat) alerts proxy budgets 2024-01-25 10:01:32 -08:00
ishaan-jaff
d328a4bad0 v0 basic structure 2024-01-25 09:58:43 -08:00
Krrish Dholakia
39d5407e67 fix(proxy_server.py): don't set tpm/rpm limits unless set
https://github.com/BerriAI/litellm/issues/1594
2024-01-25 09:53:10 -08:00
Krrish Dholakia
7e06944d80 test(test_dynamodb_logs.py): skip flaky test 2024-01-24 23:18:45 -08:00
Krrish Dholakia
81846ffdec fix(proxy/utils.py): handle item not existing during batch updates 2024-01-24 21:49:47 -08:00
Krrish Dholakia
0752048b81 fix(dynamo_db.py): fix update bug 2024-01-24 21:29:56 -08:00
Krrish Dholakia
8e1157fc92 test(test_keys.py): reset proxy spend 2024-01-24 21:08:09 -08:00
Krrish Dholakia
34c4532e7e fix(proxy_server.py): fix handling none value for existing spend object pt.2 2024-01-24 20:39:00 -08:00
Krrish Dholakia
12b2ad9ed1 fix(proxy_server.py): fix handling none value for existing spend object 2024-01-24 20:37:56 -08:00
Krish Dholakia
6501fdb76e
Merge branch 'main' into litellm_global_spend_updates 2024-01-24 20:20:15 -08:00
Krrish Dholakia
43f139fafd fix(ollama_chat.py): fix default token counting for ollama chat 2024-01-24 20:09:17 -08:00
ishaan-jaff
3e59a02dfb (test) test /key/gen with max_budget=None 2024-01-24 19:27:53 -08:00