Commit graph

530 commits

Author SHA1 Message Date
Krrish Dholakia
b3a98d8400 fix(proxy_server.py): accept keys with none duration 2023-12-25 13:46:24 +05:30
Krrish Dholakia
9f79f75635 fix(proxy/utils.py): return different exceptions if key is invalid vs. expired
https://github.com/BerriAI/litellm/issues/1230
2023-12-25 10:29:44 +05:30
Krrish Dholakia
89ee9fe400 fix(proxy_server.py): manage budget at user-level not key-level
https://github.com/BerriAI/litellm/issues/1220
2023-12-22 15:10:38 +05:30
Krrish Dholakia
1a32228da5 feat(proxy_server.py): support max budget on proxy 2023-12-21 16:07:20 +05:30
Krrish Dholakia
81078c4004 fix(proxy/utils.py): jsonify object before db writes 2023-12-21 13:03:14 +05:30
Krrish Dholakia
3291de9e11 fix(proxy_server.py): setup depedencies on server startup 2023-12-16 11:56:11 -08:00
Krrish Dholakia
effdddc1c8 fix(custom_logger.py): enable pre_call hooks to modify incoming data to proxy 2023-12-13 16:20:37 -08:00
Krrish Dholakia
7b8851cce5 fix(ollama.py): fix async completion calls for ollama 2023-12-13 13:10:25 -08:00
Krrish Dholakia
3e7604420e fix(proxy/utils.py): fix db writes on retry 2023-12-11 21:14:12 -08:00
Krrish Dholakia
ea89a8a938 test(test_custom_callback_unit.py): adding unit tests for custom callbacks + fixing related bugs 2023-12-11 11:44:09 -08:00
Krrish Dholakia
6ef0e8485e fix(proxy_server.py): support for streaming 2023-12-09 16:23:04 -08:00
Krrish Dholakia
5fa2b6e5ad fix(proxy_server.py): enable pre+post-call hooks and max parallel request limits 2023-12-08 17:11:30 -08:00
Krrish Dholakia
bab958561e fix(proxy/utils.py): correctly raise exceptions on db call fail 2023-12-08 13:35:42 -08:00
Krrish Dholakia
9f0b83248a fix(proxy/utils.py): add backoff/retry logic to db read/writes 2023-12-08 13:34:31 -08:00
Krrish Dholakia
f06c3e0f08 fix(proxy_server.py): initializing sentry in proxy logging before db init 2023-12-08 11:58:16 -08:00
Krrish Dholakia
7aec95ed7c feat(proxy_server.py): add sentry logging for db read/writes 2023-12-08 11:40:19 -08:00
Krrish Dholakia
4e6a8d09d0 fix(proxy/utils.py): tracking db failed writes 2023-12-08 10:40:02 -08:00
Krrish Dholakia
f5afc429b3 fix(proxy_server.py): add call hooks pre+post completion and embedding calls 2023-12-07 20:35:32 -08:00
Krrish Dholakia
c7aaa4adf8 docs(deploy.md): add docker instructions to deploy docs 2023-12-07 09:22:54 -08:00
Krrish Dholakia
31f3187670 test: fix linting errors 2023-12-04 18:43:01 -08:00
Krrish Dholakia
030bd22078 feat(proxy_server.py): allow user to override api key auth 2023-12-04 18:32:58 -08:00
Krrish Dholakia
f20bdc9b79 test: fix linting errors 2023-12-04 12:45:15 -08:00
Krrish Dholakia
63e55f1865 fix(proxy_server.py): fix /key/generate post endpoint 2023-12-04 10:44:13 -08:00
Krrish Dholakia
492c9043f6 fix: fix linting issues 2023-12-02 19:30:03 -08:00
Krrish Dholakia
6b1b1b82cf fix(proxy_server.py): hash keys 2023-12-02 19:30:03 -08:00
Krrish Dholakia
4f22e7de18 feat(proxy_server.py): tracking spend per api key 2023-11-24 15:14:06 -08:00
ishaan-jaff
0a23af7990 (feat) proxy: cost tracking add track_cost_callback 2023-11-24 05:53:23 -08:00
Krrish Dholakia
c273d6f0d6 fix(router.py): add support for context window fallbacks on router 2023-11-23 16:43:02 -08:00
ishaan-jaff
9b53ea4b0f (feat) proxy: cost tracking per completion request 2023-11-23 16:08:59 -08:00
Krrish Dholakia
e7735274de fix(factory.py): fixing llama-2 non-chat models prompt templating 2023-11-07 21:33:54 -08:00