Commit graph

1984 commits

Author SHA1 Message Date
ishaan-jaff
829fd5868f (feat) proxy - remove streamlit ui on startup 2024-01-01 12:54:23 +05:30
ishaan-jaff
869bac5f9d (fix) proxy - remove extra print statement 2024-01-01 10:52:09 +05:30
ishaan-jaff
f1c68e06fa (fix) proxy - remove errant print statement 2024-01-01 10:48:12 +05:30
Krrish Dholakia
e7fec954c6 fix(proxy_server.py): router model group alias routing
check model alias group routing before specific deployment routing, to deal with an alias being the same as a deployment name (e.g. gpt-3.5-turbo)
2023-12-30 17:55:24 +05:30
Krrish Dholakia
e1925d0e29 fix(router.py): support retry and fallbacks for atext_completion 2023-12-30 11:19:32 +05:30
Krrish Dholakia
8188475c16 feat(admin_ui.py): support creating keys on admin ui 2023-12-28 16:59:11 +05:30
Krrish Dholakia
1a7a7b0cc1 refactor(proxy_server.py): make print statements into print_verbose 2023-12-28 15:08:23 +05:30
Krrish Dholakia
f3846d4c30 refactor(proxy_server.py): move admin ui to be locally hosted within proxy 2023-12-28 15:06:32 +05:30
ishaan-jaff
cc1133d0cb (docs) proxy - swagger docs, add tags for model/info 2023-12-28 13:13:04 +05:30
Krrish Dholakia
b20bbaf96d fix(proxy_server.py): return [DONE] end of stream chunk 2023-12-28 11:30:49 +05:30
Krrish Dholakia
2285282ef8 feat(health_check.py): more detailed health check calls 2023-12-28 09:12:57 +05:30
ishaan-jaff
eba2969bff (fix) proxy - remove bloat - deprecated log_input_output 2023-12-27 17:36:53 +05:30
ishaan-jaff
636920c128 (feat) proxy remove appdirs as a dep 2023-12-27 17:33:47 +05:30
ishaan-jaff
2988cb0292 (feat) proxy - remove subprocess install 2023-12-27 17:28:03 +05:30
Krrish Dholakia
85549c3d66 fix(google_kms.py): support enums for key management system 2023-12-27 13:19:33 +05:30
Krrish Dholakia
6bd6d68990 feat(proxy_server.py): support maxage cache control 2023-12-26 17:50:27 +05:30
Krrish Dholakia
6f695838e5 feat(utils.py): support google kms for secret management
https://github.com/BerriAI/litellm/issues/1235
2023-12-26 15:39:40 +05:30
Krrish Dholakia
8f3732eeeb docs(user_keys.md): docs on passing user keys to litellm proxy 2023-12-26 13:55:28 +05:30
Krrish Dholakia
79978c44ba refactor: add black formatting 2023-12-25 14:11:20 +05:30
Krrish Dholakia
018405b956 fix(proxy/utils.py): return different exceptions if key is invalid vs. expired
https://github.com/BerriAI/litellm/issues/1230
2023-12-25 10:29:44 +05:30
Krrish Dholakia
6d73a77b01 fix(proxy_server.py): raise streaming exceptions 2023-12-25 07:18:09 +05:30
Krish Dholakia
7aab6061d7 Merge pull request #1211 from sihyeonn/fix/sh-success-callback
fix: success_callback logic for cost_tracking
2023-12-23 11:41:30 +05:30
Krrish Dholakia
a96bac14af fix(proxy_server.py): manage budget at user-level not key-level
https://github.com/BerriAI/litellm/issues/1220
2023-12-22 15:10:38 +05:30
Krrish Dholakia
61ab8dd5c1 fix(proxy_server.py): handle misformatted json body in chat completion request 2023-12-22 12:30:36 +05:30
Krrish Dholakia
aefaa157e5 feat(proxy_server.py): support max budget on proxy 2023-12-21 16:07:20 +05:30
Krrish Dholakia
158325497d feat(proxy_server.py): add new images/generation endpoint 2023-12-21 15:39:09 +05:30
Krrish Dholakia
fc935f0972 fix(proxy/utils.py): jsonify object before db writes 2023-12-21 13:03:14 +05:30
Sihyeon Jang
4cbcda607c fix: success_callback logic for cost_tracking
Signed-off-by: Sihyeon Jang <uneedsihyeon@gmail.com>
2023-12-21 16:09:59 +09:00
ishaan-jaff
36f42294c8 (feat) proxy key/generate pass metadata in requests 2023-12-20 13:42:49 +05:30
ishaan-jaff
c2c6b7cc31 (docs) swagger - add embedding tag 2023-12-20 09:04:56 +05:30
ishaan-jaff
935e83d445 (docs) swagger - add embeddings tag 2023-12-20 06:29:36 +05:30
ishaan-jaff
84f77bfaf8 (docs) swagger docs add description 2023-12-20 06:27:26 +05:30
ishaan-jaff
343e192914 (fix) proxy: add link to swagger docs on startup 2023-12-20 06:02:05 +05:30
ishaan-jaff
8cdcdda4c1 (fix) proxy - health checks support cli model 2023-12-19 12:55:20 +05:30
Krrish Dholakia
c87cdbd156 build(Dockerfile): fixing build requirements 2023-12-16 17:52:30 -08:00
Krrish Dholakia
68c53256ab fix(proxy_server.py): setup dependencies on server startup 2023-12-16 11:56:11 -08:00
ishaan-jaff
975ab97c44 (feat) proxy - set cache configs on proxy 2023-12-16 14:44:39 +05:30
ishaan-jaff
d7630d5b0c (fix) proxy remove errant print 2023-12-16 12:26:02 +05:30
Krrish Dholakia
1da7d35218 feat(proxy_server.py): enable infinite retries on rate limited requests 2023-12-15 20:03:41 -08:00
ishaan-jaff
e3cb7ba6ba (feat) proxy: embeddings-add OpenAI exception mapping 2023-12-15 14:02:24 +05:30
Krrish Dholakia
8a6a6171e8 fix(proxy_server.py): allow post-call rules to be passed in as a file 2023-12-14 18:21:09 -08:00
Krrish Dholakia
c2b92fe8b8 fix(proxy_server.py): don't pass in user param if not sent 2023-12-14 14:17:33 -08:00
Krrish Dholakia
bb5b883316 fix(main.py): support async streaming for text completions endpoint 2023-12-14 13:56:32 -08:00
Krrish Dholakia
8ca8496140 fix(proxy_server.py): handle scenario where no json body is passed in - e.g. GET request
https://github.com/BerriAI/litellm/issues/1062
2023-12-14 09:57:35 -08:00
ishaan-jaff
467ea282f2 (feat) proxy add docstring for /test 2023-12-14 16:51:16 +05:30
ishaan-jaff
a8b4f9d631 (feat) proxy - model group alias 2023-12-14 13:24:10 +05:30
Krrish Dholakia
3fbeca134f fix(custom_logger.py): enable pre_call hooks to modify incoming data to proxy 2023-12-13 16:20:37 -08:00
Krrish Dholakia
2231601d5a fix(ollama.py): fix async completion calls for ollama 2023-12-13 13:10:25 -08:00
Krrish Dholakia
8bb826975d feat(proxy_server.py): add new /key/update endpoint 2023-12-12 17:18:51 -08:00
Krrish Dholakia
5b8907772d refactor(proxy_server.py): code cleanup 2023-12-12 15:49:41 -08:00