Commit graph

534 commits

Author SHA1 Message Date
Krrish Dholakia
2070a785a4 feat(utils.py): support google kms for secret management
https://github.com/BerriAI/litellm/issues/1235
2023-12-26 15:39:40 +05:30
Krrish Dholakia
871f207124 docs(user_keys.md): docs on passing user keys to litellm proxy 2023-12-26 13:55:28 +05:30
ishaan-jaff
d273d19bd9 (feat) proxy, use --model with --test 2023-12-26 09:40:58 +05:30
Krrish Dholakia
4905929de3 refactor: add black formatting 2023-12-25 14:11:20 +05:30
Krrish Dholakia
b3a98d8400 fix(proxy_server.py): accept keys with none duration 2023-12-25 13:46:24 +05:30
Krrish Dholakia
9f79f75635 fix(proxy/utils.py): return different exceptions if key is invalid vs. expired
https://github.com/BerriAI/litellm/issues/1230
2023-12-25 10:29:44 +05:30
Krrish Dholakia
402b2e5733 build(test_streaming.py): fix linting issues 2023-12-25 07:34:54 +05:30
Krrish Dholakia
343a06fd84 fix(proxy_server.py): raise streaming exceptions 2023-12-25 07:18:09 +05:30
Krrish Dholakia
b4b4a7d43a Revert "test(test_azure_perf.py): add perf testing for router streaming"
This reverts commit 4fa7f19888.
2023-12-23 13:19:07 +05:30
Krrish Dholakia
4fa7f19888 test(test_azure_perf.py): add perf testing for router streaming 2023-12-23 13:16:49 +05:30
Krish Dholakia
8afdc12918 Merge pull request #1211 from sihyeonn/fix/sh-success-callback
fix: success_callback logic for cost_tracking
2023-12-23 11:41:30 +05:30
Krrish Dholakia
89ee9fe400 fix(proxy_server.py): manage budget at user-level not key-level
https://github.com/BerriAI/litellm/issues/1220
2023-12-22 15:10:38 +05:30
Krrish Dholakia
979575a2a6 fix(proxy_server.py): handle misformatted json body in chat completion request 2023-12-22 12:30:36 +05:30
Krrish Dholakia
278f61f3ed fix(utils.py): handle 'os.environ/' being passed in as kwargs 2023-12-22 11:08:44 +05:30
Krrish Dholakia
1a32228da5 feat(proxy_server.py): support max budget on proxy 2023-12-21 16:07:20 +05:30
Krrish Dholakia
14115d0d60 feat(proxy_server.py): add new images/generation endpoint 2023-12-21 15:39:09 +05:30
Krrish Dholakia
81078c4004 fix(proxy/utils.py): jsonify object before db writes 2023-12-21 13:03:14 +05:30
Sihyeon Jang
4f41c3c513 fix: success_callback logic for cost_tracking
Signed-off-by: Sihyeon Jang <uneedsihyeon@gmail.com>
2023-12-21 16:09:59 +09:00
ishaan-jaff
683a1ee979 (feat) proxy key/generate pass metadata in requests 2023-12-20 13:42:49 +05:30
ishaan-jaff
7ad21de441 (feat) proxy /key/generate add metadata to _types 2023-12-20 13:42:49 +05:30
ishaan-jaff
c4b7ab6579 (feat) proxy - add metadata for keys 2023-12-20 13:42:49 +05:30
ishaan-jaff
229b56fc35 (docs) swagger - add embedding tag 2023-12-20 09:04:56 +05:30
ishaan-jaff
aa78415894 (docs) swager - add embeddings tag 2023-12-20 06:29:36 +05:30
ishaan-jaff
9548334e2f (docs) swagger docs add description 2023-12-20 06:27:26 +05:30
ishaan-jaff
8b26e64b5d (fix) proxy: add link t swagger docs on startup 2023-12-20 06:02:05 +05:30
ishaan-jaff
ce1b0b89ba (fix) proxy - health checks support cli model 2023-12-19 12:55:20 +05:30
Krrish Dholakia
071283c102 fix(router.py): init deployment_latency_map even if model_list is empty 2023-12-18 17:50:26 -08:00
Krrish Dholakia
3923c389fd build(Dockerfile): fixing build requirements 2023-12-16 17:52:30 -08:00
Krrish Dholakia
3291de9e11 fix(proxy_server.py): setup depedencies on server startup 2023-12-16 11:56:11 -08:00
ishaan-jaff
6b7d0eada4 (feat) proxy - set cache configs on proxy 2023-12-16 14:44:39 +05:30
ishaan-jaff
37bf94dcdf (fix) proxy remove errant print 2023-12-16 12:26:02 +05:30
Krrish Dholakia
4791dda66f feat(proxy_server.py): enable infinite retries on rate limited requests 2023-12-15 20:03:41 -08:00
ishaan-jaff
0b4f1dee42 (feat) proxy: embeddings-add OpenAI exception mapping 2023-12-15 14:02:24 +05:30
ishaan-jaff
f09ce1d316 (test) proxy - test exception mapping 2023-12-15 08:28:12 +05:30
Krrish Dholakia
adaf341f54 fix(proxy_server.py): allow post-call rules to be passed in as a file 2023-12-14 18:21:09 -08:00
ishaan-jaff
fff148a835 (test) router_get_deployments - stop using remove_model_id 2023-12-15 07:07:53 +05:30
Krrish Dholakia
2a4c1a1803 fix(proxy_server.py): don't pass in user param if not sent 2023-12-14 14:17:33 -08:00
Krrish Dholakia
1608dd7e0b fix(main.py): support async streaming for text completions endpoint 2023-12-14 13:56:32 -08:00
Krrish Dholakia
db8d793618 fix(proxy_server.py): handle scenario where no json body is passed in - e.g. GET request
https://github.com/BerriAI/litellm/issues/1062
2023-12-14 09:57:35 -08:00
ishaan-jaff
d1cf41888b (feat) proxy add docstring for /test 2023-12-14 16:51:16 +05:30
ishaan-jaff
80fc8050eb (feat) proxy - model group alias 2023-12-14 13:24:10 +05:30
ishaan-jaff
77bcaaae9e (fix) proxy cli --version 2023-12-14 13:22:39 +05:30
ishaan-jaff
241add8b33 (feat) proxy add --version 2023-12-14 12:28:42 +05:30
Krrish Dholakia
effdddc1c8 fix(custom_logger.py): enable pre_call hooks to modify incoming data to proxy 2023-12-13 16:20:37 -08:00
Krrish Dholakia
7b8851cce5 fix(ollama.py): fix async completion calls for ollama 2023-12-13 13:10:25 -08:00
Krrish Dholakia
693292a64c feat(proxy_server.py): add new /key/update endpoint 2023-12-12 17:18:51 -08:00
Krrish Dholakia
a266ad0319 refactor(proxy_server.py): code cleanup 2023-12-12 15:49:41 -08:00
Krrish Dholakia
8b07a6c046 fix(main.py): pass user_id + encoding_format for logging + to openai/azure 2023-12-12 15:46:44 -08:00
Krrish Dholakia
bdf29ca71f fix(sagemaker.py): debug streaming 2023-12-12 15:45:07 -08:00
ishaan-jaff
6d76878382 (fix) pydantic: Field "model_list" has conflict with protected namespace "model_". 2023-12-12 12:38:11 -08:00