Commit graph

5297 commits

Author SHA1 Message Date
Ishaan Jaff
7aa597afd1
Update new_release.yml 2024-01-04 13:41:44 +05:30
Ishaan Jaff
66607de725
(ci/cd) Create new_release.yml 2024-01-04 13:33:28 +05:30
ishaan-jaff
c231a6e4d3 (ci/cd) run proxy test with debug=True 2024-01-04 13:01:00 +05:30
ishaan-jaff
234c057e97 (fix) azure+cf gateway, health check 2024-01-04 12:34:07 +05:30
Krrish Dholakia
b0827a87b2 fix(caching.py): support s-maxage param for cache controls 2024-01-04 11:41:23 +05:30
Krrish Dholakia
4946b1ef6d docs(docs/index.md): add proxy details to docs 2024-01-04 11:20:43 +05:30
Krrish Dholakia
0f7d03f761 fix(proxy/rules.md): add docs on setting post-call rules on the proxy 2024-01-04 11:16:50 +05:30
ishaan-jaff
54653f9a4a (test) proxy + s3 caching 2024-01-04 11:11:08 +05:30
ishaan-jaff
aa757d19f5 (test) router - init clients - azure cloudflare, openai etc 2024-01-04 10:55:18 +05:30
ishaan-jaff
0864713b62 (test) cf azure 2024-01-04 10:26:41 +05:30
ishaan-jaff
8e10a1eb81 (docs) config with cloudflare exampel 2024-01-04 10:25:35 +05:30
ishaan-jaff
6d21ee3a2f (fix) proxy - cloudflare + Azure bug [non-streaming] 2024-01-04 10:24:51 +05:30
ishaan-jaff
b103ab1f0b (docs) proxy - caching 2024-01-03 18:02:14 +05:30
ishaan-jaff
30c6d164d2 (docs) use s3 Cache on litellm proxy 2024-01-03 17:56:44 +05:30
ishaan-jaff
f9139e05e8 (docs) proxy - cachig 2024-01-03 17:56:44 +05:30
Krrish Dholakia
469ae0a378 fix(proxy/utils.py): don't keep connecting to db if connection already established 2024-01-03 17:43:44 +05:30
Krrish Dholakia
f2da345173 fix(caching.py): handle cached_response being a dict not json string 2024-01-03 17:29:27 +05:30
Krrish Dholakia
f2210787cd feat(proxy_server.py): allow admins to update config via /config/update endpoint 2024-01-03 17:18:33 +05:30
ishaan-jaff
64a0c175d5 (docs) simplify caching docs 2024-01-03 16:21:23 +05:30
ishaan-jaff
8bfaee9654 (docs) simplify caching docs 2024-01-03 16:20:19 +05:30
ishaan-jaff
4680a26e2e (test) proxy - load test 2024-01-03 16:16:18 +05:30
ishaan-jaff
344e8d8508 (docs) s3 cache 2024-01-03 16:16:18 +05:30
Krrish Dholakia
d45101b652 fix(proxy_server.py): fix master key reset, to preserve key from env 2024-01-03 16:10:10 +05:30
Krrish Dholakia
40c974999e fix(proxy_server.py): reject bad /model/new POST requests 2024-01-03 15:54:58 +05:30
ishaan-jaff
cc29860785 (docs) s3 Cache 2024-01-03 15:42:23 +05:30
ishaan-jaff
d14a41863f (test) s3 cache with setting s3_bucket_name 2024-01-03 15:42:23 +05:30
ishaan-jaff
58ce5d44ae (feat) s3 cache support all boto3 params 2024-01-03 15:42:23 +05:30
Krrish Dholakia
b51371952b fix(proxy_server.py): handle base case for /model/info 2024-01-03 15:33:29 +05:30
Ishaan Jaff
92c4156ea1
Update README.md 2024-01-03 15:17:59 +05:30
Ishaan Jaff
2eeb20be46
Update README.md 2024-01-03 15:17:46 +05:30
Ishaan Jaff
c71fd26125
Update README.md 2024-01-03 15:17:24 +05:30
Ishaan Jaff
e91823218d
Update README.md 2024-01-03 15:16:50 +05:30
Ishaan Jaff
c27246e6f2
Update README.md 2024-01-03 15:15:24 +05:30
ishaan-jaff
fea0a933ae (test) use s3 buckets cache 2024-01-03 15:13:43 +05:30
ishaan-jaff
00364da993 (feat) add s3 Bucket as Cache 2024-01-03 15:13:43 +05:30
Krrish Dholakia
14e501845f fix(proxy_server.py): add support for setting master key via .env 2024-01-03 15:10:25 +05:30
Krrish Dholakia
ef8f1acfa4 refactor(proxy_server.py): more debug statements 2024-01-03 13:59:41 +05:30
Krrish Dholakia
6c8cc33d02 docs(caching.md): fix typo 2024-01-03 12:47:16 +05:30
Krrish Dholakia
8cee267a5b fix(caching.py): support ttl, s-max-age, and no-cache cache controls
https://github.com/BerriAI/litellm/issues/1306
2024-01-03 12:42:43 +05:30
ishaan-jaff
8772d87947 bump: version 1.16.12 → 1.16.13 2024-01-03 12:10:22 +05:30
ishaan-jaff
2bea0c742e (test) completion tokens counting + azure stream 2024-01-03 12:06:39 +05:30
ishaan-jaff
96cb6f3b10 (fix) azure+stream: count completion tokens 2024-01-03 12:06:39 +05:30
ishaan-jaff
f3b8d9c3ef (fix) counting response tokens+streaming 2024-01-03 12:06:39 +05:30
Krrish Dholakia
5055aeb254 docs(alerting.md): add alerting thresholds to docs 2024-01-03 11:21:56 +05:30
Krrish Dholakia
cd98d256b5 fix(proxy_server.py): add alerting for responses taking too long
https://github.com/BerriAI/litellm/issues/1298
2024-01-03 11:18:21 +05:30
Krrish Dholakia
0a6e4db999 bump: version 1.16.11 → 1.16.12 2024-01-03 10:12:48 +05:30
Krrish Dholakia
0d13c51615 fix(proxy/utils.py): fix self.alerting null case
https://github.com/BerriAI/litellm/issues/1298#issuecomment-1874798056
2024-01-03 10:12:21 +05:30
Krrish Dholakia
ff4eb5a5d4 docs(alerting.md): add slack alerting to docs 2024-01-02 22:47:01 +05:30
Krrish Dholakia
d17ffdbc83 docs(ui.md): update ui docs to for smtp server hosting 2024-01-02 22:30:17 +05:30
Krrish Dholakia
a778f8a00e bump: version 1.16.10 → 1.16.11 2024-01-02 22:26:47 +05:30