Commit graph

1130 commits

Author SHA1 Message Date
spdustin@gmail.com
767679ffcf Add/fix samples for Claude pre-fill and system messages 2024-01-05 23:04:17 +00:00
spdustin@gmail.com
5d074f5b56 Adds tests and updates docs for Claude "pre-fill" 2024-01-05 22:58:41 +00:00
Krrish Dholakia
b0827a87b2 fix(caching.py): support s-maxage param for cache controls 2024-01-04 11:41:23 +05:30
Krrish Dholakia
4946b1ef6d docs(docs/index.md): add proxy details to docs 2024-01-04 11:20:43 +05:30
Krrish Dholakia
0f7d03f761 fix(proxy/rules.md): add docs on setting post-call rules on the proxy 2024-01-04 11:16:50 +05:30
ishaan-jaff
b103ab1f0b (docs) proxy - caching 2024-01-03 18:02:14 +05:30
ishaan-jaff
30c6d164d2 (docs) use s3 Cache on litellm proxy 2024-01-03 17:56:44 +05:30
ishaan-jaff
f9139e05e8 (docs) proxy - cachig 2024-01-03 17:56:44 +05:30
ishaan-jaff
64a0c175d5 (docs) simplify caching docs 2024-01-03 16:21:23 +05:30
ishaan-jaff
8bfaee9654 (docs) simplify caching docs 2024-01-03 16:20:19 +05:30
ishaan-jaff
344e8d8508 (docs) s3 cache 2024-01-03 16:16:18 +05:30
ishaan-jaff
cc29860785 (docs) s3 Cache 2024-01-03 15:42:23 +05:30
Krrish Dholakia
6c8cc33d02 docs(caching.md): fix typo 2024-01-03 12:47:16 +05:30
Krrish Dholakia
8cee267a5b fix(caching.py): support ttl, s-max-age, and no-cache cache controls
https://github.com/BerriAI/litellm/issues/1306
2024-01-03 12:42:43 +05:30
Krrish Dholakia
5055aeb254 docs(alerting.md): add alerting thresholds to docs 2024-01-03 11:21:56 +05:30
Krrish Dholakia
ff4eb5a5d4 docs(alerting.md): add slack alerting to docs 2024-01-02 22:47:01 +05:30
Krrish Dholakia
d17ffdbc83 docs(ui.md): update ui docs to for smtp server hosting 2024-01-02 22:30:17 +05:30
ishaan-jaff
8186af64c7 (docs) xinference on proxy 2024-01-02 16:57:25 +05:30
ishaan-jaff
8f8ac03961 (docs) proxy - using xinference 2024-01-02 16:55:10 +05:30
ishaan-jaff
fdd4e72503 (docs) xinference embedding 2024-01-02 15:39:25 +05:30
ishaan-jaff
0d0ee9e108 (docs) passing user config 2024-01-02 14:43:02 +05:30
ishaan-jaff
1efd1cb30f (docs) passing user_config to completion 2024-01-02 14:19:44 +05:30
ishaan-jaff
60164cd5e4 (docs) pass user config to proxy / router 2024-01-02 14:14:14 +05:30
ishaan-jaff
11f92c0074 (docs) router- init params 2024-01-02 12:14:32 +05:30
Krrish Dholakia
95f850fecc docs(ui.md): add docs on self serve ui flow 2024-01-01 18:25:52 +05:30
Krrish Dholakia
61cd800b9f fix(ui/admin.py): handles trailing '/' case 2024-01-01 17:49:54 +05:30
ishaan-jaff
20236c1c69 (docs) proxy 2024-01-01 12:40:12 +05:30
ishaan-jaff
a98d752f7b (docs) use embeddings with proxy 2024-01-01 12:31:24 +05:30
ishaan-jaff
cf902a53b4 (docs) using /embeddings with Proxy 2024-01-01 12:31:13 +05:30
ishaan-jaff
21cccd02e8 (docs) langfuse + langchain log metadata 2024-01-01 11:53:38 +05:30
ishaan-jaff
a549962f14 (docs) request/response format 2024-01-01 11:52:47 +05:30
ishaan-jaff
76f9c8cc8f (docs) request/response format proxy 2024-01-01 11:52:30 +05:30
ishaan-jaff
8b29f9a48b (docs) pass metatadata to proxy + openai client 2024-01-01 11:12:31 +05:30
ishaan-jaff
5b9973136e (docs) langfuse + proxy - log metadata 2024-01-01 11:03:00 +05:30
Krrish Dholakia
d0d08b4dce docs(routing.md): adding latency-based routing to docs 2024-01-01 08:36:40 +05:30
ishaan-jaff
c269c65371 (docs) langfuse log trace id, trace user id 2023-12-30 20:25:23 +05:30
ishaan-jaff
bf4a9f40e8 (docs) cache context manager 2023-12-30 19:50:22 +05:30
ishaan-jaff
1c93642951 (docs) caching use context manager 2023-12-30 19:43:26 +05:30
ishaan-jaff
231148ed73 (docs) caching 2023-12-30 19:04:36 +05:30
ishaan-jaff
7ecd7b3e8d (docs) proxy - timeout per request 2023-12-30 11:18:03 +05:30
ishaan-jaff
6252987798 (docs) proxy - set timeout per request 2023-12-30 11:17:31 +05:30
Krrish Dholakia
1f76b0e721 docs(routing.md): add retry_after to docs 2023-12-29 15:22:12 +05:30
Krrish Dholakia
1e07f0fce8 fix(caching.py): hash the cache key to prevent key too long errors 2023-12-29 15:03:33 +05:30
Krrish Dholakia
6e68cd1125 docs(load_test.md): add litellm load test script to docs 2023-12-29 13:41:44 +05:30
ishaan-jaff
52a9696303 (docs) cloudflare 2023-12-29 12:10:32 +05:30
Krrish Dholakia
a351211d03 docs(users.md): add user rate limits to docs 2023-12-28 19:28:32 +05:30
ishaan-jaff
a1484171b5 (docs) voyage ai embeddings 2023-12-28 17:15:16 +05:30
ishaan-jaff
aa2bd93166 (docs) add voyage ai 2023-12-28 17:12:58 +05:30
ishaan-jaff
01f7e85057 (docs) add mistral embeddings 2023-12-28 16:54:26 +05:30
ishaan-jaff
7f74a0331c (docs) add mistral-embed 2023-12-28 16:50:52 +05:30