Commit graph

65 commits

Author SHA1 Message Date
Ishaan Jaff
8fada93fff docs on using vertex tts 2024-08-23 17:57:49 -07:00
Ishaan Jaff
0ea1f367d7 use litellm proxy with vertex ai sdk 2024-08-21 17:47:01 -07:00
Ishaan Jaff
f947cec7fc add test vtx embedding 2024-08-21 17:05:47 -07:00
Ishaan Jaff
6cb3675a06 fix using prompt caching on proxy 2024-08-15 20:12:11 -07:00
Ishaan Jaff
a3dd3a19fa docs cachedContent endpoint 2024-08-08 16:06:23 -07:00
Ishaan Jaff
909977ace6 add example using litellm proxy with gemini context caching 2024-08-08 11:35:41 -07:00
Ishaan Jaff
68e94f0976 example mistral sdk 2024-07-25 19:48:54 -07:00
Ishaan Jaff
673105c88f update tests 2024-07-22 14:44:47 -07:00
Ishaan Jaff
40d9278dcb test - pass through langfuse requests 2024-06-28 17:28:21 -07:00
Ishaan Jaff
d7f1445615 doc - OTEL trace propogation 2024-06-11 14:25:33 -07:00
Ishaan Jaff
d829e6a297 test - propogate trace IDs across services 2024-06-11 14:00:25 -07:00
Ishaan Jaff
8bc02b34c2 test -base64 cache hits 2024-04-10 16:46:56 -07:00
Ishaan Jaff
28e4706bfd test - re-order embedding responses 2024-04-08 12:02:40 -07:00
Krrish Dholakia
49e8cdbff9 fix(router.py): check for context window error when handling 400 status code errors
was causing proxy context window fallbacks to not work as expected
2024-03-26 08:08:15 -07:00
Krrish Dholakia
eb3ca85d7e feat(router.py): enable pre-call checks
filter models outside of context window limits of a given message for a model group

 https://github.com/BerriAI/litellm/issues/872
2024-03-23 18:03:30 -07:00
Ishaan Jaff
aca6ec85e2 (fix) add some better load testing 2024-03-22 19:48:54 -07:00
Krrish Dholakia
8fefe625d9 fix(proxy/utils.py): batch writing updates to db 2024-03-18 16:47:02 -07:00
Krish Dholakia
c022568a3a
Merge branch 'main' into litellm_faster_api_key_checking 2024-03-09 18:45:03 -08:00
Krrish Dholakia
0cf056f493 fix(proxy_server.py): use argon2 for faster api key checking
0.04s latency boost on load test
2024-03-07 21:48:18 -08:00
ishaan-jaff
6f0faca85b (feat) print debug info per deployment 2024-03-07 18:33:09 -08:00
ishaan-jaff
735728c4dd (feat) add better debugging when DB read/write fails 2024-03-05 18:16:59 -08:00
ishaan-jaff
7aa1c8556d (test) large request 2024-02-12 21:49:47 -08:00
ishaan-jaff
09c36c6e78 (test) llama index VectorStoreIndex 2024-02-09 16:49:03 -08:00
ishaan-jaff
782e84a421 (test) Proxy llama index request 2024-02-09 16:09:03 -08:00
ishaan-jaff
bd37a9cb5e (fix) proxy - streaming sagemaker 2024-01-23 11:12:16 -08:00
ishaan-jaff
bc3332fc7c (docs) use proxy + Sagemaker stream test 2024-01-22 22:11:06 -08:00
ishaan-jaff
e6b5152e63 (chore) update load test 2024-01-19 08:52:17 -08:00
ishaan-jaff
0414e40d4a (docs) also test gpt-4 vision enhancements 2024-01-17 18:46:41 -08:00
ishaan-jaff
44a7e4157c (test) proxy - add openai exception mapping error 2024-01-15 09:56:20 -08:00
ishaan-jaff
746213f5d1 (test) proxy - add JS OpenAI test 2024-01-15 09:53:57 -08:00
ishaan-jaff
4680a26e2e (test) proxy - load test 2024-01-03 16:16:18 +05:30
ishaan-jaff
0acaaf8f8f (test) sustained load test proxy 2024-01-02 12:10:34 +05:30
ishaan-jaff
c8f8bd9e57 (test) proxy - log metadata to langfuse 2024-01-01 11:54:16 +05:30
ishaan-jaff
694956b44e (test) proxy - pass metadata to openai client 2024-01-01 11:12:57 +05:30
Krrish Dholakia
4905929de3 refactor: add black formatting 2023-12-25 14:11:20 +05:30
ishaan-jaff
f09ce1d316 (test) proxy - test exception mapping 2023-12-15 08:28:12 +05:30
Krrish Dholakia
397eefabe1 test: remove local test 2023-12-05 12:45:52 -08:00
ishaan-jaff
d5f67a0a25 (docs) proxy + langchain 2023-12-05 12:20:42 -08:00
ishaan-jaff
a0f8bf23ab (test) proxy: langchain compatible 2023-12-05 11:45:35 -08:00
Krrish Dholakia
6b1b1b82cf fix(proxy_server.py): hash keys 2023-12-02 19:30:03 -08:00
ishaan-jaff
2d55cc753e (feat) proxy: use orjson 2023-11-30 19:50:47 -08:00
ishaan-jaff
853af29a25 (test) load test embedding 2023-11-30 19:04:51 -08:00
ishaan-jaff
be8bdb580a (test) proxy + router: add bursty load test 2023-11-30 13:17:11 -08:00
ishaan-jaff
d3672452ce (test) 1k requests 2023-11-29 16:22:18 -08:00
ishaan-jaff
ee6f5a84db (test) load test completion 2023-11-28 15:44:56 -08:00
ishaan-jaff
40d9e8ab23 (test) load test 2023-11-27 18:08:47 -08:00
ishaan-jaff
8560794963 (test) load test router 2023-11-27 16:37:57 -08:00
ishaan-jaff
5e2c13fb11 (test) load test proxy completion 2023-11-27 12:13:21 -08:00
ishaan-jaff
dca0a5ad0f (test) load test embedding: proxy 2023-11-24 17:14:44 -08:00
ishaan-jaff
111c7afaca (docs) proxy performance 2023-11-24 17:07:46 -08:00