Commit graph

57 commits

Author SHA1 Message Date
Ishaan Jaff
a7cb2ed21f test - pass through langfuse requests 2024-06-28 17:28:21 -07:00
Ishaan Jaff
8d3bda27eb doc - OTEL trace propogation 2024-06-11 14:25:33 -07:00
Ishaan Jaff
8ce9c3a2b6 test - propogate trace IDs across services 2024-06-11 14:00:25 -07:00
Ishaan Jaff
4058327b43 test -base64 cache hits 2024-04-10 16:46:56 -07:00
Ishaan Jaff
95debe0e6a test - re-order embedding responses 2024-04-08 12:02:40 -07:00
Krrish Dholakia
00d27a324d fix(router.py): check for context window error when handling 400 status code errors
was causing proxy context window fallbacks to not work as expected
2024-03-26 08:08:15 -07:00
Krrish Dholakia
4e70a3e09a feat(router.py): enable pre-call checks
filter models outside of context window limits of a given message for a model group

 https://github.com/BerriAI/litellm/issues/872
2024-03-23 18:03:30 -07:00
Ishaan Jaff
c6ffd456ff (fix) add some better load testing 2024-03-22 19:48:54 -07:00
Krrish Dholakia
1b10123f07 fix(proxy/utils.py): batch writing updates to db 2024-03-18 16:47:02 -07:00
Krish Dholakia
ba9d96e65e Merge branch 'main' into litellm_faster_api_key_checking 2024-03-09 18:45:03 -08:00
Krrish Dholakia
b81c94e94f fix(proxy_server.py): use argon2 for faster api key checking
0.04s latency boost on load test
2024-03-07 21:48:18 -08:00
ishaan-jaff
db002315e3 (feat) print debug info per deployment 2024-03-07 18:33:09 -08:00
ishaan-jaff
79f5e36540 (feat) add better debugging when DB read/write fails 2024-03-05 18:16:59 -08:00
ishaan-jaff
c40d31558d (test) large request 2024-02-12 21:49:47 -08:00
ishaan-jaff
3c06d2e34b (test) llama index VectorStoreIndex 2024-02-09 16:49:03 -08:00
ishaan-jaff
7a194de52b (test) Proxy llama index request 2024-02-09 16:09:03 -08:00
ishaan-jaff
1f229a46ad (fix) proxy - streaming sagemaker 2024-01-23 11:12:16 -08:00
ishaan-jaff
4353c9422f (docs) use proxy + Sagemaker stream test 2024-01-22 22:11:06 -08:00
ishaan-jaff
54c84e15e6 (chore) update load test 2024-01-19 08:52:17 -08:00
ishaan-jaff
9cf86a04a5 (docs) also test gpt-4 vision enhancements 2024-01-17 18:46:41 -08:00
ishaan-jaff
936df75a1a (test) proxy - add openai exception mapping error 2024-01-15 09:56:20 -08:00
ishaan-jaff
b4218439a5 (test) proxy - add JS OpenAI test 2024-01-15 09:53:57 -08:00
ishaan-jaff
f569b84ed2 (test) proxy - load test 2024-01-03 16:16:18 +05:30
ishaan-jaff
4f1df08f54 (test) sustained load test proxy 2024-01-02 12:10:34 +05:30
ishaan-jaff
99156edf82 (test) proxy - log metadata to langfuse 2024-01-01 11:54:16 +05:30
ishaan-jaff
bf0329e313 (test) proxy - pass metadata to openai client 2024-01-01 11:12:57 +05:30
Krrish Dholakia
79978c44ba refactor: add black formatting 2023-12-25 14:11:20 +05:30
ishaan-jaff
7a1e10c371 (test) proxy - test exception mapping 2023-12-15 08:28:12 +05:30
Krrish Dholakia
c2acdfce09 test: remove local test 2023-12-05 12:45:52 -08:00
ishaan-jaff
7ba22141ed (docs) proxy + langchain 2023-12-05 12:20:42 -08:00
ishaan-jaff
67272e1d8e (test) proxy: langchain compatible 2023-12-05 11:45:35 -08:00
Krrish Dholakia
6015bff80b fix(proxy_server.py): hash keys 2023-12-02 19:30:03 -08:00
ishaan-jaff
c9e21d97cd (feat) proxy: use orjson 2023-11-30 19:50:47 -08:00
ishaan-jaff
ae6e852219 (test) load test embedding 2023-11-30 19:04:51 -08:00
ishaan-jaff
b6ffcd00b9 (test) proxy + router: add bursty load test 2023-11-30 13:17:11 -08:00
ishaan-jaff
700fee3eba (test) 1k requests 2023-11-29 16:22:18 -08:00
ishaan-jaff
bcc58e16be (test) load test completion 2023-11-28 15:44:56 -08:00
ishaan-jaff
547edd24e6 (test) load test 2023-11-27 18:08:47 -08:00
ishaan-jaff
aef3d2699f (test) load test router 2023-11-27 16:37:57 -08:00
ishaan-jaff
367468d655 (test) load test proxy completion 2023-11-27 12:13:21 -08:00
ishaan-jaff
c0dfc8d9b3 (test) load test embedding: proxy 2023-11-24 17:14:44 -08:00
ishaan-jaff
32b7c236e6 (docs) proxy performance 2023-11-24 17:07:46 -08:00
Krrish Dholakia
9618718080 test: refactor testing order 2023-11-24 12:47:28 -08:00
Krrish Dholakia
f630901a77 test(test_router_cooldowns.py): adding logging 2023-11-24 12:30:08 -08:00
Krrish Dholakia
2df4791ae9 fix: fix linting errors 2023-11-22 19:59:25 -08:00
ishaan-jaff
52e2ac0106 (test) proxy test exception mapping 2023-11-22 16:22:05 -08:00
Krrish Dholakia
2a681e578c fix(proxy_server): fix linting issues 2023-11-22 08:47:59 -08:00
ishaan-jaff
12c2d1411a (test) load test with api.litellm.ai 2023-11-21 21:07:27 -08:00
ishaan-jaff
bd5c89aab9 (test) load test proxy 2023-11-21 21:04:46 -08:00
ishaan-jaff
67b7aba40f (test) load test q 2023-11-21 20:48:56 -08:00