Ishaan Jaff
|
a7cb2ed21f
|
test - pass through langfuse requests
|
2024-06-28 17:28:21 -07:00 |
|
Ishaan Jaff
|
8d3bda27eb
|
doc - OTEL trace propagation
|
2024-06-11 14:25:33 -07:00 |
|
Ishaan Jaff
|
8ce9c3a2b6
|
test - propagate trace IDs across services
|
2024-06-11 14:00:25 -07:00 |
|
Ishaan Jaff
|
4058327b43
|
test - base64 cache hits
|
2024-04-10 16:46:56 -07:00 |
|
Ishaan Jaff
|
95debe0e6a
|
test - re-order embedding responses
|
2024-04-08 12:02:40 -07:00 |
|
Krrish Dholakia
|
00d27a324d
|
fix(router.py): check for context window error when handling 400 status code errors
was causing proxy context window fallbacks to not work as expected
|
2024-03-26 08:08:15 -07:00 |
|
Krrish Dholakia
|
4e70a3e09a
|
feat(router.py): enable pre-call checks
filter models outside of context window limits of a given message for a model group
https://github.com/BerriAI/litellm/issues/872
|
2024-03-23 18:03:30 -07:00 |
|
Ishaan Jaff
|
c6ffd456ff
|
(fix) add some better load testing
|
2024-03-22 19:48:54 -07:00 |
|
Krrish Dholakia
|
1b10123f07
|
fix(proxy/utils.py): batch writing updates to db
|
2024-03-18 16:47:02 -07:00 |
|
Krish Dholakia
|
ba9d96e65e
|
Merge branch 'main' into litellm_faster_api_key_checking
|
2024-03-09 18:45:03 -08:00 |
|
Krrish Dholakia
|
b81c94e94f
|
fix(proxy_server.py): use argon2 for faster api key checking
0.04s latency boost on load test
|
2024-03-07 21:48:18 -08:00 |
|
ishaan-jaff
|
db002315e3
|
(feat) print debug info per deployment
|
2024-03-07 18:33:09 -08:00 |
|
ishaan-jaff
|
79f5e36540
|
(feat) add better debugging when DB read/write fails
|
2024-03-05 18:16:59 -08:00 |
|
ishaan-jaff
|
c40d31558d
|
(test) large request
|
2024-02-12 21:49:47 -08:00 |
|
ishaan-jaff
|
3c06d2e34b
|
(test) llama index VectorStoreIndex
|
2024-02-09 16:49:03 -08:00 |
|
ishaan-jaff
|
7a194de52b
|
(test) Proxy llama index request
|
2024-02-09 16:09:03 -08:00 |
|
ishaan-jaff
|
1f229a46ad
|
(fix) proxy - streaming sagemaker
|
2024-01-23 11:12:16 -08:00 |
|
ishaan-jaff
|
4353c9422f
|
(docs) use proxy + Sagemaker stream test
|
2024-01-22 22:11:06 -08:00 |
|
ishaan-jaff
|
54c84e15e6
|
(chore) update load test
|
2024-01-19 08:52:17 -08:00 |
|
ishaan-jaff
|
9cf86a04a5
|
(docs) also test gpt-4 vision enhancements
|
2024-01-17 18:46:41 -08:00 |
|
ishaan-jaff
|
936df75a1a
|
(test) proxy - add openai exception mapping error
|
2024-01-15 09:56:20 -08:00 |
|
ishaan-jaff
|
b4218439a5
|
(test) proxy - add JS OpenAI test
|
2024-01-15 09:53:57 -08:00 |
|
ishaan-jaff
|
f569b84ed2
|
(test) proxy - load test
|
2024-01-03 16:16:18 +05:30 |
|
ishaan-jaff
|
4f1df08f54
|
(test) sustained load test proxy
|
2024-01-02 12:10:34 +05:30 |
|
ishaan-jaff
|
99156edf82
|
(test) proxy - log metadata to langfuse
|
2024-01-01 11:54:16 +05:30 |
|
ishaan-jaff
|
bf0329e313
|
(test) proxy - pass metadata to openai client
|
2024-01-01 11:12:57 +05:30 |
|
Krrish Dholakia
|
79978c44ba
|
refactor: add black formatting
|
2023-12-25 14:11:20 +05:30 |
|
ishaan-jaff
|
7a1e10c371
|
(test) proxy - test exception mapping
|
2023-12-15 08:28:12 +05:30 |
|
Krrish Dholakia
|
c2acdfce09
|
test: remove local test
|
2023-12-05 12:45:52 -08:00 |
|
ishaan-jaff
|
7ba22141ed
|
(docs) proxy + langchain
|
2023-12-05 12:20:42 -08:00 |
|
ishaan-jaff
|
67272e1d8e
|
(test) proxy: langchain compatible
|
2023-12-05 11:45:35 -08:00 |
|
Krrish Dholakia
|
6015bff80b
|
fix(proxy_server.py): hash keys
|
2023-12-02 19:30:03 -08:00 |
|
ishaan-jaff
|
c9e21d97cd
|
(feat) proxy: use orjson
|
2023-11-30 19:50:47 -08:00 |
|
ishaan-jaff
|
ae6e852219
|
(test) load test embedding
|
2023-11-30 19:04:51 -08:00 |
|
ishaan-jaff
|
b6ffcd00b9
|
(test) proxy + router: add bursty load test
|
2023-11-30 13:17:11 -08:00 |
|
ishaan-jaff
|
700fee3eba
|
(test) 1k requests
|
2023-11-29 16:22:18 -08:00 |
|
ishaan-jaff
|
bcc58e16be
|
(test) load test completion
|
2023-11-28 15:44:56 -08:00 |
|
ishaan-jaff
|
547edd24e6
|
(test) load test
|
2023-11-27 18:08:47 -08:00 |
|
ishaan-jaff
|
aef3d2699f
|
(test) load test router
|
2023-11-27 16:37:57 -08:00 |
|
ishaan-jaff
|
367468d655
|
(test) load test proxy completion
|
2023-11-27 12:13:21 -08:00 |
|
ishaan-jaff
|
c0dfc8d9b3
|
(test) load test embedding: proxy
|
2023-11-24 17:14:44 -08:00 |
|
ishaan-jaff
|
32b7c236e6
|
(docs) proxy performance
|
2023-11-24 17:07:46 -08:00 |
|
Krrish Dholakia
|
9618718080
|
test: refactor testing order
|
2023-11-24 12:47:28 -08:00 |
|
Krrish Dholakia
|
f630901a77
|
test(test_router_cooldowns.py): adding logging
|
2023-11-24 12:30:08 -08:00 |
|
Krrish Dholakia
|
2df4791ae9
|
fix: fix linting errors
|
2023-11-22 19:59:25 -08:00 |
|
ishaan-jaff
|
52e2ac0106
|
(test) proxy test exception mapping
|
2023-11-22 16:22:05 -08:00 |
|
Krrish Dholakia
|
2a681e578c
|
fix(proxy_server): fix linting issues
|
2023-11-22 08:47:59 -08:00 |
|
ishaan-jaff
|
12c2d1411a
|
(test) load test with api.litellm.ai
|
2023-11-21 21:07:27 -08:00 |
|
ishaan-jaff
|
bd5c89aab9
|
(test) load test proxy
|
2023-11-21 21:04:46 -08:00 |
|
ishaan-jaff
|
67b7aba40f
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|