Ishaan Jaff
|
68e94f0976
|
example mistral sdk
|
2024-07-25 19:48:54 -07:00 |
|
Ishaan Jaff
|
673105c88f
|
update tests
|
2024-07-22 14:44:47 -07:00 |
|
Ishaan Jaff
|
40d9278dcb
|
test - pass through langfuse requests
|
2024-06-28 17:28:21 -07:00 |
|
Ishaan Jaff
|
d7f1445615
|
doc - OTEL trace propagation
|
2024-06-11 14:25:33 -07:00 |
|
Ishaan Jaff
|
d829e6a297
|
test - propagate trace IDs across services
|
2024-06-11 14:00:25 -07:00 |
|
Ishaan Jaff
|
8bc02b34c2
|
test - base64 cache hits
|
2024-04-10 16:46:56 -07:00 |
|
Ishaan Jaff
|
28e4706bfd
|
test - re-order embedding responses
|
2024-04-08 12:02:40 -07:00 |
|
Krrish Dholakia
|
49e8cdbff9
|
fix(router.py): check for context window error when handling 400 status code errors
was causing proxy context window fallbacks to not work as expected
|
2024-03-26 08:08:15 -07:00 |
|
Krrish Dholakia
|
eb3ca85d7e
|
feat(router.py): enable pre-call checks
filter models outside of context window limits of a given message for a model group
https://github.com/BerriAI/litellm/issues/872
|
2024-03-23 18:03:30 -07:00 |
|
Ishaan Jaff
|
aca6ec85e2
|
(fix) add some better load testing
|
2024-03-22 19:48:54 -07:00 |
|
Krrish Dholakia
|
8fefe625d9
|
fix(proxy/utils.py): batch writing updates to db
|
2024-03-18 16:47:02 -07:00 |
|
Krish Dholakia
|
c022568a3a
|
Merge branch 'main' into litellm_faster_api_key_checking
|
2024-03-09 18:45:03 -08:00 |
|
Krrish Dholakia
|
0cf056f493
|
fix(proxy_server.py): use argon2 for faster api key checking
0.04s latency boost on load test
|
2024-03-07 21:48:18 -08:00 |
|
ishaan-jaff
|
6f0faca85b
|
(feat) print debug info per deployment
|
2024-03-07 18:33:09 -08:00 |
|
ishaan-jaff
|
735728c4dd
|
(feat) add better debugging when DB read/write fails
|
2024-03-05 18:16:59 -08:00 |
|
ishaan-jaff
|
7aa1c8556d
|
(test) large request
|
2024-02-12 21:49:47 -08:00 |
|
ishaan-jaff
|
09c36c6e78
|
(test) llama index VectorStoreIndex
|
2024-02-09 16:49:03 -08:00 |
|
ishaan-jaff
|
782e84a421
|
(test) Proxy llama index request
|
2024-02-09 16:09:03 -08:00 |
|
ishaan-jaff
|
bd37a9cb5e
|
(fix) proxy - streaming sagemaker
|
2024-01-23 11:12:16 -08:00 |
|
ishaan-jaff
|
bc3332fc7c
|
(docs) use proxy + Sagemaker stream test
|
2024-01-22 22:11:06 -08:00 |
|
ishaan-jaff
|
e6b5152e63
|
(chore) update load test
|
2024-01-19 08:52:17 -08:00 |
|
ishaan-jaff
|
0414e40d4a
|
(docs) also test gpt-4 vision enhancements
|
2024-01-17 18:46:41 -08:00 |
|
ishaan-jaff
|
44a7e4157c
|
(test) proxy - add openai exception mapping error
|
2024-01-15 09:56:20 -08:00 |
|
ishaan-jaff
|
746213f5d1
|
(test) proxy - add JS OpenAI test
|
2024-01-15 09:53:57 -08:00 |
|
ishaan-jaff
|
4680a26e2e
|
(test) proxy - load test
|
2024-01-03 16:16:18 +05:30 |
|
ishaan-jaff
|
0acaaf8f8f
|
(test) sustained load test proxy
|
2024-01-02 12:10:34 +05:30 |
|
ishaan-jaff
|
c8f8bd9e57
|
(test) proxy - log metadata to langfuse
|
2024-01-01 11:54:16 +05:30 |
|
ishaan-jaff
|
694956b44e
|
(test) proxy - pass metadata to openai client
|
2024-01-01 11:12:57 +05:30 |
|
Krrish Dholakia
|
4905929de3
|
refactor: add black formatting
|
2023-12-25 14:11:20 +05:30 |
|
ishaan-jaff
|
f09ce1d316
|
(test) proxy - test exception mapping
|
2023-12-15 08:28:12 +05:30 |
|
Krrish Dholakia
|
397eefabe1
|
test: remove local test
|
2023-12-05 12:45:52 -08:00 |
|
ishaan-jaff
|
d5f67a0a25
|
(docs) proxy + langchain
|
2023-12-05 12:20:42 -08:00 |
|
ishaan-jaff
|
a0f8bf23ab
|
(test) proxy: langchain compatible
|
2023-12-05 11:45:35 -08:00 |
|
Krrish Dholakia
|
6b1b1b82cf
|
fix(proxy_server.py): hash keys
|
2023-12-02 19:30:03 -08:00 |
|
ishaan-jaff
|
2d55cc753e
|
(feat) proxy: use orjson
|
2023-11-30 19:50:47 -08:00 |
|
ishaan-jaff
|
853af29a25
|
(test) load test embedding
|
2023-11-30 19:04:51 -08:00 |
|
ishaan-jaff
|
be8bdb580a
|
(test) proxy + router: add bursty load test
|
2023-11-30 13:17:11 -08:00 |
|
ishaan-jaff
|
d3672452ce
|
(test) 1k requests
|
2023-11-29 16:22:18 -08:00 |
|
ishaan-jaff
|
ee6f5a84db
|
(test) load test completion
|
2023-11-28 15:44:56 -08:00 |
|
ishaan-jaff
|
40d9e8ab23
|
(test) load test
|
2023-11-27 18:08:47 -08:00 |
|
ishaan-jaff
|
8560794963
|
(test) load test router
|
2023-11-27 16:37:57 -08:00 |
|
ishaan-jaff
|
5e2c13fb11
|
(test) load test proxy completion
|
2023-11-27 12:13:21 -08:00 |
|
ishaan-jaff
|
dca0a5ad0f
|
(test) load test embedding: proxy
|
2023-11-24 17:14:44 -08:00 |
|
ishaan-jaff
|
111c7afaca
|
(docs) proxy performance
|
2023-11-24 17:07:46 -08:00 |
|
Krrish Dholakia
|
16e1070dbe
|
test: refactor testing order
|
2023-11-24 12:47:28 -08:00 |
|
Krrish Dholakia
|
2a033fd8a2
|
test(test_router_cooldowns.py): adding logging
|
2023-11-24 12:30:08 -08:00 |
|
Krrish Dholakia
|
2f93c0155a
|
fix: fix linting errors
|
2023-11-22 19:59:25 -08:00 |
|
ishaan-jaff
|
cb41b14cc2
|
(test) proxy test exception mapping
|
2023-11-22 16:22:05 -08:00 |
|
Krrish Dholakia
|
e7bb4a0cbd
|
fix(proxy_server): fix linting issues
|
2023-11-22 08:47:59 -08:00 |
|
ishaan-jaff
|
2f3e13e43b
|
(test) load test with api.litellm.ai
|
2023-11-21 21:07:27 -08:00 |
|