Commit graph

155 commits

Author SHA1 Message Date
ishaan-jaff
66dc4784d2 (chore) linting fix 2023-12-05 17:29:09 -08:00
ishaan-jaff
7bee766b65 (test) call 1 deployment on router 2023-12-05 16:56:38 -08:00
ishaan-jaff
8b9206ae51 (test) router: test async embedding + embedding 2023-12-05 14:28:23 -08:00
ishaan-jaff
a1ac6aad8d (test) router: openai async, sync, stream, no stream 2023-12-05 14:21:37 -08:00
ishaan-jaff
e9ae3234e4 (test) router: add tests for azure completion, acompletion 2023-12-05 13:59:27 -08:00
ishaan-jaff
842c91ee31 (test) azure - test async + sync embedding 2023-12-05 13:35:05 -08:00
ishaan-jaff
1622a3e4fa (test) router Azure regular chat completion call 2023-12-05 13:28:07 -08:00
ishaan-jaff
0729770390 (test) Router: Test Azure acompletion, stream 2023-12-05 13:22:27 -08:00
ishaan-jaff
0b3ea42af9 (test) router streaming + azure 2023-12-05 12:54:00 -08:00
ishaan-jaff
8759a2f463 (test) test bedrock on router 2023-12-05 09:35:08 -08:00
ishaan-jaff
cf9f3d0005 (fix) router: azure/embedding support 2023-11-29 19:06:36 -08:00
ishaan-jaff
54678d5f9c (test) router:get_available_deployment 2023-11-29 17:54:41 -08:00
ishaan-jaff
871614c154 (test) test weighted selection router 2023-11-29 17:30:18 -08:00
Krrish Dholakia
32a14e6cf0 fix(router.py): removing model id before making call 2023-11-28 10:09:45 -08:00
Krrish Dholakia
2b9c5bf706 fix(router.py): check if fallbacks is none 2023-11-25 14:58:07 -08:00
Krrish Dholakia
9618718080 test: refactor testing order 2023-11-24 12:47:28 -08:00
Krrish Dholakia
d17a19cb0a test(test_router.py): reintegrate router testing 2023-11-24 11:48:24 -08:00
Krrish Dholakia
e3766b01a0 test(test_stream_chunk_builder.py): fix setting api key 2023-11-24 11:47:48 -08:00
Krrish Dholakia
5a9a3aa89c fix(main.py): fix streaming_chunk_builder to return usage 2023-11-24 11:27:04 -08:00
ishaan-jaff
f41683011a (test) router: acompletion + caching 2023-11-23 21:05:48 -08:00
Krrish Dholakia
3a8d7ec835 fix(router.py): add modelgroup to call metadata 2023-11-23 20:55:49 -08:00
ishaan-jaff
b6fe6b2839 (test) temp comment router to test ci/cd 2023-11-23 20:28:43 -08:00
Krrish Dholakia
1b26a0931e fix(utils.py): make failure logging sync 2023-11-23 20:19:27 -08:00
ishaan-jaff
7fda40e2be (test) testing fix 2023-11-23 17:20:22 -08:00
ishaan-jaff
b2bde6ef56 (test) router: raises exception 2023-11-23 17:09:36 -08:00
ishaan-jaff
f1df76e55e (test) router: azure pass key in model list 2023-11-23 16:51:52 -08:00
ishaan-jaff
49a6ebfa30 (test) router raise exceptions 2023-11-23 16:28:19 -08:00
ishaan-jaff
0cb21d7dae (test) router: ensure errors are raised 2023-11-23 16:08:59 -08:00
Krrish Dholakia
0e3064ac8c fix(router.py): fix caching for tracking cooldowns + usage 2023-11-23 11:13:32 -08:00
ishaan-jaff
3e9d09948e (test) router with fallback deployments 2023-11-22 20:52:56 -08:00
Krrish Dholakia
497419a766 fix(router.py): add support for cooldowns with redis 2023-11-22 19:54:22 -08:00
Krrish Dholakia
a2207d462e feat(router.py): add server cooldown logic 2023-11-22 15:59:48 -08:00
Krrish Dholakia
9bb2c7ee0f fix(utils.py): add param mapping for perplexity, anyscale, deepinfra 2023-11-22 10:04:27 -08:00
Krrish Dholakia
73d70ef01c feat(router.py): adding latency-based routing strategy 2023-11-21 21:19:27 -08:00
Krrish Dholakia
cf0a9f591c fix(router.py): introducing usage-based-routing 2023-11-17 17:56:26 -08:00
Krrish Dholakia
237e6a4c32 fix(test_router.py): catch timeouts 2023-11-17 15:56:06 -08:00
Krrish Dholakia
02ed97d0b2 fix(acompletion): support client side timeouts + raise exceptions correctly for async calls 2023-11-17 15:39:47 -08:00
Krrish Dholakia
0f6713993d fix(router.py): enabling retrying with expo backoff (without tenacity) for router 2023-11-14 20:57:51 -08:00
Krrish Dholakia
9b582b2c85 fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors 2023-11-14 16:26:05 -08:00
Krrish Dholakia
e7d1840d5f fix(main.py): fix caching for router 2023-11-11 17:45:23 -08:00
Krrish Dholakia
4c103592e3 test(test_streaming.py): set cache to none 2023-11-10 15:04:01 -08:00
Krrish Dholakia
548605def8 fix(utils.py): return function call as part of response object 2023-11-10 11:02:10 -08:00
Krrish Dholakia
67e8b12a09 fix(utils.py): fix cached responses - translate dict to objects 2023-11-10 10:38:20 -08:00
ishaan-jaff
6d85d6486b (test) router functionality 2023-11-08 09:48:48 -08:00
Krrish Dholakia
632533f2e2 bump: version 0.13.6.dev3 → 0.13.6 2023-11-06 18:19:20 -08:00
Krrish Dholakia
9baafd7dbf fix(huggingface_restapi.py): output parsing chat template models 2023-11-06 11:43:12 -08:00
mc-marcocheng
bd43b96d8d Feature: Router aembedding 2023-10-25 11:19:45 +08:00
Krrish Dholakia
3477486886 fix(router.py): adding support for async completion calls (https://github.com/BerriAI/litellm/issues/676) 2023-10-24 17:20:31 -07:00
Krrish Dholakia
c7e8bf3e80 test(test_router.py): fixing router testing 2023-10-24 10:21:10 -07:00
mc-marcocheng
c098d4a531 test_litellm_params_not_overwritten_by_function_calling 2023-10-24 22:04:45 +08:00