Commit graph

57 commits

Author SHA1 Message Date
ishaan-jaff
5e065ebb8f (test) router - explcitly call one deployment 2023-12-05 21:57:00 -08:00
ishaan-jaff
0eccc1b1f8 (test) router: call 1 deployment 2023-12-05 17:35:35 -08:00
ishaan-jaff
1fa9ddd739 (chore) linting fix 2023-12-05 17:29:09 -08:00
ishaan-jaff
703a575a5d (test) call 1 deployment on router 2023-12-05 16:56:38 -08:00
ishaan-jaff
c717ed4d05 (test) router: test async embedding + embedding 2023-12-05 14:28:23 -08:00
ishaan-jaff
3ff57493f4 (test) router: openai async, sync, stream, no stream 2023-12-05 14:21:37 -08:00
ishaan-jaff
bc70a6fba8 (test) router: add tests for azure completion, acompletion 2023-12-05 13:59:27 -08:00
ishaan-jaff
0d1b42eda5 (test) azure - test async + sync embedding 2023-12-05 13:35:05 -08:00
ishaan-jaff
1463cc6023 (test) router Azure regular chat completion call 2023-12-05 13:28:07 -08:00
ishaan-jaff
e579918dd9 (test) Router: Test Azure acompletion, stream 2023-12-05 13:22:27 -08:00
ishaan-jaff
5829227d86 (test) router streaming + azure 2023-12-05 12:54:00 -08:00
ishaan-jaff
3bdf61f02a (test) test bedrock on router 2023-12-05 09:35:08 -08:00
ishaan-jaff
3891462b29 (fix) router: azure/embedding support 2023-11-29 19:06:36 -08:00
ishaan-jaff
305faab542 (test) router:get_available_deployment 2023-11-29 17:54:41 -08:00
ishaan-jaff
7a38a45d62 (test) test weighted selection router 2023-11-29 17:30:18 -08:00
Krrish Dholakia
094144de58 fix(router.py): removing model id before making call 2023-11-28 10:09:45 -08:00
Krrish Dholakia
ab0bc87427 fix(router.py): check if fallbacks is none 2023-11-25 14:58:07 -08:00
Krrish Dholakia
16e1070dbe test: refactor testing order 2023-11-24 12:47:28 -08:00
Krrish Dholakia
3dcbf6197a test(test_router.py): reintegrate router testing 2023-11-24 11:48:24 -08:00
Krrish Dholakia
c75e90663c test(test_stream_chunk_builder.py): fix setting api key 2023-11-24 11:47:48 -08:00
Krrish Dholakia
4a5dae3941 fix(main.py): fix streaming_chunk_builder to return usage 2023-11-24 11:27:04 -08:00
ishaan-jaff
ba9dbbe6d5 (test) router: acompletion + caching 2023-11-23 21:05:48 -08:00
Krrish Dholakia
187403c5cc fix(router.py): add modelgroup to call metadata 2023-11-23 20:55:49 -08:00
ishaan-jaff
db8ed601b5 (test) temp comment router to test ci/cd 2023-11-23 20:28:43 -08:00
Krrish Dholakia
7d221fe863 fix(utils.py): make failure logging sync 2023-11-23 20:19:27 -08:00
ishaan-jaff
704af2ca34 (test) testing fix 2023-11-23 17:20:22 -08:00
ishaan-jaff
3118101e2c (test) router: raises exception 2023-11-23 17:09:36 -08:00
ishaan-jaff
60dab0d700 (test) router: azure pass key in model list 2023-11-23 16:51:52 -08:00
ishaan-jaff
a1bb880872 (test) router raise exceptions 2023-11-23 16:28:19 -08:00
ishaan-jaff
1365092c89 (test) router: ensure errors are raised 2023-11-23 16:08:59 -08:00
Krrish Dholakia
61fc76a8c4 fix(router.py): fix caching for tracking cooldowns + usage 2023-11-23 11:13:32 -08:00
ishaan-jaff
db146bc40a (test) router with fallback deployments 2023-11-22 20:52:56 -08:00
Krrish Dholakia
5d5ca9f7ef fix(router.py): add support for cooldowns with redis 2023-11-22 19:54:22 -08:00
Krrish Dholakia
3e76d4b422 feat(router.py): add server cooldown logic 2023-11-22 15:59:48 -08:00
Krrish Dholakia
10fe16c965 fix(utils.py): add param mapping for perplexity, anyscale, deepinfra
n

n
2023-11-22 10:04:27 -08:00
Krrish Dholakia
76f46902ed feat(router.py): adding latency-based routing strategy 2023-11-21 21:19:27 -08:00
Krrish Dholakia
478bd7def6 fix(router.py): introducing usage-based-routing 2023-11-17 17:56:26 -08:00
Krrish Dholakia
a99efc544e fix(test_router.py): catch timeouts 2023-11-17 15:56:06 -08:00
Krrish Dholakia
0ab6b2451d fix(acompletion): support client side timeouts + raise exceptions correctly for async calls 2023-11-17 15:39:47 -08:00
Krrish Dholakia
59eaeba92a fix(router.py): enabling retrying with expo backoff (without tenacity) for router 2023-11-14 20:57:51 -08:00
Krrish Dholakia
a7222f257c fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors 2023-11-14 16:26:05 -08:00
Krrish Dholakia
c6ce3fedcd fix(main.py): fix caching for router 2023-11-11 17:45:23 -08:00
Krrish Dholakia
40edb546dc test(test_streaming.py): set cache to none 2023-11-10 15:04:01 -08:00
Krrish Dholakia
18a8bd5543 fix(utils.py): return function call as part of response object 2023-11-10 11:02:10 -08:00
Krrish Dholakia
a4c9e6bd46 fix(utils.py): fix cached responses - translate dict to objects 2023-11-10 10:38:20 -08:00
ishaan-jaff
b8ebab5d4f (test) router functionality 2023-11-08 09:48:48 -08:00
Krrish Dholakia
6a45879932 bump: version 0.13.6.dev3 → 0.13.6 2023-11-06 18:19:20 -08:00
Krrish Dholakia
65c01eae23 fix(huggingface_restapi.py): output parsing chat template models 2023-11-06 11:43:12 -08:00
mc-marcocheng
bf98d48bba Feature: Router aembedding 2023-10-25 11:19:45 +08:00
Krrish Dholakia
0f08335edd fix(router.py): adding support for async completion calls
https://github.com/BerriAI/litellm/issues/676
2023-10-24 17:20:31 -07:00