Commit graph

103 commits

Author SHA1 Message Date
Krrish Dholakia
c372c873a0 test(test_router.py): fix test to check cast 2024-04-04 13:32:50 -07:00
Krrish Dholakia
b9030be792 test(test_router.py): fix test to check type 2024-04-04 11:45:12 -07:00
Krrish Dholakia
f536fb13e6 fix(proxy_server.py): persist models added via /model/new to db
allows models to be used across instances

https://github.com/BerriAI/litellm/issues/2319 , https://github.com/BerriAI/litellm/issues/2329
2024-04-03 20:16:41 -07:00
Krrish Dholakia
52b1538b2e fix(router.py): support context window fallbacks for pre-call checks 2024-04-01 10:51:54 -07:00
Ishaan Jaff
6d408dcce7 (fix) test aimg gen on router 2024-03-28 12:27:26 -07:00
Krrish Dholakia
49e8cdbff9 fix(router.py): check for context window error when handling 400 status code errors
was causing proxy context window fallbacks to not work as expected
2024-03-26 08:08:15 -07:00
Krrish Dholakia
e8e7964025 docs(routing.md): add pre-call checks to docs 2024-03-23 19:10:34 -07:00
Krrish Dholakia
b7321ae4ee fix(router.py): fix pre call check logic 2024-03-23 18:56:08 -07:00
Krrish Dholakia
eb3ca85d7e feat(router.py): enable pre-call checks
filter models outside of context window limits of a given message for a model group

 https://github.com/BerriAI/litellm/issues/872
2024-03-23 18:03:30 -07:00
Krrish Dholakia
478307d4cf fix(bedrock.py): support anthropic messages api on bedrock (claude-3) 2024-03-04 17:15:47 -08:00
ishaan-jaff
9bac163e4e (test) claude-instant-1 2024-03-04 08:32:13 -08:00
Krrish Dholakia
4c951d20bc test: removing aws tests - account suspended - pending their approval 2024-02-28 13:46:20 -08:00
ishaan-jaff
693efc8e84 (feat) add moderation on router 2024-02-14 11:00:09 -08:00
ishaan-jaff
b0902f0a8c (ci/cd) add more logging to timeout test 2024-01-23 18:39:19 -08:00
ishaan-jaff
b40176810e (test) dynamic timeouts - router 2024-01-23 13:27:49 -08:00
Krrish Dholakia
05754ef238 test(test_router.py): add more testing for dynamically passing params to router 2024-01-23 10:31:49 -08:00
ishaan-jaff
3f37cf0bec (test) router.py revert to stable version 2024-01-23 09:05:19 -08:00
Krrish Dholakia
ca5fb3f7f1 test(test_router.py): fix test check 2024-01-23 07:07:30 -08:00
Krrish Dholakia
53b879bc6c fix(router.py): ensure no unsupported args are passed to completion() 2024-01-22 22:33:06 -08:00
Krrish Dholakia
f19f0dad89 fix(router.py): fix client init 2024-01-22 22:15:39 -08:00
Krrish Dholakia
969594a4b1 test(test_router.py): handle rate limiting error 2024-01-11 00:00:17 +05:30
Krrish Dholakia
990c32a5d6 test(test_router.py): handle image gen timeouts 2024-01-10 20:56:52 +05:30
Krrish Dholakia
e97eff4243 test(test_router.py): fix router test 2024-01-09 11:08:35 +05:30
Krrish Dholakia
3935f99083 test(test_router.py): add retries 2023-12-30 15:54:46 +05:30
Krrish Dholakia
69935db239 fix(router.py): periodically re-initialize azure/openai clients to solve max conn issue 2023-12-30 15:48:34 +05:30
Krrish Dholakia
4905929de3 refactor: add black formatting 2023-12-25 14:11:20 +05:30
Krrish Dholakia
14115d0d60 feat(proxy_server.py): add new images/generation endpoint 2023-12-21 15:39:09 +05:30
Krrish Dholakia
be68796eba fix(router.py): add support for async image generation endpoints 2023-12-21 14:38:44 +05:30
Krrish Dholakia
04bbd0649f fix(router.py): only do sync image gen fallbacks for now
The customhttptransport we use for dall-e-2 only works for sync httpx calls, not async. Will need to spend some time writing the async version

n
2023-12-20 19:10:59 +05:30
Krrish Dholakia
4040f60feb feat(router.py): support async image generation on router 2023-12-20 17:24:20 +05:30
Krrish Dholakia
5fe5149070 refactor(test_router_caching.py): add tests for router caching 2023-12-15 20:38:51 -08:00
Krrish Dholakia
e5268fa6bc fix(router.py): support openai-compatible endpoints 2023-12-15 14:47:54 -08:00
ishaan-jaff
26c6c1a03e (test) router - acompletion + cache 2023-12-14 19:46:56 +05:30
Krrish Dholakia
edbf97adf2 test: testing fixes 2023-12-12 10:57:51 -08:00
Krrish Dholakia
3e908bf507 fix(router.py): reset caching correctly 2023-12-11 19:57:34 -08:00
ishaan-jaff
da08be1f43 (test) router + caching 2023-12-09 15:41:36 -08:00
ishaan-jaff
a9f103995d (test) async completion + caching on router 2023-12-09 14:25:38 -08:00
Krrish Dholakia
b4aa9f0ae4 test(test_router.py): fix init issue 2023-12-09 12:28:51 -08:00
Krrish Dholakia
a250489405 test(test_router.py): run as asyncio.run 2023-12-09 12:28:28 -08:00
Krrish Dholakia
4bf875d3ed fix(router.py): fix least-busy routing 2023-12-08 20:29:49 -08:00
Krrish Dholakia
dfba305508 test: test_router.py
fix router testing

n
2023-12-07 20:10:36 -08:00
ishaan-jaff
cc48b35a8d (test) router - read os.environ/ OpenAI 2023-12-06 10:56:27 -08:00
ishaan-jaff
8f47293ce8 (chore) linting fix 2023-12-06 10:48:01 -08:00
ishaan-jaff
11a8713a50 (test) router - set sync stream client 2023-12-06 10:08:37 -08:00
ishaan-jaff
bd0579703c (test) router - reading os.environ/ with client 2023-12-06 09:26:21 -08:00
ishaan-jaff
527aadd1ab (test) router - reading os.environ/ variables 2023-12-06 09:19:51 -08:00
ishaan-jaff
5e065ebb8f (test) router - explcitly call one deployment 2023-12-05 21:57:00 -08:00
ishaan-jaff
0eccc1b1f8 (test) router: call 1 deployment 2023-12-05 17:35:35 -08:00
ishaan-jaff
1fa9ddd739 (chore) linting fix 2023-12-05 17:29:09 -08:00
ishaan-jaff
703a575a5d (test) call 1 deployment on router 2023-12-05 16:56:38 -08:00