Krrish Dholakia
|
821844c1a3
|
fix(router.py): fix max retries on set_client
|
2024-04-24 22:03:01 -07:00 |
|
Krrish Dholakia
|
84d43484c6
|
fix(router.py): make sure pre call rpm check runs even when model not in model cost map
|
2024-04-11 09:27:46 -07:00 |
|
Krrish Dholakia
|
a47a719caa
|
fix(router.py): generate consistent model id's
having the same id for a deployment, lets redis usage caching work across multiple instances
|
2024-04-10 15:23:57 -07:00 |
|
Ishaan Jaff
|
a55f3cdace
|
test - router re-use openai client
|
2024-04-06 11:33:17 -07:00 |
|
Krrish Dholakia
|
2e40ab959d
|
test(test_router.py): fix casting
|
2024-04-04 13:54:16 -07:00 |
|
Krrish Dholakia
|
c372c873a0
|
test(test_router.py): fix test to check cast
|
2024-04-04 13:32:50 -07:00 |
|
Krrish Dholakia
|
b9030be792
|
test(test_router.py): fix test to check type
|
2024-04-04 11:45:12 -07:00 |
|
Krrish Dholakia
|
f536fb13e6
|
fix(proxy_server.py): persist models added via /model/new to db
allows models to be used across instances
https://github.com/BerriAI/litellm/issues/2319 , https://github.com/BerriAI/litellm/issues/2329
|
2024-04-03 20:16:41 -07:00 |
|
Krrish Dholakia
|
52b1538b2e
|
fix(router.py): support context window fallbacks for pre-call checks
|
2024-04-01 10:51:54 -07:00 |
|
Ishaan Jaff
|
6d408dcce7
|
(fix) test aimg gen on router
|
2024-03-28 12:27:26 -07:00 |
|
Krrish Dholakia
|
49e8cdbff9
|
fix(router.py): check for context window error when handling 400 status code errors
was causing proxy context window fallbacks to not work as expected
|
2024-03-26 08:08:15 -07:00 |
|
Krrish Dholakia
|
e8e7964025
|
docs(routing.md): add pre-call checks to docs
|
2024-03-23 19:10:34 -07:00 |
|
Krrish Dholakia
|
b7321ae4ee
|
fix(router.py): fix pre call check logic
|
2024-03-23 18:56:08 -07:00 |
|
Krrish Dholakia
|
eb3ca85d7e
|
feat(router.py): enable pre-call checks
filter models outside of context window limits of a given message for a model group
https://github.com/BerriAI/litellm/issues/872
|
2024-03-23 18:03:30 -07:00 |
|
Krrish Dholakia
|
478307d4cf
|
fix(bedrock.py): support anthropic messages api on bedrock (claude-3)
|
2024-03-04 17:15:47 -08:00 |
|
ishaan-jaff
|
9bac163e4e
|
(test) claude-instant-1
|
2024-03-04 08:32:13 -08:00 |
|
Krrish Dholakia
|
4c951d20bc
|
test: removing aws tests - account suspended - pending their approval
|
2024-02-28 13:46:20 -08:00 |
|
ishaan-jaff
|
693efc8e84
|
(feat) add moderation on router
|
2024-02-14 11:00:09 -08:00 |
|
ishaan-jaff
|
b0902f0a8c
|
(ci/cd) add more logging to timeout test
|
2024-01-23 18:39:19 -08:00 |
|
ishaan-jaff
|
b40176810e
|
(test) dynamic timeouts - router
|
2024-01-23 13:27:49 -08:00 |
|
Krrish Dholakia
|
05754ef238
|
test(test_router.py): add more testing for dynamically passing params to router
|
2024-01-23 10:31:49 -08:00 |
|
ishaan-jaff
|
3f37cf0bec
|
(test) router.py revert to stable version
|
2024-01-23 09:05:19 -08:00 |
|
Krrish Dholakia
|
ca5fb3f7f1
|
test(test_router.py): fix test check
|
2024-01-23 07:07:30 -08:00 |
|
Krrish Dholakia
|
53b879bc6c
|
fix(router.py): ensure no unsupported args are passed to completion()
|
2024-01-22 22:33:06 -08:00 |
|
Krrish Dholakia
|
f19f0dad89
|
fix(router.py): fix client init
|
2024-01-22 22:15:39 -08:00 |
|
Krrish Dholakia
|
969594a4b1
|
test(test_router.py): handle rate limiting error
|
2024-01-11 00:00:17 +05:30 |
|
Krrish Dholakia
|
990c32a5d6
|
test(test_router.py): handle image gen timeouts
|
2024-01-10 20:56:52 +05:30 |
|
Krrish Dholakia
|
e97eff4243
|
test(test_router.py): fix router test
|
2024-01-09 11:08:35 +05:30 |
|
Krrish Dholakia
|
3935f99083
|
test(test_router.py): add retries
|
2023-12-30 15:54:46 +05:30 |
|
Krrish Dholakia
|
69935db239
|
fix(router.py): periodically re-initialize azure/openai clients to solve max conn issue
|
2023-12-30 15:48:34 +05:30 |
|
Krrish Dholakia
|
4905929de3
|
refactor: add black formatting
|
2023-12-25 14:11:20 +05:30 |
|
Krrish Dholakia
|
14115d0d60
|
feat(proxy_server.py): add new images/generation endpoint
|
2023-12-21 15:39:09 +05:30 |
|
Krrish Dholakia
|
be68796eba
|
fix(router.py): add support for async image generation endpoints
|
2023-12-21 14:38:44 +05:30 |
|
Krrish Dholakia
|
04bbd0649f
|
fix(router.py): only do sync image gen fallbacks for now
The customhttptransport we use for dall-e-2 only works for sync httpx calls, not async. Will need to spend some time writing the async version
n
|
2023-12-20 19:10:59 +05:30 |
|
Krrish Dholakia
|
4040f60feb
|
feat(router.py): support async image generation on router
|
2023-12-20 17:24:20 +05:30 |
|
Krrish Dholakia
|
5fe5149070
|
refactor(test_router_caching.py): add tests for router caching
|
2023-12-15 20:38:51 -08:00 |
|
Krrish Dholakia
|
e5268fa6bc
|
fix(router.py): support openai-compatible endpoints
|
2023-12-15 14:47:54 -08:00 |
|
ishaan-jaff
|
26c6c1a03e
|
(test) router - acompletion + cache
|
2023-12-14 19:46:56 +05:30 |
|
Krrish Dholakia
|
edbf97adf2
|
test: testing fixes
|
2023-12-12 10:57:51 -08:00 |
|
Krrish Dholakia
|
3e908bf507
|
fix(router.py): reset caching correctly
|
2023-12-11 19:57:34 -08:00 |
|
ishaan-jaff
|
da08be1f43
|
(test) router + caching
|
2023-12-09 15:41:36 -08:00 |
|
ishaan-jaff
|
a9f103995d
|
(test) async completion + caching on router
|
2023-12-09 14:25:38 -08:00 |
|
Krrish Dholakia
|
b4aa9f0ae4
|
test(test_router.py): fix init issue
|
2023-12-09 12:28:51 -08:00 |
|
Krrish Dholakia
|
a250489405
|
test(test_router.py): run as asyncio.run
|
2023-12-09 12:28:28 -08:00 |
|
Krrish Dholakia
|
4bf875d3ed
|
fix(router.py): fix least-busy routing
|
2023-12-08 20:29:49 -08:00 |
|
Krrish Dholakia
|
dfba305508
|
test: test_router.py
fix router testing
n
|
2023-12-07 20:10:36 -08:00 |
|
ishaan-jaff
|
cc48b35a8d
|
(test) router - read os.environ/ OpenAI
|
2023-12-06 10:56:27 -08:00 |
|
ishaan-jaff
|
8f47293ce8
|
(chore) linting fix
|
2023-12-06 10:48:01 -08:00 |
|
ishaan-jaff
|
11a8713a50
|
(test) router - set sync stream client
|
2023-12-06 10:08:37 -08:00 |
|
ishaan-jaff
|
bd0579703c
|
(test) router - reading os.environ/ with client
|
2023-12-06 09:26:21 -08:00 |
|