Krrish Dholakia
|
e8e7964025
|
docs(routing.md): add pre-call checks to docs
|
2024-03-23 19:10:34 -07:00 |
|
Krrish Dholakia
|
b7321ae4ee
|
fix(router.py): fix pre call check logic
|
2024-03-23 18:56:08 -07:00 |
|
Krrish Dholakia
|
eb3ca85d7e
|
feat(router.py): enable pre-call checks
filter models outside of context window limits of a given message for a model group
https://github.com/BerriAI/litellm/issues/872
|
2024-03-23 18:03:30 -07:00 |
|
Krrish Dholakia
|
478307d4cf
|
fix(bedrock.py): support anthropic messages api on bedrock (claude-3)
|
2024-03-04 17:15:47 -08:00 |
|
ishaan-jaff
|
9bac163e4e
|
(test) claude-instant-1
|
2024-03-04 08:32:13 -08:00 |
|
Krrish Dholakia
|
4c951d20bc
|
test: removing aws tests - account suspended - pending their approval
|
2024-02-28 13:46:20 -08:00 |
|
ishaan-jaff
|
693efc8e84
|
(feat) add moderation on router
|
2024-02-14 11:00:09 -08:00 |
|
ishaan-jaff
|
b0902f0a8c
|
(ci/cd) add more logging to timeout test
|
2024-01-23 18:39:19 -08:00 |
|
ishaan-jaff
|
b40176810e
|
(test) dynamic timeouts - router
|
2024-01-23 13:27:49 -08:00 |
|
Krrish Dholakia
|
05754ef238
|
test(test_router.py): add more testing for dynamically passing params to router
|
2024-01-23 10:31:49 -08:00 |
|
ishaan-jaff
|
3f37cf0bec
|
(test) router.py revert to stable version
|
2024-01-23 09:05:19 -08:00 |
|
Krrish Dholakia
|
ca5fb3f7f1
|
test(test_router.py): fix test check
|
2024-01-23 07:07:30 -08:00 |
|
Krrish Dholakia
|
53b879bc6c
|
fix(router.py): ensure no unsupported args are passed to completion()
|
2024-01-22 22:33:06 -08:00 |
|
Krrish Dholakia
|
f19f0dad89
|
fix(router.py): fix client init
|
2024-01-22 22:15:39 -08:00 |
|
Krrish Dholakia
|
969594a4b1
|
test(test_router.py): handle rate limiting error
|
2024-01-11 00:00:17 +05:30 |
|
Krrish Dholakia
|
990c32a5d6
|
test(test_router.py): handle image gen timeouts
|
2024-01-10 20:56:52 +05:30 |
|
Krrish Dholakia
|
e97eff4243
|
test(test_router.py): fix router test
|
2024-01-09 11:08:35 +05:30 |
|
Krrish Dholakia
|
3935f99083
|
test(test_router.py): add retries
|
2023-12-30 15:54:46 +05:30 |
|
Krrish Dholakia
|
69935db239
|
fix(router.py): periodically re-initialize azure/openai clients to solve max conn issue
|
2023-12-30 15:48:34 +05:30 |
|
Krrish Dholakia
|
4905929de3
|
refactor: add black formatting
|
2023-12-25 14:11:20 +05:30 |
|
Krrish Dholakia
|
14115d0d60
|
feat(proxy_server.py): add new images/generation endpoint
|
2023-12-21 15:39:09 +05:30 |
|
Krrish Dholakia
|
be68796eba
|
fix(router.py): add support for async image generation endpoints
|
2023-12-21 14:38:44 +05:30 |
|
Krrish Dholakia
|
04bbd0649f
|
fix(router.py): only do sync image gen fallbacks for now
The customhttptransport we use for dall-e-2 only works for sync httpx calls, not async. Will need to spend some time writing the async version
n
|
2023-12-20 19:10:59 +05:30 |
|
Krrish Dholakia
|
4040f60feb
|
feat(router.py): support async image generation on router
|
2023-12-20 17:24:20 +05:30 |
|
Krrish Dholakia
|
5fe5149070
|
refactor(test_router_caching.py): add tests for router caching
|
2023-12-15 20:38:51 -08:00 |
|
Krrish Dholakia
|
e5268fa6bc
|
fix(router.py): support openai-compatible endpoints
|
2023-12-15 14:47:54 -08:00 |
|
ishaan-jaff
|
26c6c1a03e
|
(test) router - acompletion + cache
|
2023-12-14 19:46:56 +05:30 |
|
Krrish Dholakia
|
edbf97adf2
|
test: testing fixes
|
2023-12-12 10:57:51 -08:00 |
|
Krrish Dholakia
|
3e908bf507
|
fix(router.py): reset caching correctly
|
2023-12-11 19:57:34 -08:00 |
|
ishaan-jaff
|
da08be1f43
|
(test) router + caching
|
2023-12-09 15:41:36 -08:00 |
|
ishaan-jaff
|
a9f103995d
|
(test) async completion + caching on router
|
2023-12-09 14:25:38 -08:00 |
|
Krrish Dholakia
|
b4aa9f0ae4
|
test(test_router.py): fix init issue
|
2023-12-09 12:28:51 -08:00 |
|
Krrish Dholakia
|
a250489405
|
test(test_router.py): run as asyncio.run
|
2023-12-09 12:28:28 -08:00 |
|
Krrish Dholakia
|
4bf875d3ed
|
fix(router.py): fix least-busy routing
|
2023-12-08 20:29:49 -08:00 |
|
Krrish Dholakia
|
dfba305508
|
test: test_router.py
fix router testing
n
|
2023-12-07 20:10:36 -08:00 |
|
ishaan-jaff
|
cc48b35a8d
|
(test) router - read os.environ/ OpenAI
|
2023-12-06 10:56:27 -08:00 |
|
ishaan-jaff
|
8f47293ce8
|
(chore) linting fix
|
2023-12-06 10:48:01 -08:00 |
|
ishaan-jaff
|
11a8713a50
|
(test) router - set sync stream client
|
2023-12-06 10:08:37 -08:00 |
|
ishaan-jaff
|
bd0579703c
|
(test) router - reading os.environ/ with client
|
2023-12-06 09:26:21 -08:00 |
|
ishaan-jaff
|
527aadd1ab
|
(test) router - reading os.environ/ variables
|
2023-12-06 09:19:51 -08:00 |
|
ishaan-jaff
|
5e065ebb8f
|
(test) router - explcitly call one deployment
|
2023-12-05 21:57:00 -08:00 |
|
ishaan-jaff
|
0eccc1b1f8
|
(test) router: call 1 deployment
|
2023-12-05 17:35:35 -08:00 |
|
ishaan-jaff
|
1fa9ddd739
|
(chore) linting fix
|
2023-12-05 17:29:09 -08:00 |
|
ishaan-jaff
|
703a575a5d
|
(test) call 1 deployment on router
|
2023-12-05 16:56:38 -08:00 |
|
ishaan-jaff
|
c717ed4d05
|
(test) router: test async embedding + embedding
|
2023-12-05 14:28:23 -08:00 |
|
ishaan-jaff
|
3ff57493f4
|
(test) router: openai async, sync, stream, no stream
|
2023-12-05 14:21:37 -08:00 |
|
ishaan-jaff
|
bc70a6fba8
|
(test) router: add tests for azure completion, acompletion
|
2023-12-05 13:59:27 -08:00 |
|
ishaan-jaff
|
0d1b42eda5
|
(test) azure - test async + sync embedding
|
2023-12-05 13:35:05 -08:00 |
|
ishaan-jaff
|
1463cc6023
|
(test) router Azure regular chat completion call
|
2023-12-05 13:28:07 -08:00 |
|
ishaan-jaff
|
e579918dd9
|
(test) Router: Test Azure acompletion, stream
|
2023-12-05 13:22:27 -08:00 |
|