Krrish Dholakia
|
9cdc714baf
|
test(test_router.py): add testing
|
2024-06-24 17:28:12 -07:00 |
|
Krrish Dholakia
|
ddd32e5341
|
fix(router.py): use user-defined model_input_tokens for pre-call filter checks
|
2024-06-24 17:25:26 -07:00 |
|
Krrish Dholakia
|
8843b0dc77
|
feat(dynamic_rate_limiter.py): working e2e
|
2024-06-22 14:41:22 -07:00 |
|
Krrish Dholakia
|
6a7982fa40
|
feat(dynamic_rate_limiter.py): passing base case
|
2024-06-21 22:46:46 -07:00 |
|
Krrish Dholakia
|
a8290ec294
|
fix(router.py): fix set_client init to check if custom_llm_provider is azure not if in model name
fixes issue where 'azure_ai/' was being init as azureopenai client
|
2024-06-21 17:09:20 -07:00 |
|
Krrish Dholakia
|
477ed99896
|
feat(router.py): allow user to call specific deployment via id
Allows easier health checks for specific deployments by just passing in model id
|
2024-06-19 13:02:46 -07:00 |
|
Krrish Dholakia
|
121f4d8a1b
|
fix(router.py): support multiple orgs in 1 model definition
Closes https://github.com/BerriAI/litellm/issues/3949
|
2024-06-18 19:36:58 -07:00 |
|
Krrish Dholakia
|
58cce8a922
|
fix(types/router.py): modelgroupinfo to handle mode being None and supported_openai_params not being a list
|
2024-06-08 20:13:45 -07:00 |
|
Krrish Dholakia
|
005128addc
|
feat(router.py): enable setting 'order' for a deployment in model list
Allows user to control which model gets called first in model group
|
2024-06-06 09:46:51 -07:00 |
|
Krrish Dholakia
|
482929bece
|
fix(router.py): security fix - don't show api key in invalid model setup error message
|
2024-05-29 16:14:57 -07:00 |
|
Krrish Dholakia
|
6b140e77eb
|
test(test_router.py): fix testing
|
2024-05-21 17:31:31 -07:00 |
|
Krrish Dholakia
|
c989b92801
|
feat(router.py): Fixes https://github.com/BerriAI/litellm/issues/3769
|
2024-05-21 17:24:51 -07:00 |
|
Krrish Dholakia
|
684e4e8c89
|
fix(router.py): overloads for better router.acompletion typing
|
2024-05-13 14:27:16 -07:00 |
|
Krrish Dholakia
|
2ed155b4d4
|
feat(router.py): allow setting model_region in litellm_params
Closes https://github.com/BerriAI/litellm/issues/3580
|
2024-05-11 10:18:08 -07:00 |
|
Krrish Dholakia
|
a12878b0f8
|
fix(router.py): cooldown deployments, for 401 errors
|
2024-04-30 17:54:00 -07:00 |
|
Krrish Dholakia
|
17c3f110c1
|
fix(router.py): unify retry timeout logic across sync + async function_with_retries
|
2024-04-30 15:23:19 -07:00 |
|
Krrish Dholakia
|
e322b8a1f8
|
fix(router.py): fix trailing slash handling for api base which contains /v1
|
2024-04-27 17:36:28 -07:00 |
|
Krish Dholakia
|
26bacef87b
|
Merge branch 'main' into litellm_default_router_retries
|
2024-04-27 11:21:57 -07:00 |
|
Krrish Dholakia
|
069d1f863d
|
fix(router.py): add /v1/ if missing to base url, for openai-compatible api's
Fixes https://github.com/BerriAI/litellm/issues/2279
|
2024-04-26 17:05:07 -07:00 |
|
Krrish Dholakia
|
ca4918b9a7
|
fix(router.py): support verify_ssl flag
Fixes https://github.com/BerriAI/litellm/issues/3162#issuecomment-2075273807
|
2024-04-26 15:38:01 -07:00 |
|
Krrish Dholakia
|
08e36547d6
|
fix(router.py): allow passing httpx.timeout to timeout param in router
Closes https://github.com/BerriAI/litellm/issues/3162
|
2024-04-26 14:57:19 -07:00 |
|
Krrish Dholakia
|
f1b2405fe0
|
fix(router.py): fix default retry logic
|
2024-04-25 11:57:27 -07:00 |
|
Ishaan Jaff
|
c0b554169c
|
Revert "fix(router.py): fix max retries on set_client"
This reverts commit 821844c1a3.
|
2024-04-24 23:19:14 -07:00 |
|
Krrish Dholakia
|
2154ec624b
|
fix(router.py): fix max retries on set_client
|
2024-04-24 22:03:01 -07:00 |
|
Krrish Dholakia
|
0863c10b0b
|
fix(router.py): make sure pre call rpm check runs even when model not in model cost map
|
2024-04-11 09:27:46 -07:00 |
|
Krrish Dholakia
|
f5206d592a
|
fix(router.py): generate consistent model id's
having the same id for a deployment lets redis usage caching work across multiple instances
|
2024-04-10 15:23:57 -07:00 |
|
Ishaan Jaff
|
d8788bbe9b
|
test - router re-use openai client
|
2024-04-06 11:33:17 -07:00 |
|
Krrish Dholakia
|
a7da061ed8
|
test(test_router.py): fix casting
|
2024-04-04 13:54:16 -07:00 |
|
Krrish Dholakia
|
99e9b29526
|
test(test_router.py): fix test to check cast
|
2024-04-04 13:32:50 -07:00 |
|
Krrish Dholakia
|
0294c3f8a9
|
test(test_router.py): fix test to check type
|
2024-04-04 11:45:12 -07:00 |
|
Krrish Dholakia
|
129bb52e9d
|
fix(proxy_server.py): persist models added via /model/new to db
allows models to be used across instances
https://github.com/BerriAI/litellm/issues/2319, https://github.com/BerriAI/litellm/issues/2329
|
2024-04-03 20:16:41 -07:00 |
|
Krrish Dholakia
|
b2b8375987
|
fix(router.py): support context window fallbacks for pre-call checks
|
2024-04-01 10:51:54 -07:00 |
|
Ishaan Jaff
|
11c922a966
|
(fix) test aimg gen on router
|
2024-03-28 12:27:26 -07:00 |
|
Krrish Dholakia
|
00d27a324d
|
fix(router.py): check for context window error when handling 400 status code errors
was causing proxy context window fallbacks to not work as expected
|
2024-03-26 08:08:15 -07:00 |
|
Krrish Dholakia
|
8c6402b02d
|
docs(routing.md): add pre-call checks to docs
|
2024-03-23 19:10:34 -07:00 |
|
Krrish Dholakia
|
292cdd81e4
|
fix(router.py): fix pre call check logic
|
2024-03-23 18:56:08 -07:00 |
|
Krrish Dholakia
|
4e70a3e09a
|
feat(router.py): enable pre-call checks
filter models outside of context window limits of a given message for a model group
https://github.com/BerriAI/litellm/issues/872
|
2024-03-23 18:03:30 -07:00 |
|
Krrish Dholakia
|
dad65ca602
|
fix(bedrock.py): support anthropic messages api on bedrock (claude-3)
|
2024-03-04 17:15:47 -08:00 |
|
ishaan-jaff
|
b5fcc89682
|
(test) claude-instant-1
|
2024-03-04 08:32:13 -08:00 |
|
Krrish Dholakia
|
129e84f0c5
|
test: removing aws tests - account suspended - pending their approval
|
2024-02-28 13:46:20 -08:00 |
|
ishaan-jaff
|
6c1c2e8c7d
|
(feat) add moderation on router
|
2024-02-14 11:00:09 -08:00 |
|
ishaan-jaff
|
b7c458f25d
|
(ci/cd) add more logging to timeout test
|
2024-01-23 18:39:19 -08:00 |
|
ishaan-jaff
|
7276849499
|
(test) dynamic timeouts - router
|
2024-01-23 13:27:49 -08:00 |
|
Krrish Dholakia
|
ea9bb27e52
|
test(test_router.py): add more testing for dynamically passing params to router
|
2024-01-23 10:31:49 -08:00 |
|
ishaan-jaff
|
9aa40c63ee
|
(test) router.py revert to stable version
|
2024-01-23 09:05:19 -08:00 |
|
Krrish Dholakia
|
0b794cbd8f
|
test(test_router.py): fix test check
|
2024-01-23 07:07:30 -08:00 |
|
Krrish Dholakia
|
2912c3dcbb
|
fix(router.py): ensure no unsupported args are passed to completion()
|
2024-01-22 22:33:06 -08:00 |
|
Krrish Dholakia
|
9aaec44dff
|
fix(router.py): fix client init
|
2024-01-22 22:15:39 -08:00 |
|
Krrish Dholakia
|
253842bc7a
|
test(test_router.py): handle rate limiting error
|
2024-01-11 00:00:17 +05:30 |
|
Krrish Dholakia
|
4e719850bc
|
test(test_router.py): handle image gen timeouts
|
2024-01-10 20:56:52 +05:30 |
|