Commit graph

145 commits

Author SHA1 Message Date
Ishaan Jaff
887d072375 test_router_provider_wildcard_routing 2024-08-07 14:12:40 -07:00
Ishaan Jaff
5e0e113b39 test provider wildcard routing 2024-08-07 13:52:00 -07:00
Krrish Dholakia
14d0ae6aa4 fix(types/router.py): remove model_info pydantic field
Fixes https://github.com/BerriAI/litellm/issues/5042
2024-08-05 09:58:44 -07:00
Krrish Dholakia
ca0de7c0da test(test_router.py): handle azure api instability 2024-07-25 19:54:40 -07:00
Krish Dholakia
c6b6dbeb6b Merge branch 'main' into litellm_fix_httpx_transport 2024-07-06 19:12:06 -07:00
Krrish Dholakia
5c700a6141 test: fix test 2024-07-02 22:13:41 -07:00
Krrish Dholakia
a5124b3146 test: skip bad test 2024-07-02 17:46:50 -07:00
Krrish Dholakia
b75073339f test(test_router.py): fix test 2024-07-02 17:45:33 -07:00
Krish Dholakia
637369d2ac Merge branch 'main' into litellm_fix_httpx_transport 2024-07-02 17:17:43 -07:00
Ishaan Jaff
b4d61a3510 fix img gen test 2024-06-29 20:54:22 -07:00
Ishaan Jaff
658d7175a5 test- router when using openai prefix 2024-06-29 17:28:08 -07:00
Krrish Dholakia
931a9934f1 fix(router.py): fix get_router_model_info for azure models 2024-06-28 22:13:29 -07:00
Krrish Dholakia
6e53de5462 fix(router.py): only return 'max_tokens', 'input_cost_per_token', etc. in 'get_router_model_info' if base_model is set 2024-06-28 10:45:31 -07:00
Krrish Dholakia
d67cdee850 fix(router.py): fix setting httpx mounts 2024-06-26 17:22:04 -07:00
Krrish Dholakia
9cdc714baf test(test_router.py): add testing 2024-06-24 17:28:12 -07:00
Krrish Dholakia
ddd32e5341 fix(router.py): use user-defined model_input_tokens for pre-call filter checks 2024-06-24 17:25:26 -07:00
Krrish Dholakia
8843b0dc77 feat(dynamic_rate_limiter.py): working e2e 2024-06-22 14:41:22 -07:00
Krrish Dholakia
6a7982fa40 feat(dynamic_rate_limiter.py): passing base case 2024-06-21 22:46:46 -07:00
Krrish Dholakia
a8290ec294 fix(router.py): fix set_client init to check if custom_llm_provider is azure not if in model name
fixes issue where 'azure_ai/' was being init as azureopenai client
2024-06-21 17:09:20 -07:00
Krrish Dholakia
477ed99896 feat(router.py): allow user to call specific deployment via id
Allows easier health checks for specific deployments by just passing in model id
2024-06-19 13:02:46 -07:00
Krrish Dholakia
121f4d8a1b fix(router.py): support multiple orgs in 1 model definition
Closes https://github.com/BerriAI/litellm/issues/3949
2024-06-18 19:36:58 -07:00
Krrish Dholakia
58cce8a922 fix(types/router.py): modelgroupinfo to handle mode being None and supported_openai_params not being a list 2024-06-08 20:13:45 -07:00
Krrish Dholakia
005128addc feat(router.py): enable settting 'order' for a deployment in model list
Allows user to control which model gets called first in model group
2024-06-06 09:46:51 -07:00
Krrish Dholakia
482929bece fix(router.py): security fix - don't show api key in invalid model setup error message 2024-05-29 16:14:57 -07:00
Krrish Dholakia
6b140e77eb test(test_router.py): fix testing 2024-05-21 17:31:31 -07:00
Krrish Dholakia
c989b92801 feat(router.py): Fixes https://github.com/BerriAI/litellm/issues/3769 2024-05-21 17:24:51 -07:00
Krrish Dholakia
684e4e8c89 fix(router.py): overloads for better router.acompletion typing 2024-05-13 14:27:16 -07:00
Krrish Dholakia
2ed155b4d4 feat(router.py): allow setting model_region in litellm_params
Closes https://github.com/BerriAI/litellm/issues/3580
2024-05-11 10:18:08 -07:00
Krrish Dholakia
a12878b0f8 fix(router.py): cooldown deployments, for 401 errors 2024-04-30 17:54:00 -07:00
Krrish Dholakia
17c3f110c1 fix(router.py): unify retry timeout logic across sync + async function_with_retries 2024-04-30 15:23:19 -07:00
Krrish Dholakia
e322b8a1f8 fix(router.py): fix trailing slash handling for api base which contains /v1 2024-04-27 17:36:28 -07:00
Krish Dholakia
26bacef87b Merge branch 'main' into litellm_default_router_retries 2024-04-27 11:21:57 -07:00
Krrish Dholakia
069d1f863d fix(router.py): add /v1/ if missing to base url, for openai-compatible api's
Fixes https://github.com/BerriAI/litellm/issues/2279
2024-04-26 17:05:07 -07:00
Krrish Dholakia
ca4918b9a7 fix(router.py): support verify_ssl flag
Fixes https://github.com/BerriAI/litellm/issues/3162#issuecomment-2075273807
2024-04-26 15:38:01 -07:00
Krrish Dholakia
08e36547d6 fix(router.py): allow passing httpx.timeout to timeout param in router
Closes https://github.com/BerriAI/litellm/issues/3162
2024-04-26 14:57:19 -07:00
Krrish Dholakia
f1b2405fe0 fix(router.py): fix default retry logic 2024-04-25 11:57:27 -07:00
Ishaan Jaff
c0b554169c Revert "fix(router.py): fix max retries on set_client"
This reverts commit 821844c1a3.
2024-04-24 23:19:14 -07:00
Krrish Dholakia
2154ec624b fix(router.py): fix max retries on set_client 2024-04-24 22:03:01 -07:00
Krrish Dholakia
0863c10b0b fix(router.py): make sure pre call rpm check runs even when model not in model cost map 2024-04-11 09:27:46 -07:00
Krrish Dholakia
f5206d592a fix(router.py): generate consistent model id's
having the same id for a deployment, lets redis usage caching work across multiple instances
2024-04-10 15:23:57 -07:00
Ishaan Jaff
d8788bbe9b test - router re-use openai client 2024-04-06 11:33:17 -07:00
Krrish Dholakia
a7da061ed8 test(test_router.py): fix casting 2024-04-04 13:54:16 -07:00
Krrish Dholakia
99e9b29526 test(test_router.py): fix test to check cast 2024-04-04 13:32:50 -07:00
Krrish Dholakia
0294c3f8a9 test(test_router.py): fix test to check type 2024-04-04 11:45:12 -07:00
Krrish Dholakia
129bb52e9d fix(proxy_server.py): persist models added via /model/new to db
allows models to be used across instances

https://github.com/BerriAI/litellm/issues/2319 , https://github.com/BerriAI/litellm/issues/2329
2024-04-03 20:16:41 -07:00
Krrish Dholakia
b2b8375987 fix(router.py): support context window fallbacks for pre-call checks 2024-04-01 10:51:54 -07:00
Ishaan Jaff
11c922a966 (fix) test aimg gen on router 2024-03-28 12:27:26 -07:00
Krrish Dholakia
00d27a324d fix(router.py): check for context window error when handling 400 status code errors
was causing proxy context window fallbacks to not work as expected
2024-03-26 08:08:15 -07:00
Krrish Dholakia
8c6402b02d docs(routing.md): add pre-call checks to docs 2024-03-23 19:10:34 -07:00
Krrish Dholakia
292cdd81e4 fix(router.py): fix pre call check logic 2024-03-23 18:56:08 -07:00