Krrish Dholakia
87ff26ff27
fix(router.py): unify retry timeout logic across sync + async function_with_retries
2024-04-30 15:23:19 -07:00
Krrish Dholakia
3cc82f558e
fix(utils.py): add exception mapping for gemini error
2024-04-30 14:17:10 -07:00
Krrish Dholakia
638477a023
test: fix test
2024-04-30 14:06:09 -07:00
Krrish Dholakia
df43012bdd
test(test_router_fallbacks.py): use rpm test -> more stable
2024-04-30 14:01:01 -07:00
Krrish Dholakia
fee488bd53
test(test_image_generation.py): fix test
2024-04-30 12:15:37 -07:00
Krrish Dholakia
285a3733a9
test(test_image_generation.py): fix test
2024-04-30 12:14:29 -07:00
Krrish Dholakia
90cdfef1c1
fix(lowest_latency.py): allow setting a buffer for getting values within a certain latency threshold
...
if an endpoint is slow - its completion time might not be updated till the call is completed. This prevents us from overloading those endpoints, in a simple way.
2024-04-30 12:00:26 -07:00
Krrish Dholakia
00d1440d0d
test(test_image_generation.py): change img model for test - bedrock EOL
2024-04-30 08:55:40 -07:00
Krrish Dholakia
d717fa2588
test(test_tpm_rpm_routing_v2.py): fix test - bump number of iterations
2024-04-30 08:48:55 -07:00
Krrish Dholakia
1e53c06064
test(test_router_caching.py): remove unstable test
...
test would fail due to timing issues
2024-04-29 18:37:31 -07:00
Krish Dholakia
09bae3d8ad
Merge pull request #3351 from elisalimli/main
...
Fix Cohere tool calling
2024-04-29 16:45:48 -07:00
Krish Dholakia
32534b5e91
Merge pull request #3358 from sumanth13131/usage-based-routing-RPM-fix
...
usage based routing RPM count fix
2024-04-29 16:45:25 -07:00
Ishaan Jaff
d58dd2cbeb
Merge pull request #3360 from BerriAI/litellm_random_pick_lowest_latency
...
[Fix] Lowest Latency routing - random pick deployments when all latencies=0
2024-04-29 16:31:32 -07:00
Ishaan Jaff
5247d7b6a5
test - lowest latency router
2024-04-29 15:51:01 -07:00
Krrish Dholakia
a978f2d881
fix(lowest_tpm_rpm_v2.py): shuffle deployments with same tpm values
2024-04-29 15:23:47 -07:00
sumanth
89e655c79e
usage based routing RPM count fix
2024-04-30 00:29:38 +05:30
Krrish Dholakia
2cfb97141d
fix(utils.py): replicate now also has token based pricing for some models
2024-04-29 08:06:15 -07:00
alisalim17
0aa8b94ff5
test: completion with Cohere command-r-plus model
2024-04-29 18:38:12 +04:00
Krish Dholakia
ffc6af0b22
Merge pull request #3334 from CyanideByte/main
...
protected_namespaces warning fixed for model_name & model_info
2024-04-29 07:16:05 -07:00
Lucca Zenobio
a9e2ef6212
test
2024-04-29 10:05:30 -03:00
Krrish Dholakia
1f6c342e94
test: fix test
2024-04-28 09:45:01 -07:00
Krish Dholakia
1841b74f49
Merge branch 'main' into litellm_common_auth_params
2024-04-28 08:38:06 -07:00
Krrish Dholakia
b9c0b55e7c
test: fix test - set num_retries=0
2024-04-27 21:02:19 -07:00
CyanideByte
82be9a7e67
Merge branch 'BerriAI:main' into main
2024-04-27 20:51:33 -07:00
Krrish Dholakia
3e8d9fc80d
test: skip local test
2024-04-27 19:07:49 -07:00
Krrish Dholakia
a3257fd5d3
test(test_router_init.py): fix test
2024-04-27 18:40:00 -07:00
Krrish Dholakia
d07713a275
test: fix test
2024-04-27 17:48:07 -07:00
Krrish Dholakia
280148543f
fix(router.py): fix trailing slash handling for api base which contains /v1
2024-04-27 17:36:28 -07:00
Krrish Dholakia
ec19c1654b
fix(router.py): set initial value of default litellm params to none
2024-04-27 17:22:50 -07:00
Krrish Dholakia
d9e0d7ce52
test: replace flaky endpoint
2024-04-27 16:37:09 -07:00
CyanideByte
a4c7d933a9
Added pytest for pydantic protected namespace warning
2024-04-27 15:44:40 -07:00
Krrish Dholakia
9f24421d44
fix(router.py): fix router should_retry
2024-04-27 15:13:20 -07:00
Krrish Dholakia
5e0bd5982e
fix(router.py): fix sync should_retry logic
2024-04-27 14:48:07 -07:00
CyanideByte
e1786848cb
protected_namespaces fixed for model_info
2024-04-27 13:08:45 -07:00
Ishaan Jaff
6762d07c7f
Merge pull request #3330 from BerriAI/litellm_rdct_msgs
...
[Feat] Redact Logging Messages/Response content on Logging Providers with `litellm.turn_off_message_logging=True`
2024-04-27 11:25:09 -07:00
Krish Dholakia
1a06f009d1
Merge branch 'main' into litellm_default_router_retries
2024-04-27 11:21:57 -07:00
Krrish Dholakia
2c67791663
test(test_completion.py): modify acompletion test to call pre-deployed watsonx endpoint
2024-04-27 11:19:00 -07:00
Krrish Dholakia
48f19cf839
feat(utils.py): unify common auth params across azure/vertex_ai/bedrock/watsonx
2024-04-27 11:06:18 -07:00
Ishaan Jaff
743dfdb950
test - redacting messages from langfuse
2024-04-27 10:03:34 -07:00
Krish Dholakia
2a006c3d39
Revert "Fix Anthropic Messages Prompt Template function to add a third condition: list of text-content dictionaries"
2024-04-27 08:57:18 -07:00
Krish Dholakia
2d976cfabc
Merge pull request #3270 from simonsanvil/feature/watsonx-integration
...
(feat) add IBM watsonx.ai as an llm provider
2024-04-27 05:48:34 -07:00
Emir Ayar
2ecbf6663a
Add test for completion with text content dictionaries
2024-04-27 12:27:12 +02:00
Krrish Dholakia
e05764bdb7
fix(router.py): add /v1/ if missing to base url, for openai-compatible APIs
...
Fixes https://github.com/BerriAI/litellm/issues/2279
2024-04-26 17:05:07 -07:00
Krrish Dholakia
180718c33f
fix(router.py): support verify_ssl flag
...
Fixes https://github.com/BerriAI/litellm/issues/3162#issuecomment-2075273807
2024-04-26 15:38:01 -07:00
Krrish Dholakia
7730520fb0
fix(router.py): allow passing httpx.timeout to timeout param in router
...
Closes https://github.com/BerriAI/litellm/issues/3162
2024-04-26 14:57:19 -07:00
Krish Dholakia
4b0f73500f
Merge branch 'main' into litellm_default_router_retries
2024-04-26 14:52:24 -07:00
Krrish Dholakia
9eb75cc159
test(test_streaming.py): fix test
2024-04-25 20:22:18 -07:00
Krrish Dholakia
5307510592
test: rename test
2024-04-25 20:07:40 -07:00
Krrish Dholakia
850b056df5
fix(utils.py): add more logging to identify ci/cd issue
2024-04-25 19:57:24 -07:00
Krish Dholakia
40b6b4794b
Merge pull request #3310 from BerriAI/litellm_langfuse_error_logging_2
...
fix(proxy/utils.py): log rejected proxy requests to langfuse
2024-04-25 19:49:59 -07:00