Commit graph

68 commits

Author SHA1 Message Date
Ishaan Jaff
6f34169f17 add e2e tests for keys with regex patterns for /models and /model/info 2024-11-27 18:38:36 -08:00
Ishaan Jaff
a18aeaa2fb add test_regex_pattern_matching_e2e_test 2024-11-25 15:39:42 -08:00
Krish Dholakia
4ac66bd843
LiteLLM Minor Fixes and Improvements (09/07/2024) (#5580)
* fix(litellm_logging.py): set completion_start_time_float to end_time_float if none

Fixes https://github.com/BerriAI/litellm/issues/5500

* feat(_init_.py): add new 'openai_text_completion_compatible_providers' list

Fixes https://github.com/BerriAI/litellm/issues/5558

Handles correctly routing fireworks ai calls when done via text completions

* fix: fix linting errors

* fix: fix linting errors

* fix(openai.py): fix exception raised

* fix(openai.py): fix error handling

* fix(_redis.py): allow all supported arguments for redis cluster (#5554)

* Revert "fix(_redis.py): allow all supported arguments for redis cluster (#5554)" (#5583)

This reverts commit f2191ef4cb.

* fix(router.py): return model alias w/ underlying deployment on router.get_model_list()

Fixes https://github.com/BerriAI/litellm/issues/5524#issuecomment-2336410666

* test: handle flaky tests

---------

Co-authored-by: Jonas Dittrich <58814480+Kakadus@users.noreply.github.com>
2024-09-09 18:54:17 -07:00
Krrish Dholakia
0a016d33e6 Revert "fix(router.py): return model alias w/ underlying deployment on router.get_model_list()"
This reverts commit 638896309c.
2024-09-07 18:04:56 -07:00
Krrish Dholakia
638896309c fix(router.py): return model alias w/ underlying deployment on router.get_model_list()
Fixes https://github.com/BerriAI/litellm/issues/5524#issuecomment-2336410666
2024-09-07 18:01:31 -07:00
Ishaan Jaff
f1ffa82062 fix use provider specific routing 2024-08-07 14:37:20 -07:00
Ishaan Jaff
404360b28d test pass through endpoint 2024-08-06 12:16:00 -07:00
Ishaan Jaff
b35c63001d fix setup for endpoints 2024-07-31 17:09:08 -07:00
Ishaan Jaff
c8dfc95e90 add examples on config 2024-07-31 15:29:06 -07:00
Ishaan Jaff
9863520376 support using */* 2024-07-25 18:48:56 -07:00
Ishaan Jaff
e2397c3b83 fix test_team_2logging langfuse 2024-06-19 21:14:18 -07:00
Ishaan Jaff
d409ffbaa9 fix test_chat_completion_different_deployments 2024-06-17 23:04:48 -07:00
Ishaan Jaff
cb386fda20 test - making mistral embedding request on proxy 2024-06-12 15:10:20 -07:00
Marc Abramowitz
83c242bbb3 Add commented set_verbose line to proxy_config
because I've wanted to do this a couple of times and couldn't remember
the exact syntax.
2024-05-16 15:59:37 -07:00
Krrish Dholakia
54587db402 fix(alerting.py): fix datetime comparison logic 2024-05-14 22:10:09 -07:00
Ishaan Jaff
9bde3ccd1d (ci/cd) fixes 2024-05-13 20:49:02 -07:00
Krrish Dholakia
99e8f0715e test(test_end_users.py): fix end user region routing test 2024-05-11 22:42:43 -07:00
Ishaan Jaff
9c4f1ec3e5 fix - failing test_end_user_specific_region test 2024-05-11 17:05:37 -07:00
Ishaan Jaff
a4695c3010 test - using langfuse as a failure callback 2024-05-10 17:37:32 -07:00
Krrish Dholakia
3d18897d69 feat(router.py): enable filtering model group by 'allowed_model_region' 2024-05-08 22:10:17 -07:00
Ishaan Jaff
6a06aba443 (ci/cd) use db connection limit 2024-05-06 11:15:22 -07:00
Ishaan Jaff
e8d3dd475a fix fake endpoint used on ci/cd 2024-05-06 10:37:39 -07:00
Ishaan Jaff
56a75ee7fe (ci/cd) fix tests 2024-05-01 13:42:54 -07:00
Krrish Dholakia
d4bca6707b ci(proxy_server_config.yaml): use redis for usage-based-routing-v2 2024-04-22 13:34:36 -07:00
Krrish Dholakia
1507b23e30 test(test_openai_endpoints.py): make test stricter 2024-04-20 12:11:54 -07:00
Krrish Dholakia
01a1a8f731 fix(caching.py): dual cache async_batch_get_cache fix + testing
this fixes a bug in usage-based-routing-v2 which was caused b/c of how the result was being returned from dual cache async_batch_get_cache. it also adds unit testing for that function (and it's sync equivalent)
2024-04-19 15:03:25 -07:00
Ishaan Jaff
adae555fb1
Merge branch 'main' into litellm_fix_using_wildcard_openai_models_proxy 2024-04-15 14:35:06 -07:00
Ishaan Jaff
6df5337e65 test - wildcard openai models on proxy 2024-04-15 14:05:26 -07:00
Ishaan Jaff
ecc6aa060f test - team based logging on proxy 2024-04-15 13:26:55 -07:00
Krrish Dholakia
ea1574c160 test(test_openai_endpoints.py): add concurrency testing for user defined rate limits on proxy 2024-04-12 18:56:13 -07:00
Krrish Dholakia
74aa230eac fix(main.py): automatically infer mode for text completion models 2024-04-12 14:16:21 -07:00
Krrish Dholakia
3665b890f8 build(proxy_server_config.yaml): cleanup config 2024-04-11 20:20:09 -07:00
Krrish Dholakia
bdfb74f8a5 test(test_openai_endpoints.py): add local test, for proxy concurrency 2024-04-11 17:16:23 -07:00
Krrish Dholakia
695eb129ad fix(router.py): fix client init for streaming timeouts 2024-04-05 12:30:15 -07:00
Krrish Dholakia
9fe1c26e0d build(proxy_server_config.yaml): add store_model_in_db flag 2024-04-03 23:20:41 -07:00
Krrish Dholakia
c0204310ee fix(main.py): fix translation to text_completions format for async text completion calls 2024-03-30 09:02:51 -07:00
Ishaan Jaff
4d22fe0947 bump: version 1.34.4 → 1.34.5 2024-03-26 18:04:05 -07:00
Ishaan Jaff
da503eab18
Merge branch 'main' into litellm_remove_litellm_telemetry 2024-03-26 11:35:02 -07:00
Ishaan Jaff
b6af44c4dc (fix) telemetry = false 2024-03-26 11:23:23 -07:00
Krrish Dholakia
49e8cdbff9 fix(router.py): check for context window error when handling 400 status code errors
was causing proxy context window fallbacks to not work as expected
2024-03-26 08:08:15 -07:00
Krrish Dholakia
f588bff69b fix(proxy_server.py): fix spend log update 2024-03-18 20:26:28 -07:00
Krish Dholakia
2c2f322d5a
Update proxy_server_config.yaml 2024-03-16 11:16:13 -07:00
Krish Dholakia
f3cf1ec71f
Update proxy_server_config.yaml 2024-03-15 22:40:39 -07:00
Krish Dholakia
24e673dd6d
Update proxy_server_config.yaml 2024-03-15 22:19:37 -07:00
Krish Dholakia
961c3548f5
Update proxy_server_config.yaml 2024-03-15 19:06:06 -07:00
ishaan-jaff
d28145f916 (fix) fake openai endpoint 2024-03-13 15:07:50 -07:00
ishaan-jaff
acd8f7b48c (fix) example fake openai endpoint 2024-03-13 15:06:44 -07:00
ishaan-jaff
788583a737 (ci/cd) use fake-openai-endpoint in docker builds 2024-03-13 14:57:41 -07:00
Krrish Dholakia
f683acda61 build: fix default config 2024-03-11 13:56:10 -07:00
Krrish Dholakia
1369e18e85 build: fix default config.yaml 2024-03-11 13:43:50 -07:00