Commit graph

4168 commits

Author SHA1 Message Date
ishaan-jaff
5b63827430 (ci/cd) fix test_config_no_auth 2024-02-06 15:17:35 -08:00
ishaan-jaff
6640690ad6 (fix) test_normal_router_tpm_limit 2024-02-06 15:17:35 -08:00
ishaan-jaff
97dbf14b32 (fix) parallel_request_limiter debug 2024-02-06 15:17:35 -08:00
ishaan-jaff
0d5f6cacc4 (ci/cd) run again 2024-02-06 15:17:35 -08:00
ishaan-jaff
233590e8c2 (fix) proxy_startup test 2024-02-06 15:17:35 -08:00
ishaan-jaff
b3a4982eda (fix) rename proxy startup test 2024-02-06 15:17:35 -08:00
ishaan-jaff
45ab0f01c0 (ci/cd) run in verbose mode 2024-02-06 15:17:35 -08:00
Krrish Dholakia
b47f9dcb6d fix(ollama.py): support format for ollama 2024-02-06 15:17:35 -08:00
Krrish Dholakia
80eb8d0eae fix(ollama_chat.py): explicitly state if ollama call is streaming or not 2024-02-06 15:17:35 -08:00
Krrish Dholakia
3db9830d4b fix(utils.py): use print_verbose for statements, so debug can be seen when running sdk 2024-02-06 15:17:35 -08:00
Krrish Dholakia
d189e95045 fix(ollama_chat.py): fix ollama chat completion token counting 2024-02-06 15:17:35 -08:00
ishaan-jaff
3b977679f8 (fix) test_normal_router_tpm_limit 2024-02-06 15:17:35 -08:00
ishaan-jaff
7ccb7c00d8 (ci/cd) print debug info for test_proxy_gunicorn_startup_config_dict 2024-02-06 15:17:35 -08:00
ishaan-jaff
ca029d13ee (fix) proxy startup test 2024-02-06 15:17:35 -08:00
ishaan-jaff
732ac6df49 (feat) proxy - upperbound params /key/generate 2024-02-06 15:17:35 -08:00
ishaan-jaff
e21f906463 (test) test_upperbound_key_params 2024-02-06 15:17:35 -08:00
ishaan-jaff
8d0c235004 (feat) upperbound_key_generate_params 2024-02-06 15:17:35 -08:00
ishaan-jaff
0996ea3f36 (docs) upperbound_key_generate_params 2024-02-06 15:17:35 -08:00
Krrish Dholakia
f363f0f5ba fix(langfuse.py): support logging failed llm api calls to langfuse 2024-02-06 15:17:35 -08:00
ishaan-jaff
4e3f048967 (feat) max_user_budget 2024-02-06 15:16:20 -08:00
ishaan-jaff
047b2f9b1a (Feat) support max_user_budget 2024-02-06 15:13:59 -08:00
Krrish Dholakia
0609968853 test(test_key_generate_dynamodb.py): fix test 2024-02-06 14:36:24 -08:00
Krish Dholakia
e36566a212
Merge branch 'main' into litellm_admin_ui_view_all_keys 2024-02-06 14:34:57 -08:00
Krish Dholakia
f70bbc7b2e
Merge branch 'main' into litellm_vertex_ai_streaming_fix 2024-02-06 14:33:54 -08:00
Krish Dholakia
4fe4d0c1f3
Merge branch 'main' into litellm_fix_proxy_health_readiness 2024-02-06 14:31:57 -08:00
ishaan-jaff
223cc88f0c (ci/cd) run again 2024-02-06 14:00:27 -08:00
Krrish Dholakia
c46aca7951 refactor(main.py): trigger deploy
n
2024-02-06 13:58:20 -08:00
ishaan-jaff
3a5923cffa (fix) langfuse show semantic-similarity in tags 2024-02-06 13:58:20 -08:00
ishaan-jaff
d9f407b4c2 (feat) show semantic-cache on health/readiness 2024-02-06 13:58:20 -08:00
ishaan-jaff
d05984ef13 (feat) working semantic cache on proxy 2024-02-06 13:58:13 -08:00
Krrish Dholakia
d1549cb2f3 refactor(main.py): trigger deploy
n
2024-02-06 13:55:51 -08:00
ishaan-jaff
a6afba8cf2 (fix) langfuse show semantic-similarity in tags 2024-02-06 13:41:22 -08:00
ishaan-jaff
3d0ece828a (feat) show semantic-cache on health/readiness 2024-02-06 13:35:34 -08:00
Krrish Dholakia
f0d4b62b6b test(test_completion.py): fix test 2024-02-06 13:32:03 -08:00
ishaan-jaff
f0e632ebc8 (ci/cd) run again 2024-02-06 13:32:03 -08:00
ishaan-jaff
d2a44de4c9 (fix) mark semantic caching as beta test 2024-02-06 13:32:03 -08:00
ishaan-jaff
ba4ca4d02a (fix) semantic caching 2024-02-06 13:32:03 -08:00
ishaan-jaff
9f7ec4c9f9 (fix) test-semantic caching 2024-02-06 13:32:03 -08:00
ishaan-jaff
755f44613d (feat) redis-semantic cache on proxy 2024-02-06 13:32:03 -08:00
ishaan-jaff
e74363b480 (fix) use semantic cache on proxy 2024-02-06 13:32:03 -08:00
ishaan-jaff
48fa97125d allow setting redis_semantic cache_embedding model 2024-02-06 13:32:03 -08:00
ishaan-jaff
80af1c9a58 (feat) log semantic_sim to langfuse 2024-02-06 13:32:03 -08:00
ishaan-jaff
2881e7b111 (feat) working semantic cache on proxy 2024-02-06 13:32:03 -08:00
ishaan-jaff
5f2877e699 (feat) redis-semantic cache 2024-02-06 13:31:58 -08:00
ishaan-jaff
f1dea5571a (feat) working semantic-cache on litellm proxy 2024-02-06 13:31:58 -08:00
ishaan-jaff
8a75cbd3ad (test) async semantic cache 2024-02-06 13:31:58 -08:00
ishaan-jaff
f66b6f5cd7 (feat) RedisSemanticCache - async 2024-02-06 13:31:58 -08:00
ishaan-jaff
58c4a29fbc (fix) semantic cache 2024-02-06 13:31:58 -08:00
ishaan-jaff
28676b2e0b (test) semantic caching 2024-02-06 13:31:58 -08:00
ishaan-jaff
5d345b5b57 (test) semantic cache 2024-02-06 13:31:58 -08:00