ishaan-jaff
|
5b63827430
|
(ci/cd) fix test_config_no_auth
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
6640690ad6
|
(fix) test_normal_router_tpm_limit
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
97dbf14b32
|
(fix) parallel_request_limiter debug
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
0d5f6cacc4
|
(ci/cd) run again
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
233590e8c2
|
(fix) proxy_startup test
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
b3a4982eda
|
(fix) rename proxy startup test
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
45ab0f01c0
|
(ci/cd) run in verbose mode
|
2024-02-06 15:17:35 -08:00 |
|
Krrish Dholakia
|
b47f9dcb6d
|
fix(ollama.py): support format for ollama
|
2024-02-06 15:17:35 -08:00 |
|
Krrish Dholakia
|
80eb8d0eae
|
fix(ollama_chat.py): explicitly state if ollama call is streaming or not
|
2024-02-06 15:17:35 -08:00 |
|
Krrish Dholakia
|
3db9830d4b
|
fix(utils.py): use print_verbose for statements, so debug can be seen when running sdk
|
2024-02-06 15:17:35 -08:00 |
|
Krrish Dholakia
|
d189e95045
|
fix(ollama_chat.py): fix ollama chat completion token counting
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
3b977679f8
|
(fix) test_normal_router_tpm_limit
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
7ccb7c00d8
|
(ci/cd) print debug info for test_proxy_gunicorn_startup_config_dict
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
ca029d13ee
|
(fix) proxy startup test
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
732ac6df49
|
(feat) proxy - upperbound params /key/generate
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
e21f906463
|
(test) test_upperbound_key_params
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
8d0c235004
|
(feat) upperbound_key_generate_params
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
0996ea3f36
|
(docs) upperbound_key_generate_params
|
2024-02-06 15:17:35 -08:00 |
|
Krrish Dholakia
|
f363f0f5ba
|
fix(langfuse.py): support logging failed llm api calls to langfuse
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
4e3f048967
|
(feat) max_user_budget
|
2024-02-06 15:16:20 -08:00 |
|
ishaan-jaff
|
047b2f9b1a
|
(Feat) support max_user_budget
|
2024-02-06 15:13:59 -08:00 |
|
Krrish Dholakia
|
0609968853
|
test(test_key_generate_dynamodb.py): fix test
|
2024-02-06 14:36:24 -08:00 |
|
Krish Dholakia
|
e36566a212
|
Merge branch 'main' into litellm_admin_ui_view_all_keys
|
2024-02-06 14:34:57 -08:00 |
|
Krish Dholakia
|
f70bbc7b2e
|
Merge branch 'main' into litellm_vertex_ai_streaming_fix
|
2024-02-06 14:33:54 -08:00 |
|
Krish Dholakia
|
4fe4d0c1f3
|
Merge branch 'main' into litellm_fix_proxy_health_readiness
|
2024-02-06 14:31:57 -08:00 |
|
ishaan-jaff
|
223cc88f0c
|
(ci/cd) run again
|
2024-02-06 14:00:27 -08:00 |
|
Krrish Dholakia
|
c46aca7951
|
refactor(main.py): trigger deploy
n
|
2024-02-06 13:58:20 -08:00 |
|
ishaan-jaff
|
3a5923cffa
|
(fix) langfuse show semantic-similarity in tags
|
2024-02-06 13:58:20 -08:00 |
|
ishaan-jaff
|
d9f407b4c2
|
(feat) show semantic-cache on health/readiness
|
2024-02-06 13:58:20 -08:00 |
|
ishaan-jaff
|
d05984ef13
|
(feat) working semantic cache on proxy
|
2024-02-06 13:58:13 -08:00 |
|
Krrish Dholakia
|
d1549cb2f3
|
refactor(main.py): trigger deploy
n
|
2024-02-06 13:55:51 -08:00 |
|
ishaan-jaff
|
a6afba8cf2
|
(fix) langfuse show semantic-similarity in tags
|
2024-02-06 13:41:22 -08:00 |
|
ishaan-jaff
|
3d0ece828a
|
(feat) show semantic-cache on health/readiness
|
2024-02-06 13:35:34 -08:00 |
|
Krrish Dholakia
|
f0d4b62b6b
|
test(test_completion.py): fix test
|
2024-02-06 13:32:03 -08:00 |
|
ishaan-jaff
|
f0e632ebc8
|
(ci/cd) run again
|
2024-02-06 13:32:03 -08:00 |
|
ishaan-jaff
|
d2a44de4c9
|
(fix) mark semantic caching as beta test
|
2024-02-06 13:32:03 -08:00 |
|
ishaan-jaff
|
ba4ca4d02a
|
(fix) semantic caching
|
2024-02-06 13:32:03 -08:00 |
|
ishaan-jaff
|
9f7ec4c9f9
|
(fix) test-semantic caching
|
2024-02-06 13:32:03 -08:00 |
|
ishaan-jaff
|
755f44613d
|
(feat) redis-semantic cache on proxy
|
2024-02-06 13:32:03 -08:00 |
|
ishaan-jaff
|
e74363b480
|
(fix) use semantic cache on proxy
|
2024-02-06 13:32:03 -08:00 |
|
ishaan-jaff
|
48fa97125d
|
allow setting redis_semantic cache_embedding model
|
2024-02-06 13:32:03 -08:00 |
|
ishaan-jaff
|
80af1c9a58
|
(feat) log semantic_sim to langfuse
|
2024-02-06 13:32:03 -08:00 |
|
ishaan-jaff
|
2881e7b111
|
(feat) working semantic cache on proxy
|
2024-02-06 13:32:03 -08:00 |
|
ishaan-jaff
|
5f2877e699
|
(feat) redis-semantic cache
|
2024-02-06 13:31:58 -08:00 |
|
ishaan-jaff
|
f1dea5571a
|
(feat) working semantic-cache on litellm proxy
|
2024-02-06 13:31:58 -08:00 |
|
ishaan-jaff
|
8a75cbd3ad
|
(test) async semantic cache
|
2024-02-06 13:31:58 -08:00 |
|
ishaan-jaff
|
f66b6f5cd7
|
(feat) RedisSemanticCache - async
|
2024-02-06 13:31:58 -08:00 |
|
ishaan-jaff
|
58c4a29fbc
|
(fix) semantic cache
|
2024-02-06 13:31:58 -08:00 |
|
ishaan-jaff
|
28676b2e0b
|
(test) semantic caching
|
2024-02-06 13:31:58 -08:00 |
|
ishaan-jaff
|
5d345b5b57
|
(test) semantic cache
|
2024-02-06 13:31:58 -08:00 |
|