ishaan-jaff
|
c69eaebfd8
|
(fix) dockerfile for semantic caching
|
2024-02-06 19:23:27 -08:00 |
|
ishaan-jaff
|
7b26b3b789
|
(ci/cd) run again
|
2024-02-06 18:25:15 -08:00 |
|
ishaan-jaff
|
83628938ab
|
bump: version 1.22.9 → 1.22.10
|
2024-02-06 17:12:46 -08:00 |
|
Ishaan Jaff
|
73c6ce890b
|
Merge pull request #1859 from BerriAI/litellm_allow_using_budgets_without_keys
[Feat] Budgets for 'user' param passed to /chat/completions, /embeddings etc
|
2024-02-06 16:32:25 -08:00 |
|
ishaan-jaff
|
6369424629
|
(ci/cd) run again
|
2024-02-06 16:08:25 -08:00 |
|
Krish Dholakia
|
0fd64bc906
|
Merge pull request #1839 from BerriAI/litellm_slack_langfuse_alerting
fix(proxy/utils.py): if langfuse trace id passed in, include in slack alert
|
2024-02-06 15:49:00 -08:00 |
|
Krish Dholakia
|
9e9fb747ce
|
Merge branch 'main' into litellm_slack_langfuse_alerting
|
2024-02-06 15:48:52 -08:00 |
|
ishaan-jaff
|
8208ebd9db
|
(docs) budget per end_user
|
2024-02-06 15:39:45 -08:00 |
|
ishaan-jaff
|
196787359f
|
(test) track_cost_ for end users
|
2024-02-06 15:25:51 -08:00 |
|
ishaan-jaff
|
52b864976b
|
(feat) support max_user_budget
|
2024-02-06 15:19:36 -08:00 |
|
Krrish Dholakia
|
be81183782
|
refactor(main.py): trigger deploy
n
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
78f75647da
|
(fix) redisvl requirements.txt issue
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
8ba2c8dbf7
|
(fix) langfuse show semantic-similarity in tags
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
eb3b68a2f0
|
(fix) dockerfile requirements.txt
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
325ca43946
|
(feat) show semantic-cache on health/readiness
|
2024-02-06 15:17:40 -08:00 |
|
Krrish Dholakia
|
0d03b28a3b
|
test(test_completion.py): fix test
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
b5db630dba
|
(ci/cd) run again
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
43061d612d
|
(fix) mark semantic caching as beta test
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
e32c2beddd
|
(fix) semantic caching
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
102f20fc03
|
(docs) litellm semantic caching
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
b49b37568a
|
(docs) redis cache
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
f3de05cc54
|
(fix) test-semantic caching
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
f8248b2c79
|
(feat) redis-semantic cache on proxy
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
04433c01fd
|
(docs) using semantic caching on proxy
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
58f47c9e29
|
(fix) use semantic cache on proxy
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
91424b66d7
|
allow setting redis_semantic cache_embedding model
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
e2c88ce154
|
(feat) log semantic_sim to langfuse
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
fb1212ac82
|
(fix) add redisvl==0.0.7
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
e0d5c953d6
|
(feat) working semantic cache on proxy
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
c8d5714e59
|
(feat) redis-semantic cache
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
aa7580411d
|
(feat) working semantic-cache on litellm proxy
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
1d151e4777
|
(test) async semantic cache
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
08d72fd2a0
|
(feat) RedisSemanticCache - async
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
553b993473
|
(fix) semantic cache
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
2ad8b70f50
|
(test) semantic caching
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
a5afbf6d56
|
(test) semantic cache
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
a510adb1e6
|
(feat) working - sync semantic caching
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
d67a9ada4f
|
(feat) add semantic cache
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
d85b1f8816
|
(feat) show langfuse logging tags better through proxy
|
2024-02-06 15:17:35 -08:00 |
|
Krrish Dholakia
|
6de6da71b7
|
bump: version 1.22.8 → 1.22.9
|
2024-02-06 15:17:35 -08:00 |
|
Krrish Dholakia
|
eee5353e77
|
fix(utils.py): round max tokens to be int always
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
647dbb9331
|
(ci/cd) run again
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
01701c95b8
|
(ci/cd) run again
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
5b63827430
|
(ci/cd) fix test_config_no_auth
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
6640690ad6
|
(fix) test_normal_router_tpm_limit
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
97dbf14b32
|
(fix) parallel_request_limiter debug
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
0d5f6cacc4
|
(ci/cd) run again
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
334acfb5f8
|
(ci/cd) run pytest without -s
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
233590e8c2
|
(fix) proxy_startup test
|
2024-02-06 15:17:35 -08:00 |
|
ishaan-jaff
|
b3a4982eda
|
(fix) rename proxy startup test
|
2024-02-06 15:17:35 -08:00 |
|