Ishaan Jaff
|
c7f14e936a
|
(code quality) run ruff rule to ban unused imports (#7313)
* remove unused imports
* fix AmazonConverseConfig
* fix test
* fix import
* ruff check fixes
* test fixes
* fix testing
* fix imports
|
2024-12-19 12:33:42 -08:00 |
|
Ishaan Jaff
|
4d1b4beb3d
|
(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208)
* use folder for caching
* fix importing caching
* fix clickhouse pyright
* fix linting
* fix correctly pass kwargs and args
* fix test case for embedding
* fix linting
* fix embedding caching logic
* fix refactor handle utils.py
* fix test_embedding_caching_azure_individual_items_reordered
|
2024-10-14 16:34:01 +05:30 |
|
Ishaan Jaff
|
fb5be57bb8
|
v0 add rerank on litellm proxy
|
2024-08-27 17:28:39 -07:00 |
|
Ishaan Jaff
|
4685b9909a
|
feat - allow accessing data post success call
|
2024-08-19 11:35:33 -07:00 |
|
Krrish Dholakia
|
61f4b71ef7
|
refactor: replace .error() with .exception() logging for better debugging on sentry
|
2024-08-16 09:22:47 -07:00 |
|
Krrish Dholakia
|
7e769f3b89
|
fix: fix linting errors
|
2024-07-13 14:39:42 -07:00 |
|
Krrish Dholakia
|
196b94455e
|
fix(dynamic_rate_limiter.py): add rpm allocation, priority + quota reservation to docs
|
2024-07-01 23:35:42 -07:00 |
|
Krrish Dholakia
|
6b529d4e0e
|
fix(dynamic_rate_limiter.py): support setting priority + reserving tpm/rpm
|
2024-07-01 23:08:54 -07:00 |
|
Krrish Dholakia
|
0781014706
|
test(test_dynamic_rate_limit_handler.py): refactor tests for rpm suppprt
|
2024-07-01 20:16:10 -07:00 |
|
Krrish Dholakia
|
f23b17091d
|
fix(dynamic_rate_limiter.py): support dynamic rate limiting on rpm
|
2024-07-01 17:45:10 -07:00 |
|
Krrish Dholakia
|
bae7377128
|
docs(team_budgets.md): fix script
/
|
2024-06-22 15:42:05 -07:00 |
|
Krrish Dholakia
|
a31a05d45d
|
feat(dynamic_rate_limiter.py): working e2e
|
2024-06-22 14:41:22 -07:00 |
|
Krrish Dholakia
|
532f24bfb7
|
refactor: instrument 'dynamic_rate_limiting' callback on proxy
|
2024-06-22 00:32:29 -07:00 |
|
Krrish Dholakia
|
068e8dff5b
|
feat(dynamic_rate_limiter.py): passing base case
|
2024-06-21 22:46:46 -07:00 |
|
Krrish Dholakia
|
a028600932
|
feat(dynamic_rate_limiter.py): update cache with active project
|
2024-06-21 20:25:40 -07:00 |
|
Krrish Dholakia
|
2545da777b
|
feat(dynamic_rate_limiter.py): initial commit for dynamic rate limiting
Closes https://github.com/BerriAI/litellm/issues/4124
|
2024-06-21 18:41:31 -07:00 |
|