litellm-mirror/litellm
Krrish Dholakia dff9f37e24 test(test_router_caching.py): remove unstable test
test would fail due to timing issues
2024-04-29 18:37:31 -07:00
..
deprecated_litellm_server refactor: add black formatting 2023-12-25 14:11:20 +05:30
integrations fix(langfuse.py): support 'existing_trace_id' param 2024-04-29 16:39:17 -07:00
llms fix: cohere tool results 2024-04-29 14:20:24 +04:00
proxy docs(load_test.md): load test multiple instances of the proxy w/ tpm/rpm limits on deployments 2024-04-29 15:58:14 -07:00
router_strategy Merge pull request #3358 from sumanth13131/usage-based-routing-RPM-fix 2024-04-29 16:45:25 -07:00
tests test(test_router_caching.py): remove unstable test 2024-04-29 18:37:31 -07:00
types fix(lowest_tpm_rpm_v2.py): add more detail to 'No deployments available' error message 2024-04-29 15:04:37 -07:00
__init__.py Merge branch 'main' into litellm_common_auth_params 2024-04-28 08:38:06 -07:00
_logging.py fix(parallel_request_limiter.py): handle metadata being none 2024-03-14 10:02:41 -07:00
_redis.py fix(_redis.py): support redis ssl as a kwarg REDIS_SSL 2024-04-20 10:19:44 -07:00
_service_logger.py fix(test_lowest_tpm_rpm_routing_v2.py): unit testing for usage-based-routing-v2 2024-04-18 21:38:00 -07:00
_version.py (fix) ci/cd don't let importing litellm._version block starting proxy 2024-02-01 16:23:16 -08:00
budget_manager.py feat(utils.py): support region based pricing for bedrock + use bedrock's token counts if given 2024-01-26 14:53:58 -08:00
caching.py Merge branch 'main' into litellm_ssl_caching_fix 2024-04-19 17:20:27 -07:00
cost.json store llm costs in budget manager 2023-09-09 19:11:35 -07:00
exceptions.py fix - show api_base, model in exceptions 2024-04-24 14:03:48 -07:00
main.py fix(watsonx.py): use common litellm params for api key, api base, etc. 2024-04-27 10:15:27 -07:00
model_prices_and_context_window_backup.json build(model_prices_and_context_window.json): add token-based replicate costs to model cost map 2024-04-29 08:20:44 -07:00
requirements.txt Add symlink and only copy in source dir to stay under 50MB compressed limit for Lambdas. 2023-11-22 23:07:33 -05:00
router.py fix(router.py): fix high-traffic bug for usage-based-routing-v2 2024-04-29 16:48:01 -07:00
timeout.py refactor: add black formatting 2023-12-25 14:11:20 +05:30
utils.py Merge pull request #3354 from BerriAI/litellm_replicate_cost_tracking 2024-04-29 09:13:41 -07:00