litellm-mirror/litellm
Ishaan Jaff c8eff2dc65
[Feat-Prometheus] Track exception status on litellm_deployment_failure_responses (#5706)
* add litellm_deployment_cooled_down

* track num cooldowns on prometheus

* track exception status

* fix linting

* docs prom metrics

* cleanup premium user checks

* prom track deployment failure state

* docs prometheus
2024-09-14 18:44:31 -07:00
..
adapters fix(anthropic_adapter.py): fix sync streaming 2024-08-03 20:52:29 -07:00
assistants fix linting error 2024-09-05 09:56:33 -07:00
batches use correct type hints for audio transcriptions 2024-09-05 09:12:27 -07:00
deprecated_litellm_server refactor: add black formatting 2023-12-25 14:11:20 +05:30
files fix import error 2024-09-05 10:09:44 -07:00
fine_tuning refactor get_secret 2024-09-03 10:42:12 -07:00
integrations [Feat-Prometheus] Track exception status on litellm_deployment_failure_responses (#5706) 2024-09-14 18:44:31 -07:00
litellm_core_utils sambanova support (#5547) (#5703) 2024-09-14 17:23:04 -07:00
llms sambanova support (#5547) (#5703) 2024-09-14 17:23:04 -07:00
proxy [Feat-Prometheus] Track exception status on litellm_deployment_failure_responses (#5706) 2024-09-14 18:44:31 -07:00
rerank_api LiteLLM Minor Fixes and Improvements (09/12/2024) (#5658) 2024-09-12 23:04:06 -07:00
router_strategy LiteLLM Minor Fixes and Improvements (09/13/2024) (#5689) 2024-09-14 10:02:55 -07:00
router_utils [Feat-Prometheus] Track exception status on litellm_deployment_failure_responses (#5706) 2024-09-14 18:44:31 -07:00
secret_managers LiteLLM Minor Fixes and Improvements (11/09/2024) (#5634) 2024-09-11 22:36:06 -07:00
tests [Fix] Router cooldown logic - use % thresholds instead of allowed fails to cooldown deployments (#5698) 2024-09-14 18:01:19 -07:00
types LiteLLM Minor Fixes and Improvements (09/13/2024) (#5689) 2024-09-14 10:02:55 -07:00
__init__.py sambanova support (#5547) (#5703) 2024-09-14 17:23:04 -07:00
_logging.py fix(_logging.py): fix timestamp format for json logs 2024-06-20 15:20:21 -07:00
_redis.py allow setting password for redis cluster 2024-09-07 09:25:13 -07:00
_service_logger.py fix handle case when service logger has no attribute prometheusServicesLogger 2024-08-08 17:19:12 -07:00
_version.py (fix) ci/cd don't let importing litellm._version block starting proxy 2024-02-01 16:23:16 -08:00
budget_manager.py feat(proxy_server.py): return litellm version in response headers 2024-05-08 16:00:08 -07:00
caching.py LiteLLM Minor Fixes and Improvements (11/09/2024) (#5634) 2024-09-11 22:36:06 -07:00
cost.json store llm costs in budget manager 2023-09-09 19:11:35 -07:00
cost_calculator.py LiteLLM Minor Fixes and Improvements (09/10/2024) (#5618) 2024-09-11 11:30:29 -07:00
exceptions.py fix: fix tests 2024-08-07 15:02:04 -07:00
main.py [Fix] Router cooldown logic - use % thresholds instead of allowed fails to cooldown deployments (#5698) 2024-09-14 18:01:19 -07:00
model_prices_and_context_window_backup.json fic otel load test % 2024-09-14 18:04:28 -07:00
py.typed feature - Types for mypy - #360 2024-05-30 14:14:41 -04:00
requirements.txt Add symlink and only copy in source dir to stay under 50MB compressed limit for Lambdas. 2023-11-22 23:07:33 -05:00
router.py [Feat-Prometheus] Track exception status on litellm_deployment_failure_responses (#5706) 2024-09-14 18:44:31 -07:00
scheduler.py feat(scheduler.py): support redis caching for req. prioritization 2024-06-06 14:19:21 -07:00
timeout.py refactor: add black formatting 2023-12-25 14:11:20 +05:30
utils.py [Fix] Router cooldown logic - use % thresholds instead of allowed fails to cooldown deployments (#5698) 2024-09-14 18:01:19 -07:00