litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-25 10:44:24 +00:00

Author	SHA1	Message	Date
Krish Dholakia	61b35c12bb	LiteLLM Minor Fixes & Improvements (12/05/2024) (#7037) * fix(together_ai/chat): only return response_format + tools for supported models Fixes https://github.com/BerriAI/litellm/issues/6972 * feat(bedrock/rerank): initial working commit for bedrock rerank api support Closes https://github.com/BerriAI/litellm/issues/7021 * feat(bedrock/rerank): async bedrock rerank api support Addresses https://github.com/BerriAI/litellm/issues/7021 * build(model_prices_and_context_window.json): add 'supports_prompt_caching' for bedrock models + cleanup cross-region from model list (duplicate information - lead to inconsistencies) * docs(json_mode.md): clarify model support for json schema Closes https://github.com/BerriAI/litellm/issues/6998 * fix(_service_logger.py): handle dd callback in list ensure failed spend tracking is logged to datadog * feat(converse_transformation.py): translate from anthropic format to bedrock format Closes https://github.com/BerriAI/litellm/issues/7030 * fix: fix linting errors * test: fix test	2024-12-05 00:02:31 -08:00
Ishaan Jaff	eb47117800	(feat) log error class, function_name on prometheus service failure hook + only log DB related failures on DB service hook (#6650) * log error on prometheus service failure hook * use a more accurate function name for wrapper that handles logging db metrics * fix log_db_metrics * test_log_db_metrics_failure_error_types * fix linting * fix auth checks	2024-11-07 17:01:18 -08:00
Krish Dholakia	d88e8922d4	Litellm dev 11 02 2024 (#6561) * fix(dual_cache.py): update in-memory check for redis batch get cache Fixes latency delay for async_batch_redis_cache * fix(service_logger.py): fix race condition causing otel service logging to be overwritten if service_callbacks set * feat(user_api_key_auth.py): add parent otel component for auth allows us to isolate how much latency is added by auth checks * perf(parallel_request_limiter.py): move async_set_cache_pipeline (from max parallel request limiter) out of execution path (background task) reduces latency by 200ms * feat(user_api_key_auth.py): have user api key auth object return user tpm/rpm limits - reduces redis calls in downstream task (parallel_request_limiter) Reduces latency by 400-800ms * fix(parallel_request_limiter.py): use batch get cache to reduce user/key/team usage object calls reduces latency by 50-100ms * fix: fix linting error * fix(_service_logger.py): fix import * fix(user_api_key_auth.py): fix service logging * fix(dual_cache.py): don't pass 'self' * fix: fix python3.8 error * fix: fix init	2024-11-04 07:48:20 +05:30
Krish Dholakia	4f8a3fd4cf	redis otel tracing + async support for latency routing (#6452) * docs(exception_mapping.md): add missing exception types Fixes https://github.com/Aider-AI/aider/issues/2120#issuecomment-2438971183 * fix(main.py): register custom model pricing with specific key Ensure custom model pricing is registered to the specific model+provider key combination * test: make testing more robust for custom pricing * fix(redis_cache.py): instrument otel logging for sync redis calls ensures complete coverage for all redis cache calls * refactor: pass parent_otel_span for redis caching calls in router allows for more observability into what calls are causing latency issues * test: update tests with new params * refactor: ensure e2e otel tracing for router * refactor(router.py): add more otel tracing across router catch all latency issues for router requests * fix: fix linting error * fix(router.py): fix linting error * fix: fix test * test: fix tests * fix(dual_cache.py): pass ttl to redis cache * fix: fix param	2024-10-28 21:52:12 -07:00
Krish Dholakia	70111a7abd	Litellm dev 10 26 2024 (#6472) * docs(exception_mapping.md): add missing exception types Fixes https://github.com/Aider-AI/aider/issues/2120#issuecomment-2438971183 * fix(main.py): register custom model pricing with specific key Ensure custom model pricing is registered to the specific model+provider key combination * test: make testing more robust for custom pricing * fix(redis_cache.py): instrument otel logging for sync redis calls ensures complete coverage for all redis cache calls	2024-10-28 15:05:43 -07:00
Ishaan Jaff	3ccdb42d26	[Fix] OTEL - Don't log messages when callback settings disable message logging (#5875) * fix otel dont log messages * otel fix redis failure hook logging	2024-09-24 18:29:52 -07:00
Ishaan Jaff	91e58d9049	[Feat] Add proxy level prometheus metrics (#5789) * add Proxy Level Tracking Metrics doc * update service logger * prometheus - track litellm_proxy_failed_requests_metric * use REQUESTED_MODEL * fix prom request_data	2024-09-19 17:13:07 -07:00
Ishaan Jaff	911230c434	[Feat-Proxy-DataDog] Log Redis, Postgres Failure events on DataDog (#5750) * dd - start tracking redis status on dd * add async_service_succes_hook / failure hook in custom logger * add async_service_failure_hook * log service failures on dd * fix import error * add test for redis errors / warning	2024-09-17 20:24:06 -07:00
Ishaan Jaff	7d4e834091	fix handle case when service logger has no attribute prometheusServicesLogger	2024-08-08 08:23:29 -07:00
Ishaan Jaff	315bba34e6	prom svc logger init if it's None	2024-08-07 09:02:03 -07:00
Ishaan Jaff	f55a0d98f3	otel log failures	2024-08-05 20:23:02 -07:00
Ishaan Jaff	8d91112726	log event_metadata on otel	2024-08-05 20:03:34 -07:00
Ishaan Jaff	19fb5cc11c	use common helpers for writing to otel	2024-07-27 11:40:39 -07:00
Krrish Dholakia	606d04b05b	fix(_service_logging.py): only trigger otel if in service_callback Fixes https://github.com/BerriAI/litellm/issues/4511	2024-07-03 09:48:38 -07:00
Krrish Dholakia	a028600932	feat(dynamic_rate_limiter.py): update cache with active project	2024-06-21 20:25:40 -07:00
Ishaan Jaff	5a5dd33b24	feat - working exception logs for Redis errors	2024-06-07 16:30:29 -07:00
Ishaan Jaff	7c1183e76e	fix import OTEL span	2024-06-07 09:47:13 -07:00
Ishaan Jaff	b734cca43e	fix service logger for OTEL	2024-06-06 22:12:45 -07:00
Krrish Dholakia	81573b2dd9	fix(test_lowest_tpm_rpm_routing_v2.py): unit testing for usage-based-routing-v2	2024-04-18 21:38:00 -07:00
Krrish Dholakia	919a2876f1	fix(proxy/utils.py): add prometheus failed db request tracking	2024-04-18 16:30:29 -07:00
Krrish Dholakia	0f95a824c4	feat(prometheus_services.py): emit proxy latency for successful llm api requests uses prometheus histogram for this	2024-04-18 16:04:35 -07:00
Krrish Dholakia	4e81acf2c6	feat(prometheus_services.py): monitor health of proxy adjacent services (redis / postgres / etc.)	2024-04-13 18:15:02 -07:00

22 commits