litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-25 18:54:30 +00:00

Author	SHA1	Message	Date
Krish Dholakia	2488e4b45f	Cost tracking improvements (#5828 ) * feat(litellm_logging.py): update standard logging payload to include debug information for cost failures Also includes fixes for cohere rerank cost tracking + databricks llama2 model cost tracking Easier to repro cost failures and improve reliability in prod * fix(proxy_server.py): emit cost failure debug info for slack alerting Improves debug information for cost tracking failures, on slack alerting	2024-09-21 21:47:50 -07:00
Krish Dholakia	d46660ea0f	LiteLLM Minor Fixes & Improvements (09/18/2024) (#5772 ) * fix(proxy_server.py): fix azure key vault logic to not require client id/secret * feat(cost_calculator.py): support fireworks ai cost tracking * build(docker-compose.yml): add lines for mounting config.yaml to docker compose Closes https://github.com/BerriAI/litellm/issues/5739 * fix(input.md): update docs to clarify litellm supports content as a list of dictionaries Fixes https://github.com/BerriAI/litellm/issues/5755 * fix(input.md): update input.md to include all message values * fix(image_handling.py): follow image url redirects Fixes https://github.com/BerriAI/litellm/issues/5763 * fix(router.py): Fix model key/base leak in error message Fixes https://github.com/BerriAI/litellm/issues/5762 * fix(http_handler.py): fix linting error * fix(azure.py): fix logging to show azure_ad_token being used Fixes https://github.com/BerriAI/litellm/issues/5767 * fix(_redis.py): add redis sentinel support Closes https://github.com/BerriAI/litellm/issues/4381 * feat(_redis.py): add redis sentinel support Closes https://github.com/BerriAI/litellm/issues/4381 * test(test_completion_cost.py): fix test * Databricks Integration: Integrate Databricks SDK as optional mechanism for fetching API base and token, if unspecified (#5746) * LiteLLM Minor Fixes & Improvements (09/16/2024) (#5723) * coverage (#5713) Signed-off-by: dbczumar <corey.zumar@databricks.com> * Move (#5714) Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix(litellm_logging.py): fix logging client re-init (#5710) Fixes https://github.com/BerriAI/litellm/issues/5695 * fix(presidio.py): Fix logging_hook response and add support for additional presidio variables in guardrails config Fixes https://github.com/BerriAI/litellm/issues/5682 * feat(o1_handler.py): fake streaming for openai o1 models Fixes https://github.com/BerriAI/litellm/issues/5694 * docs: deprecated traceloop integration in favor of native otel (#5249) * fix: fix linting errors * fix: fix linting errors * fix(main.py): fix o1 import --------- Signed-off-by: dbczumar <corey.zumar@databricks.com> Co-authored-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com> Co-authored-by: Nir Gazit <nirga@users.noreply.github.com> * feat(spend_management_endpoints.py): expose `/global/spend/refresh` endpoint for updating material view (#5730) * feat(spend_management_endpoints.py): expose `/global/spend/refresh` endpoint for updating material view Supports having `MonthlyGlobalSpend` view be a material view, and exposes an endpoint to refresh it * fix(custom_logger.py): reset calltype * fix: fix linting errors * fix: fix linting error * fix Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix: fix import * Fix Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix Signed-off-by: dbczumar <corey.zumar@databricks.com> * DB test Signed-off-by: dbczumar <corey.zumar@databricks.com> * Coverage Signed-off-by: dbczumar <corey.zumar@databricks.com> * progress Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix test name Signed-off-by: dbczumar <corey.zumar@databricks.com> --------- Signed-off-by: dbczumar <corey.zumar@databricks.com> Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com> Co-authored-by: Nir Gazit <nirga@users.noreply.github.com> * test: fix test * test(test_databricks.py): fix test * fix(databricks/chat.py): handle custom endpoint (e.g. sagemaker) * Apply code scanning fix for clear-text logging of sensitive information Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * fix(__init__.py): fix known fireworks ai models --------- Signed-off-by: dbczumar <corey.zumar@databricks.com> Co-authored-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com> Co-authored-by: Nir Gazit <nirga@users.noreply.github.com> Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2024-09-19 13:25:29 -07:00
Krish Dholakia	98c335acd0	LiteLLM Minor Fixes & Improvements (09/17/2024) (#5742 ) * fix(proxy_server.py): use default azure credentials to support azure non-client secret kms * fix(langsmith.py): raise error if credentials missing * feat(langsmith.py): support error logging for langsmith + standard logging payload Fixes https://github.com/BerriAI/litellm/issues/5738 * Fix hardcoding of schema in view check (#5749) * fix - deal with case when check view exists returns None (#5740) * Revert "fix - deal with case when check view exists returns None (#5740)" (#5741) This reverts commit `535228159b`. * test(test_router_debug_logs.py): move to mock response * Fix hardcoding of schema --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com> * fix(proxy_server.py): allow admin to disable ui via `DISABLE_ADMIN_UI` flag * fix(router.py): fix default model name value Fixes `55db19a1e4 (r1763712148)` * fix(utils.py): fix unbound variable error * feat(rerank/main.py): add azure ai rerank endpoints Closes https://github.com/BerriAI/litellm/issues/5667 * feat(secret_detection.py): Allow configuring secret detection params Allows admin to control what plugins to run for secret detection. Prevents overzealous secret detection. * docs(secret_detection.md): add secret detection guardrail docs * fix: fix linting errors * fix - deal with case when check view exists returns None (#5740) * Revert "fix - deal with case when check view exists returns None (#5740)" (#5741) This reverts commit `535228159b`. * Litellm fix router testing (#5748) * test: fix testing - azure changed content policy error logic * test: fix tests to use mock responses * test(test_image_generation.py): handle api instability * test(test_image_generation.py): handle azure api instability * fix(utils.py): fix unbounded variable error * fix(utils.py): fix unbounded variable error * test: refactor test to use mock response * test: mark flaky azure tests * Bump next from 14.1.1 to 14.2.10 in /ui/litellm-dashboard (#5753) Bumps [next](https://github.com/vercel/next.js) from 14.1.1 to 14.2.10. - [Release notes](https://github.com/vercel/next.js/releases) - [Changelog](https://github.com/vercel/next.js/blob/canary/release.js) - [Commits](https://github.com/vercel/next.js/compare/v14.1.1...v14.2.10) --- updated-dependencies: - dependency-name: next dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * [Fix] o1-mini causes pydantic warnings on `reasoning_tokens` (#5754) * add requester_metadata in standard logging payload * log requester_metadata in metadata * use StandardLoggingPayload for logging * docs StandardLoggingPayload * fix import * include standard logging object in failure * add test for requester metadata * handle completion_tokens_details * add test for completion_tokens_details * [Feat-Proxy-DataDog] Log Redis, Postgres Failure events on DataDog (#5750) * dd - start tracking redis status on dd * add async_service_succes_hook / failure hook in custom logger * add async_service_failure_hook * log service failures on dd * fix import error * add test for redis errors / warning * [Fix] Router/ Proxy - Tag Based routing, raise correct error when no deployments found and tag filtering is on (#5745) * fix tag routing - raise correct error when no model with tag based routing * fix error string from tag based routing * test router tag based routing * raise 401 error when no tags avialable for deploymen * linting fix * [Feat] Log Request metadata on gcs bucket logging (#5743) * add requester_metadata in standard logging payload * log requester_metadata in metadata * use StandardLoggingPayload for logging * docs StandardLoggingPayload * fix import * include standard logging object in failure * add test for requester metadata * fix(litellm_logging.py): fix logging message * fix(rerank_api/main.py): fix linting errors * fix(custom_guardrails.py): maintain backwards compatibility for older guardrails * fix(rerank_api/main.py): fix cost tracking for rerank endpoints --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: steffen-sbt <148480574+steffen-sbt@users.noreply.github.com> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-09-17 23:00:04 -07:00
Krish Dholakia	0295a22561	LiteLLM Minor Fixes and Improvements (09/10/2024) (#5618 ) * fix(cost_calculator.py): move to debug for noisy warning message on cost calculation error Fixes https://github.com/BerriAI/litellm/issues/5610 * fix(databricks/cost_calculator.py): Handles model name issues for databricks models * fix(main.py): fix stream chunk builder for multiple tool calls Fixes https://github.com/BerriAI/litellm/issues/5591 * fix: correctly set user_alias when passed in Fixes https://github.com/BerriAI/litellm/issues/5612 * fix(types/utils.py): allow passing role for message object https://github.com/BerriAI/litellm/issues/5621 * fix(litellm_logging.py): Fix langfuse logging across multiple projects Fixes issue where langfuse logger was re-using the old logging object * feat(proxy/_types.py): support adding key-based tags for tag-based routing Enable tag based routing at key-level * fix(proxy/_types.py): fix inheritance * test(test_key_generate_prisma.py): fix test * test: fix test * fix(litellm_logging.py): return used callback object	2024-09-11 11:30:29 -07:00
Krish Dholakia	2d2282101b	LiteLLM Minor Fixes and Improvements (09/09/2024) (#5602 ) * fix(main.py): pass default azure api version as alternative in completion call Fixes api error caused due to api version Closes https://github.com/BerriAI/litellm/issues/5584 * Fixed gemini-1.5-flash pricing (#5590) * add /key/list endpoint * bump: version 1.44.21 → 1.44.22 * docs architecture * Fixed gemini-1.5-flash pricing --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> * fix(bedrock/chat.py): fix converse api stop sequence param mapping Fixes https://github.com/BerriAI/litellm/issues/5592 * fix(databricks/cost_calculator.py): handle databricks model name changes Fixes https://github.com/BerriAI/litellm/issues/5597 * fix(azure.py): support azure api version 2024-08-01-preview Closes https://github.com/BerriAI/litellm/issues/5377 * fix(proxy/_types.py): allow dev keys to call cohere /rerank endpoint Fixes issue where only admin could call rerank endpoint * fix(azure.py): check if model is gpt-4o * fix(proxy/_types.py): support /v1/rerank on non-admin routes as well * fix(cost_calculator.py): fix split on `/` logic in cost calculator --------- Co-authored-by: F1bos <44951186+F1bos@users.noreply.github.com> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2024-09-09 21:56:12 -07:00
Ishaan Jaff	3c16fcff1b	fix linting errors	2024-09-06 16:41:47 -07:00
Ishaan Jaff	e095daf2e4	add cost tracking for rerank	2024-09-06 16:04:54 -07:00
Ishaan Jaff	4a0fdc40f1	add cost tracking for pass through imagen	2024-09-02 18:10:46 -07:00
Krish Dholakia	9c8f1d7815	anthropic prompt caching cost tracking (#5453 ) * fix(utils.py): support 'drop_params' for embedding requests Fixes https://github.com/BerriAI/litellm/issues/5444 * feat(anthropic/cost_calculation.py): Support calculating cost for prompt caching on anthropic * feat(types/utils.py): allows us to migrate to openai's equivalent, once that comes out * fix: fix linting errors * test: mark flaky test	2024-08-31 14:09:35 -07:00
Krrish Dholakia	55217fa8d7	feat(cost_calculator.py): only override base model if custom pricing is set	2024-08-19 16:05:49 -07:00
Krish Dholakia	1a3b686580	Merge pull request #5219 from dhlidongming/fix-messages-length-check Fix incorrect message length check in cost calculator	2024-08-17 14:01:59 -07:00
Krrish Dholakia	bc0023a409	feat(google_ai_studio_endpoints.py): support pass-through endpoint for all google ai studio requests New Feature	2024-08-17 10:46:59 -07:00
Krrish Dholakia	a92dcdd2d6	fix(litellm_logging.py): fix price information logging to s3	2024-08-16 16:42:38 -07:00
Krrish Dholakia	178139f18d	feat(litellm_logging.py): support logging model price information to s3 logs	2024-08-16 16:21:34 -07:00
lidongming	e1f53fcc80	Fix incorrect message length check in cost calculator	2024-08-15 16:59:38 +08:00
Krrish Dholakia	ef8fb23334	fix(cost_calculator.py): fix cost calc	2024-08-12 16:47:15 -07:00
Krrish Dholakia	22e2840daa	fix(cost_calculator.py): handle openai usage pydantic object Fixes https://github.com/BerriAI/litellm/issues/5165	2024-08-12 15:45:21 -07:00
Krrish Dholakia	aad0bbb08c	fix(cost_calculator.py): respect litellm.suppress_debug_info for cost calc Fixes https://github.com/BerriAI/litellm/issues/4818#issuecomment-2263795765	2024-08-01 12:27:09 -07:00
Krrish Dholakia	46634af06f	fix(utils.py): fix model registeration to model cost map Fixes https://github.com/BerriAI/litellm/issues/4972	2024-07-30 18:15:00 -07:00
Krrish Dholakia	6d5aedc48d	feat(databricks.py): support vertex mistral cost tracking	2024-07-27 20:22:35 -07:00
Krrish Dholakia	959c627dd3	fix(litellm_logging.py): log response_cost=0 for failed calls Fixes https://github.com/BerriAI/litellm/issues/4604	2024-07-15 19:25:56 -07:00
Krrish Dholakia	2163434ff3	fix(llm_cost_calc/google.py): fix google embedding cost calculation Fixes https://github.com/BerriAI/litellm/issues/4630	2024-07-11 11:55:48 -07:00
Krish Dholakia	127f08ee67	Merge branch 'main' into litellm_tts_pricing	2024-07-06 14:57:34 -07:00
Krrish Dholakia	f62884da14	fix(cost_calculator.py): fix completion_response check	2024-07-06 12:28:46 -07:00
Krrish Dholakia	6e43cdcb17	feat(litellm_logging.py): support cost tracking for tts calls	2024-07-05 22:09:08 -07:00
Krrish Dholakia	407639cc7d	fix(cost_calculator.py): support openai+azure tts calls	2024-07-05 20:58:08 -07:00
Krrish Dholakia	0001683036	fix(cost_calculator.py): handle unexpected error in cost_calculator.py	2024-06-28 14:53:00 -07:00
Krish Dholakia	869275585a	Merge branch 'main' into litellm_response_cost_headers	2024-06-27 21:33:09 -07:00
Krrish Dholakia	94c069e869	fix(cost_calculator.py): infer provider name if not given Fixes https://github.com/BerriAI/litellm/issues/4452	2024-06-27 18:41:04 -07:00
Krrish Dholakia	f533e1da09	fix(utils.py): return 'response_cost' in completion call Closes https://github.com/BerriAI/litellm/issues/4335	2024-06-26 17:55:57 -07:00
spdustin@gmail.com	4acc2d50ad	fix: use per-token costs for claude via vertex_ai	2024-06-21 11:21:36 -05:00
Krish Dholakia	71716bec48	Merge pull request #4295 from BerriAI/litellm_gemini_pricing_2 Vertex AI - character based cost calculation	2024-06-19 19:17:09 -07:00
Krrish Dholakia	16da21e839	feat(llm_cost_calc/google.py): do character based cost calculation for vertex ai Calculate cost for vertex ai responses using characters in query/response Closes https://github.com/BerriAI/litellm/issues/4165	2024-06-19 17:18:42 -07:00
Ishaan Jaff	863c53e7e9	fix add cost tracking for ft:gpt-4o-2024-05-1	2024-06-19 16:59:06 -07:00
Krrish Dholakia	df753a8ab2	fix(cost_calculator.py): fix time import	2024-06-17 20:27:18 -07:00
Krrish Dholakia	f597aa432b	feat(cost_calculator.py): add cost calculating for dynamic context window (vertex ai / google ai studio)	2024-06-17 12:38:10 -07:00
Krrish Dholakia	4f91205530	refactor(utils.py): refactor Logging to it's own class. Cut down utils.py to <10k lines. Easier debugging Reference: https://github.com/BerriAI/litellm/issues/4206	2024-06-15 10:57:20 -07:00
Ishaan Jaff	43eef61aa7	fix azure cost tracking	2024-06-10 21:09:55 -07:00
Krrish Dholakia	f3a845eff9	build(model_prices_and_context_window.json): update together ai model pricing - account for new categories	2024-06-08 19:56:35 -07:00
Krrish Dholakia	b26c3c7d22	fix(cost_calculator.py): fixes tgai unmapped model pricing Fixes error where tgai helper function returned None. Enforces stronger type hints, refactors code, adds more unit testing.	2024-06-08 19:43:57 -07:00
Krrish Dholakia	52a2f5150c	fix(utils.py): fix cost calculation for openai-compatible streaming object	2024-06-04 10:36:25 -07:00

41 commits