litellm-mirror

Mirror of https://github.com/BerriAI/litellm.git, synced 2025-04-25 02:34:29 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	f0f2f819bd	Merge pull request #9760 from BerriAI/litellm_prometheus_error_monitoring [Reliability] Prometheus emit llm provider on failure metric - make it easy to differentiate litellm error vs llm api error	2025-04-04 21:37:28 -07:00
Ishaan Jaff	b89ed69257	Merge branch 'main' into litellm_add_auth_metrics_endpoint	2025-04-04 21:28:06 -07:00
Ishaan Jaff	f402e9bbd1	_get_exception_class_name	2025-04-04 21:23:21 -07:00
Ishaan Jaff	f16c531002	_mount_metrics_endpoint	2025-04-04 19:54:20 -07:00
Ishaan Jaff	253060cb09	allow requiring auth for /metrics endpoint	2025-04-04 17:35:02 -07:00
Ishaan Jaff	c402db9057	prometheus emit llm provider on failure metric	2025-04-04 17:07:43 -07:00
Ishaan Jaff	d3018a4c28	Merge branch 'main' into litellm_metrics_pod_lock_manager	2025-04-04 16:46:32 -07:00
Ishaan Jaff	901d6fe7b7	add operational metrics for pod lock manager v2 arch	2025-04-04 16:41:07 -07:00
Krish Dholakia	e1f7bcb47d	Fix VertexAI Credential Caching issue (#9756 ) * refactor(vertex_llm_base.py): Prevent credential misrouting for projects Fixes https://github.com/BerriAI/litellm/issues/7904 * fix: passing unit tests * fix(vertex_llm_base.py): common auth logic across sync + async vertex ai calls prevents credential caching issue across both flows * test: fix test * fix(vertex_llm_base.py): handle project id in default cause * fix(factory.py): don't pass cache control if not set bedrock invoke does not support this * test: fix test * fix(vertex_llm_base.py): add .exception message in load_auth * fix: fix ruff error	2025-04-04 16:38:08 -07:00
Ishaan Jaff	bde88b3ba6	fix type error	2025-04-04 16:34:43 -07:00
Ishaan Jaff	e3b788ea29	fix test	2025-04-02 21:58:35 -07:00
Ishaan Jaff	dd2d1dc2f4	Merge branch 'main' into litellm_metrics_pod_lock_manager	2025-04-02 21:35:55 -07:00
Krish Dholakia	8ee32291e0	Squashed commit of the following: (#9709 ) commit `b12a9892b7` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Wed Apr 2 08:09:56 2025 -0700 fix(utils.py): don't modify openai_token_counter commit `294de31803` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 21:22:40 2025 -0700 fix: fix linting error commit `cb6e9fbe40` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 19:52:45 2025 -0700 refactor: complete migration commit `bfc159172d` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 19:09:59 2025 -0700 refactor: refactor more constants commit `43ffb6a558` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:45:24 2025 -0700 fix: test commit `04dbe4310c` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:28:58 2025 -0700 refactor: refactor: move more constants into constants.py commit `3c26284aff` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:14:46 2025 -0700 refactor: migrate hardcoded constants out of __init__.py commit `c11e0de69d` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:11:21 2025 -0700 build: migrate all constants into constants.py commit `7882bdc787` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:07:37 2025 -0700 build: initial test banning hardcoded numbers in repo	2025-04-02 21:24:54 -07:00
Ishaan Jaff	bcf42fd82d	linting fix prometheus services	2025-04-02 21:19:05 -07:00
Ishaan Jaff	80fb4ece97	prom emit size of DB TX queues for observability	2025-04-02 18:39:29 -07:00
Ishaan Jaff	05b30e28db	clean up service metrics	2025-04-02 17:50:41 -07:00
Krish Dholakia	9b7ebb6a7d	build(pyproject.toml): add new dev dependencies - for type checking (#9631 ) * build(pyproject.toml): add new dev dependencies - for type checking * build: reformat files to fit black * ci: reformat to fit black * ci(test-litellm.yml): make tests run clear * build(pyproject.toml): add ruff * fix: fix ruff checks * build(mypy/): fix mypy linting errors * fix(hashicorp_secret_manager.py): fix passing cert for tls auth * build(mypy/): resolve all mypy errors * test: update test * fix: fix black formatting * build(pre-commit-config.yaml): use poetry run black * fix(proxy_server.py): fix linting error * fix: fix ruff safe representation error	2025-03-29 11:02:13 -07:00
Ishaan Jaff	fca5926600	default to use SLP for GCS PubSub	2025-03-24 15:21:59 -07:00
Ishaan Jaff	5d3bb86f07	define CustomPromptManagement	2025-03-19 16:22:23 -07:00
Ishaan Jaff	f5ef0c3cb7	fix code quality checks	2025-03-18 22:34:43 -07:00
Ishaan Jaff	0f2e095b6b	_arize_otel_logger	2025-03-18 22:19:51 -07:00
Ishaan Jaff	57e5c94360	Merge branch 'main' into litellm_arize_dynamic_logging	2025-03-18 22:13:35 -07:00
Ishaan Jaff	78a5dde31f	fix code qa	2025-03-18 17:07:44 -07:00
Ishaan Jaff	bd122f631e	fix arize config	2025-03-18 16:54:31 -07:00
Ishaan Jaff	de97cda445	refactor create_litellm_proxy_request_started_spen	2025-03-18 16:12:16 -07:00
Ishaan Jaff	7a5726fc88	fix - Arize - only log LLM I/O	2025-03-18 15:50:38 -07:00
Ishaan Jaff	f8c49175ec	fix _get_span_processor	2025-03-18 14:59:13 -07:00
Ishaan Jaff	b940c969fd	use _get_headers_dictionary	2025-03-18 14:55:39 -07:00
Ishaan Jaff	48663a0920	use safe dumps for arize ai	2025-03-18 14:30:00 -07:00
Nate Mar	a1d188ba5e	Fix test and add comments	2025-03-18 03:46:53 -07:00
Nate Mar	434e262b8c	revert space_key change and add tests for arize integration	2025-03-18 01:40:10 -07:00
Nate Mar	35e0856f11	Fix wrong import and use space_id instead of space_key for Arize integration	2025-03-17 20:37:28 -07:00
Krrish Dholakia	997f2f0b3e	fix(aim.py): fix linting error	2025-03-13 15:32:42 -07:00
Tomer Bin	4a31b32a88	Support post-call guards for stream and non-stream responses	2025-03-13 08:53:54 +02:00
Ishaan Jaff	b2d9935567	use ProxyBaseLLMRequestProcessing	2025-03-12 16:54:33 -07:00
vivek-athina	cd4a53d6f2	Merge pull request #4 from BerriAI/main Update main	2025-03-10 11:13:21 +05:30
Krrish Dholakia	8ea3d4c046	build: merge litellm_dev_03_01_2025_p2	2025-03-03 23:05:41 -08:00
Krrish Dholakia	4418e6dd14	build: merge branch	2025-03-02 08:31:57 -08:00
Ishaan Jaff	428ed1360c	fix overly verbose non blocking error on dd get_request_response_payload	2025-03-01 10:09:18 -08:00
Vivek Aditya	ed75dd61c2	Removed prints and added unit tests	2025-02-28 21:48:13 +05:30
Vivek Aditya	c40d45ae09	Added tags to additional keys that can be sent to athina	2025-02-26 21:00:56 +05:30
Krish Dholakia	9914c166b7	Litellm contributor prs 02 24 2025 (#8775 ) * Adding VertexAI Claude 3.7 Sonnet (#8774) Co-authored-by: Emerson Gomes <emerson.gomes@thalesgroup.com> * build(model_prices_and_context_window.json): add anthropic 3-7 models on vertex ai and bedrock * Support video_url (#8743) * Support video_url Support VLMs that works with video. Example implemenation in vllm: https://github.com/vllm-project/vllm/pull/10020 * llms openai.py: Add ChatCompletionVideoObject Add data structures to support `video_url` in chat completion * test test_completion.py: add test for video_url * Arize Phoenix - ensure correct endpoint/protocol are used; and default to phoenix cloud (#8750) * minor fixes to default to http and to ensure that the correct endpoint is used * Update test_arize_phoenix.py * prioritize http over grpc --------- Co-authored-by: Emerson Gomes <emerson.gomes@gmail.com> Co-authored-by: Emerson Gomes <emerson.gomes@thalesgroup.com> Co-authored-by: Pang Wu <104795337+pang-wu@users.noreply.github.com> Co-authored-by: Nate Mar <67926244+nate-mar@users.noreply.github.com>	2025-02-24 18:55:48 -08:00
Krish Dholakia	21ea52105a	Support arize phoenix on litellm proxy (#7756 ) (#8715 ) * Update opentelemetry.py wip * Update test_opentelemetry_unit_tests.py * fix a few paths and tests * fix path * Update litellm_logging.py * accidentally removed code * Add type for protocol * Add and update tests * minor changes * update and add additional arize phoenix test * update existing test * address feedback * use standard_logging_object * address feedback Co-authored-by: Nate Mar <67926244+nate-mar@users.noreply.github.com>	2025-02-22 20:55:11 -08:00
Ishaan Jaff	0d2b0ee1b7	(Bug fix) prometheus - safely set latency metrics (#8669 ) * use safe_duration_seconds * _safe_duration_seconds * test_set_latency_metrics_missing_timestamps	2025-02-19 20:08:46 -08:00
Krish Dholakia	bdc1a72542	refactor(teams.tsx): refactor to display all teams, across all orgs (#8565 ) * refactor(teams.tsx): refactor to display all teams, across all orgs removes org switcher from navbar, simplifies viewing/creating teams on UI * fix(key_list.tsx): show user keys across all orgs make it easy to see flat list of keys across orgs on key table * style(all_keys_table.tsx): cleanup keys table * fix(user_dashboard.tsx): remove overflow-hidden in dashboard component * fix(teams.tsx): move org id placement in create team flow * fix(teams.tsx): support model selection on create team based on selected org * feat(view_key_table.tsx): move to using a filter component on keys page allows filtering keys by org and team * fix(filter.tsx): handle reset filter * fix: fix linting error * (Feat) - return `x-litellm-attempted-fallbacks` in responses from litellm proxy (#8558) * add_fallback_headers_to_response * test x-litellm-attempted-fallbacks * unit test attempted fallbacks * fix add_fallback_headers_to_response * docs document response headers * fix file name * test fix use mock endpoints for e2e files and ft tests * Revert "test fix use mock endpoints for e2e files and ft tests" This reverts commit `c921d8dd81`. * cleanup_azure_files * Add remaining org CRUD endpoints + support deleting orgs on UI (#8561) * feat(organization_endpoints.py): expose new `/organization/delete` endpoint. 
Cascade org deletion to member, teams and keys Ensures any org deletion is handled correctly * test(test_organizations.py): add simple test to ensure org deletion works * feat(organization_endpoints.py): expose /organization/update endpoint, and define response models for org delete + update * fix(organizations.tsx): support org delete on UI + move org/delete endpoint to use DELETE * feat(organization_endpoints.py): support `/organization/member_update` endpoint Allow admin to update member's role within org * feat(organization_endpoints.py): support deleting member from org * test(test_organizations.py): add e2e test to ensure org member flow works * fix(organization_endpoints.py): fix code qa check * fix(schema.prisma): don't introduce ondelete:cascade - breaking change * docs(organization_endpoints.py): document missing params * refactor(organization_view.tsx): initial commit creating a generic update member component shared between org and team member classes * feat(organization_view.tsx): support updating org member role on UI * feat(organization_view.tsx): allow proxy admin to delete members from org * Enable update/delete org members on UI (#8560) * feat(organization_endpoints.py): expose new `/organization/delete` endpoint. 
Cascade org deletion to member, teams and keys Ensures any org deletion is handled correctly * test(test_organizations.py): add simple test to ensure org deletion works * feat(organization_endpoints.py): expose /organization/update endpoint, and define response models for org delete + update * fix(organizations.tsx): support org delete on UI + move org/delete endpoint to use DELETE * feat(organization_endpoints.py): support `/organization/member_update` endpoint Allow admin to update member's role within org * feat(organization_endpoints.py): support deleting member from org * test(test_organizations.py): add e2e test to ensure org member flow works * fix(organization_endpoints.py): fix code qa check * fix(schema.prisma): don't introduce ondelete:cascade - breaking change * docs(organization_endpoints.py): document missing params * (Bug Fix) - Add Regenerate Key on Virtual Keys Tab (#8567) * add regenerate key to ui * ui fix key info * (Bug Fix + Better Observability) - BudgetResetJob: (#8562) * use class ResetBudgetJob * refactor reset budget job * update reset_budget job * refactor reset budget job * fix LiteLLM_UserTable * refactor reset budget job * add telemetry for reset budget job * dd - log service success/failure on DD * add detailed reset budget reset info on DD * initialize_scheduled_background_jobs * refactor reset budget job * trigger service failure hook when fails to reset a budget for team, key, user * fix resetBudgetJob * unit testing for ResetBudgetJob * test_duration_in_seconds_basic * testing for triggering service logging * fix logs on test teams fail * remove unused imports * fix import duration in s * duration_in_seconds * (Patch/bug fix) - UI, filter out litellm ui session tokens on Virtual Keys Page (#8568) * fix key list endpoint * _get_condition_to_filter_out_ui_session_tokens * duration_in_seconds * test_list_key_helper_team_filtering * bump: version 1.61.4 → 1.61.5 * ui fix tsx linting * ui new build * 
test_list_key_helper_team_filtering * ui new build * test_openai_fine_tuning * test_openai_fine_tuning --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2025-02-15 23:50:11 -08:00
Ishaan Jaff	2753de1458	(Bug Fix + Better Observability) - BudgetResetJob: (#8562 ) * use class ResetBudgetJob * refactor reset budget job * update reset_budget job * refactor reset budget job * fix LiteLLM_UserTable * refactor reset budget job * add telemetry for reset budget job * dd - log service success/failure on DD * add detailed reset budget reset info on DD * initialize_scheduled_background_jobs * refactor reset budget job * trigger service failure hook when fails to reset a budget for team, key, user * fix resetBudgetJob * unit testing for ResetBudgetJob * test_duration_in_seconds_basic * testing for triggering service logging * fix logs on test teams fail * remove unused imports * fix import duration in s * duration_in_seconds	2025-02-15 16:13:08 -08:00
vivek-athina	fd0769f2ed	Added custom_attributes to additional_keys which can be sent to athina (#8518 )	2025-02-13 13:19:24 -08:00
Krish Dholakia	57e5ec07cc	Improved wildcard route handling on `/models` and `/model_group/info` (#8473 ) * fix(model_checks.py): update returning known model from wildcard to filter based on given model prefix ensures wildcard route - `vertex_ai/gemini-` just returns known vertex_ai/gemini- models test(test_proxy_utils.py): add unit testing for new 'get_known_models_from_wildcard' helper * test(test_models.py): add e2e testing for `/model_group/info` endpoint * feat(prometheus.py): support tracking total requests by user_email on prometheus adds initial support for tracking total requests by user_email * test(test_prometheus.py): add testing to ensure user email is always tracked * test: update testing for new prometheus metric * test(test_prometheus_unit_tests.py): add user email to total proxy metric * test: update tests * test: fix spend tests * test: fix test * fix(pagerduty.py): fix linting error	2025-02-11 19:37:43 -08:00
Ishaan Jaff	00c596a852	(Feat) - Allow viewing Request/Response Logs stored in GCS Bucket (#8449 ) * BaseRequestResponseFetchFromCustomLogger * get_active_base_request_response_fetch_from_custom_logger * get_request_response_payload * ui_view_request_response_for_request_id * fix uiSpendLogDetailsCall * fix get_request_response_payload * ui fix RequestViewer * use 1 class AdditionalLoggingUtils * ui_view_request_response_for_request_id * cache the prefetch logs details * refactor prefetch * test view request/resp logs * fix code quality * fix get_request_response_payload * uninstall posthog prevent it from being added in ci/cd * fix posthog * fix traceloop test * fix linting error	2025-02-10 20:38:55 -08:00
Ishaan Jaff	b535c9bdc0	(Bug Fix - Langfuse) - fix for when model response has `choices=[]` (#8339 ) * refactor _get_langfuse_input_output_content * test_langfuse_logging_completion_with_malformed_llm_response * fix _get_langfuse_input_output_content * fixes for langfuse linting * unit testing for get chat/text content for langfuse * fix _should_raise_content_policy_error	2025-02-06 18:02:26 -08:00

929 commits