litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-25 02:34:29 +00:00

Author	SHA1	Message	Date
Christian Owusu	47420d8d68	Require auth for all dashboard pages (#10229 ) * Require authentication for all Dashboard pages * Add test * Add test	2025-04-23 07:08:25 -07:00
Krish Dholakia	217681eb5e	Litellm dev 04 22 2025 p1 (#10206 ) * fix(openai.py): initial commit adding generic event type for openai responses api streaming Ensures handling for undocumented event types - e.g. "response.reasoning_summary_part.added" * fix(transformation.py): handle unknown openai response type * fix(datadog_llm_observability.py): handle dict[str, any] -> dict[str, str] conversion Fixes https://github.com/BerriAI/litellm/issues/9494 * test: add more unit testing * test: add unit test * fix(common_utils.py): fix message with content list * test: update testing	2025-04-22 23:58:43 -07:00
Krrish Dholakia	31f704a370	fix(internal_user_endpoints.py): add check on sortby value	2025-04-22 21:41:13 -07:00
Krish Dholakia	5f98d4d7de	UI - Users page - Enable global sorting (allows finding users with highest spend) (#10211 ) * fix(view_users.tsx): add time tracking logic to debounce search - prevent new queries from being overwritten by previous ones * fix(internal_user_endpoints.py): add sort functionality to user list endpoint * feat(internal_user_endpoints.py): support sort by on `/user/list` * fix(view_users.tsx): enable global sorting allows finding user with highest spend * feat(view_users.tsx): support filtering by sso user id * test(search_users.spec.ts): add tests to ensure filtering works * test: add more unit testing	2025-04-22 19:59:53 -07:00
Krish Dholakia	66680c421d	Add global filtering to Users tab (#10195 ) * style(internal_user_endpoints.py): add response model to `/user/list` endpoint make sure we maintain consistent response spec * fix(key_management_endpoints.py): return 'created_at' and 'updated_at' on `/key/generate` Show 'created_at' on UI when key created * test(test_keys.py): add e2e test to ensure created at is always returned * fix(view_users.tsx): support global search by user email allows easier search * test(search_users.spec.ts): add e2e test ensure user search works on admin ui * fix(view_users.tsx): support filtering user by role and user id More powerful filtering on internal users table * fix(view_users.tsx): allow filtering users by team * style(view_users.tsx): cleanup ui to show filters in consistent style * refactor(view_users.tsx): cleanup to just use 1 variable for the data * fix(view_users.tsx): cleanup use effect hooks * fix(internal_user_endpoints.py): fix check to pass testing * test: update tests * test: update tests * Revert "test: update tests" This reverts commit `6553eeb232`. * fix(view_userts.tsx): add back in 'previous' and 'next' tabs for pagination	2025-04-22 13:59:43 -07:00
Ishaan Jaff	0c2f705417	[Feat] Add Responses API - Routing Affinity logic for sessions (#10193 ) * test for test_responses_api_routing_with_previous_response_id * test_responses_api_routing_with_previous_response_id * add ResponsesApiDeploymentCheck * ResponsesApiDeploymentCheck * ResponsesApiDeploymentCheck * fix ResponsesApiDeploymentCheck * test_responses_api_routing_with_previous_response_id * ResponsesApiDeploymentCheck * test_responses_api_deployment_check.py * docs routing affinity * simplify ResponsesApiDeploymentCheck * test response id * fix code quality check	2025-04-21 20:00:27 -07:00
Ishaan Jaff	4eac0f64f3	[Feat] Pass through endpoints - ensure `PassthroughStandardLoggingPayload` is logged and contains method, url, request/response body (#10194 ) * ensure passthrough_logging_payload is filled in kwargs * test_assistants_passthrough_logging * test_assistants_passthrough_logging * test_assistants_passthrough_logging * test_threads_passthrough_logging * test _init_kwargs_for_pass_through_endpoint * _init_kwargs_for_pass_through_endpoint	2025-04-21 19:46:22 -07:00
Krrish Dholakia	4a50cf10fb	build: update ui build All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 23s Details Helm unit test / unit-test (push) Successful in 25s Details	2025-04-21 16:26:36 -07:00
Krish Dholakia	89131d8ed3	Remove user_id from url (#10192 ) * fix(user_dashboard.tsx): initial commit using user id from jwt instead of url * fix(proxy_server.py): remove user id from url fixes security issue around sharing url's * fix(user_dashboard.tsx): handle user id being null	2025-04-21 16:22:57 -07:00
Krrish Dholakia	a34778dda6	build(ui/): update ui build supports new non-user id in url flow	2025-04-21 16:22:28 -07:00
Krish Dholakia	0c3b7bb37d	fix(router.py): handle edge case where user sets 'model_group' inside… (#10191 ) * fix(router.py): handle edge case where user sets 'model_group' inside 'model_info' * fix(key_management_endpoints.py): security fix - return hashed token in 'token' field Ensures when creating a key on UI - only hashed token shown * test(test_key_management_endpoints.py): add unit test * test: update test	2025-04-21 16:17:45 -07:00
Nilanjan De	03245c732a	Fix: Potential SQLi in spend_management_endpoints.py (#9878 ) * fix: Potential SQLi in spend_management_endpoints.py * fix tests * test: add tests for global spend keys endpoint * chore: update error message * chore: lint * chore: rename test	2025-04-21 14:29:38 -07:00
Krish Dholakia	ce828408da	fix(proxy_server.py): pass llm router to get complete model list (#10176 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 21s Details Helm unit test / unit-test (push) Successful in 27s Details allows model auth to work	2025-04-19 22:27:49 -07:00
Krish Dholakia	e0a613f88a	fix(common_daily_activity.py): support empty entity id field (#10175 ) * fix(common_daily_activity.py): support empty entity id field allows returning empty response when user is not admin and does not belong to any team * test(test_common_daily_activity.py): add unit testing	2025-04-19 22:20:28 -07:00
Ishaan Jaff	36bcb3de4e	fix models appearing under test key page	2025-04-19 21:37:08 -07:00
Krish Dholakia	bbfcb1ac7e	Litellm release notes 04 19 2025 (#10169 ) * docs(index.md): initial draft release notes * docs: note all pending docs * build(model_prices_and_context_window.json): add o3, gpt-4.1, o4-mini pricing * docs(vllm.md): update vllm doc to show file message type support * docs(mistral.md): add mistral passthrough route doc * docs(gemini.md): add gemini thinking to docs * docs(vertex.md): add thinking/reasoning content for gemini models to docs * docs(index.md): more links * docs(index.md): add more links, images * docs(index.md): cleanup highlights	2025-04-19 17:26:30 -07:00
Ishaan Jaff	7c3df984da	can_user_call_model (#10170 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 51s Details Helm unit test / unit-test (push) Successful in 51s Details	2025-04-19 16:46:00 -07:00
Ishaan Jaff	c80e984d7e	ui new build	2025-04-19 14:19:33 -07:00
Krish Dholakia	5c929317cd	fix(triton/completion/transformation.py): remove bad_words / stop wor… (#10163 ) * fix(triton/completion/transformation.py): remove bad_words / stop words from triton call parameter 'bad_words' has invalid type. It should be either 'int', 'bool', or 'string'. * fix(proxy_track_cost_callback.py): add debug logging for track cost callback error	2025-04-19 11:23:37 -07:00
Krrish Dholakia	ba1b552e8b	fix(common_daily_activity.py): fix python 3_8 error	2025-04-19 08:39:19 -07:00
Krrish Dholakia	dee5182fc8	fix: fix linting error	2025-04-19 08:04:56 -07:00
Krish Dholakia	ef6ac42658	Litellm dev 04 18 2025 p2 (#10157 ) * fix(proxy/_types.py): allow internal user to call api playground * fix(new_usage.tsx): cleanup tag based usage - only show for proxy admin not clear what tags internal user should be allowed to see * fix(team_endpoints.py): allow internal user view spend for teams they belong to * fix(team_endpoints.py): return team alias on `/team/daily/activity` API allows displaying team alias on ui * fix: fix linting error * fix(entity_usage.tsx): allow viewing top keys by team * fix(entity_usage.tsx): show alias, if available in breakdown allows entity alias to be easily displayed * Show usage by key (on all up, team, and tag usage dashboards) (#10152) * fix(entity_usage.tsx): allow user to select team in team usage tab * fix(new_usage.tsx): load all tags for filtering * fix(tag_management_endpoints.py): return dynamic tags from db on `/tag/list` * fix(litellm_pre_call_utils.py): support x-litellm-tags even if tag based routing not enabled * fix(new_usage.tsx): show breakdown of usage by api key on dashboard helpful when looking at spend by team * fix(networking.tsx): exclude litellm-dashboard team id's from calls adds noisy ui tokens to key activity * fix(new_usage.tsx): allow user to see activity by key on main tab * feat(internal_user_endpoints.py): refactor to use common_daily_activity function reuses same logic across teams/keys/tags Allows returning team_alias in api_keys consistently * fix(leftnav.tsx): swap old usage with new usage tab * fix(entity_usage.tsx): show breakdown of teams in daily spend chart * style(new_usage.tsx): show global usage tab if user is admin / has admin view * fix(new_usage.tsx): add disclaimer for new usage dashboard * fix(new_usage.tsx): fix linting error * Allow filtering usage dashboard by team + tag (#10150) * fix(entity_usage.tsx): allow user to select team in team usage tab * fix(new_usage.tsx): load all tags for filtering * fix(tag_management_endpoints.py): return dynamic tags from db on `/tag/list` * fix(litellm_pre_call_utils.py): support x-litellm-tags even if tag based routing not enabled * fix: fix linting error	2025-04-19 07:32:23 -07:00
Ishaan Jaff	3d5022bd79	[Feat] Support for all litellm providers on Responses API (works with Codex) - Anthropic, Bedrock API, VertexAI, Ollama (#10132 ) * transform request * basic handler for LiteLLMCompletionTransformationHandler * complete transform litellm to responses api * fixes to test * fix stream=True * fix streaming iterator * fixes for transformation * fixes for anthropic codex support * fix pass response_api_optional_params * test anthropic responses api tools * update responses types * working codex with litellm * add session handler * fixes streaming iterator * fix handler * add litellm codex example * fix code quality * test fix * docs litellm codex * litellm codexdoc * docs openai codex with litellm * docs litellm openai codex * litellm codex * linting fixes for transforming responses API * fix import error * fix responses api test * add sync iterator support for responses api	2025-04-18 19:53:59 -07:00
Krrish Dholakia	614d80cb1b	build(model_prices_and_context_window.json): add azure gpt-4.1 pricing ensures cost tracking for gpt-4.1 works	2025-04-17 20:09:17 -07:00
Krrish Dholakia	ff81f48af3	bump: version 1.66.2 → 1.66.3	2025-04-16 22:20:10 -07:00
Krrish Dholakia	78c6d73dea	build: new ui build	2025-04-16 22:11:53 -07:00
Krish Dholakia	c73a6a8d1e	Add new `/vertex_ai/discovery` route - enables calling AgentBuilder API routes (#10084 ) * feat(llm_passthrough_endpoints.py): expose new `/vertex_ai/discovery/` endpoint Allows calling vertex ai discovery endpoints via passthrough For agentbuilder api calls * refactor(llm_passthrough_endpoints.py): use common _base_vertex_proxy_route Prevents duplicate code * feat(llm_passthrough_endpoints.py): add vertex endpoint specific passthrough handlers	2025-04-16 21:45:51 -07:00
Ishaan Jaff	12ccb954a6	ui new build	2025-04-16 19:23:04 -07:00
Ishaan Jaff	6220f3e7b8	[Feat SSO] Add LiteLLM SCIM Integration for Team and User management (#10072 ) * fix NewUser response type * add scim router * add v0 scim v2 endpoints * working scim transformation * use 1 file for types * fix scim firstname and givenName storage * working SCIMErrorResponse * working team / group provisioning on SCIM * add SCIMPatchOp * move scim folder * fix import scim_router * fix dont auto create scim keys * add auth on all scim endpoints * add is_virtual_key_allowed_to_call_route * fix allowed routes * fix for key management * fix allowed routes check * clean up error message * fix code check * fix for route checks * ui SCIM support * add UI tab for SCIM * fixes SCIM * fixes for SCIM settings on ui * scim settings * clean up scim view * add migration for allowed_routes in keys table * refactor scim transform * fix SCIM linting error * fix code quality check * fix ui linting * test_scim_transformations.py	2025-04-16 19:21:47 -07:00
Krish Dholakia	7ca553b235	Add team based usage dashboard at 1m+ spend logs (+ new `/team/daily/activity` API) (#10081 ) * feat(ui/): add team based usage to dashboard allows admin to see spend across teams + within teams at 1m+ spend logs * fix(entity_usage.tsx): add activity page to entity usage * style(entity_usage.tsx): place filter above tab switcher	2025-04-16 18:10:14 -07:00
Krish Dholakia	c0d7e9f16d	Add new `/tag/daily/activity` endpoint + Add tag dashboard to UI (#10073 ) Some checks failed Read Version from pyproject.toml / read-version (push) Successful in 15s Details Helm unit test / unit-test (push) Successful in 24s Details Publish Prisma Migrations / publish-migrations (push) Failing after 1m47s Details * feat: initial commit adding daily tag spend table to db * feat(db_spend_update_writer.py): correctly log tag spend transactions * build(schema.prisma): add new tag table to root * build: add new migration file * feat(common_daily_activity.py): add `/tag/daily/activity` API endpoint allows viewing daily spend by tag * feat(tag_management_endpoints.py): support comma separated list of tags + tag breakdown metric allows querying multiple tags + knowing what tags are driving spend * feat(entity_usage.tsx): initial commit adding tag based usage to litellm dashboard brings back tag based usage tracking to UI at 1m+ spend logs * feat(entity_usage.tsx): add top api key view to ui * feat(entity_usage.tsx): add tag table to ui * feat(entity_usage.tsx): allow filtering by tag * refactor(entity_usage.tsx): reorder components * build(ui/): fix linting error * fix: fix ruff checks * fix(schema.prisma): drop uniqueness requirement on tag allows dailytagspend to have multiple rows with the same tag * build(schema.prisma): drop uniqueness requirement on tag in dailytagspend allows tag agg. view to work on multiple rows with same tag * build(schema.prisma): drop tag uniqueness requirement	2025-04-16 15:24:44 -07:00
Krish Dholakia	d8a1071bc4	Add aggregate spend by tag (#10071 ) * feat: initial commit adding daily tag spend table to db * feat(db_spend_update_writer.py): correctly log tag spend transactions * build(schema.prisma): add new tag table to root * build: add new migration file	2025-04-16 12:26:21 -07:00
ChaoFu Yang	c07eea864e	/utils/token_counter: get model_info from deployment directly (#10047 )	2025-04-16 07:53:18 -07:00
Michael Leshchinsky	e19d05980c	Add litellm call id passing to Aim guardrails on pre and post-hooks calls (#10021 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 16s Details Helm unit test / unit-test (push) Successful in 19s Details * Add litellm_call_id passing to aim guardrails on pre and post-hooks * Add test that ensures that pre_call_hook receives litellm call id when common_request_processing called	2025-04-16 07:41:28 -07:00
Ishaan Jaff	1d4fea509d	ui new build	2025-04-15 22:36:44 -07:00
Ishaan Jaff	dcc43e797a	[Docs] Auto prompt caching (#10044 ) * docs prompt cache controls * doc fix auto prompt caching	2025-04-15 22:29:47 -07:00
Ishaan Jaff	bd88263b29	[Feat - Cost Tracking improvement] Track prompt caching metrics in DailyUserSpendTransactions (#10029 ) * stash changes * emit cache read/write tokens to daily spend update * emit cache read/write tokens on daily activity * update types.ts * docs prompt caching * undo ui change * fix activity metrics * fix prompt caching metrics * fix typed dict fields * fix get_aggregated_daily_spend_update_transactions * fix aggregating cache tokens * test_cache_token_fields_aggregation * daily_transaction * add cache_creation_input_tokens and cache_read_input_tokens to LiteLLM_DailyUserSpend * test_daily_spend_update_queue.py	2025-04-15 21:40:57 -07:00
Ishaan Jaff	d32d6fe03e	[UI] Bug Fix - Show created_at and updated_at for Users Page (#10033 ) * add created_at and updated_at as fields for internal user table * test_get_users_includes_timestamps	2025-04-15 21:15:44 -07:00
Krish Dholakia	9b77559ccf	Add aggregate team based usage logging (#10039 ) * feat(schema.prisma): initial commit adding aggregate table for team spend allows team spend to be visible at 1m+ logs * feat(db_spend_update_writer.py): support logging aggregate team spend allows usage dashboard to work at 1m+ logs * feat(litellm-proxy-extras/): add new migration file * fix(db_spend_update_writer.py): fix return type * build: bump requirements * fix: fix ruff error	2025-04-15 20:58:48 -07:00
Krish Dholakia	33ead69c0a	Support checking provider `/models` endpoints on proxy `/v1/models` endpoint (#9958 ) * feat(utils.py): support global flag for 'check_provider_endpoints' enables setting this for `/models` on proxy * feat(utils.py): add caching to 'get_valid_models' Prevents checking endpoint repeatedly * fix(utils.py): ensure mutations don't impact cached results * test(test_utils.py): add unit test to confirm cache invalidation logic * feat(utils.py): get_valid_models - support passing litellm params dynamically Allows for checking endpoints based on received credentials * test: update test * feat(model_checks.py): pass router credentials to get_valid_models - ensures it checks correct credentials * refactor(utils.py): refactor for simpler functions * fix: fix linting errors * fix(utils.py): fix test * fix(utils.py): set valid providers to custom_llm_provider, if given * test: update test * fix: fix ruff check error	2025-04-14 23:23:20 -07:00
Krish Dholakia	9b0f871129	Add `/vllm/` and `/mistral/` passthrough endpoints (adds support for Mistral OCR via passthrough) * feat(llm_passthrough_endpoints.py): support mistral passthrough Closes https://github.com/BerriAI/litellm/issues/9051 * feat(llm_passthrough_endpoints.py): initial commit for adding vllm passthrough route * feat(vllm/common_utils.py): add new vllm model info route make it possible to use vllm passthrough route via factory function * fix(llm_passthrough_endpoints.py): add all methods to vllm passthrough route * fix: fix linting error * fix: fix linting error * fix: fix ruff check * fix(proxy/_types.py): add new passthrough routes * docs(config_settings.md): add mistral env vars to docs	2025-04-14 22:06:33 -07:00
Ishaan Jaff	ce2595f56a	bump: version 1.66.0 → 1.66.1	2025-04-14 21:30:07 -07:00
Ishaan Jaff	b210639dce	ui new build	2025-04-14 21:19:21 -07:00
Ishaan Jaff	c1a642ce20	[UI] Allow setting prompt `cache_control_injection_points` (#10000 ) * test_anthropic_cache_control_hook_system_message * test_anthropic_cache_control_hook.py * should_run_prompt_management_hooks * fix should_run_prompt_management_hooks * test_anthropic_cache_control_hook_specific_index * fix test * fix linting errors * ChatCompletionCachedContent * initial commit for cache control * fixes ui design * fix inserting cache_control_injection_points * fix entering cache control points * fixes for using cache control on ui + backend * update cache control settings on edit model page * fix init custom logger compatible class * fix linting errors * fix linting errors * fix get_chat_completion_prompt	2025-04-14 21:17:42 -07:00
Krish Dholakia	2ed593e052	Updated cohere v2 passthrough (#9997 ) * Add cohere `/v2/chat` pass-through cost tracking support (#8235) * feat(cohere_passthrough_handler.py): initial working commit with cohere passthrough cost tracking * fix(v2_transformation.py): support cohere /v2/chat endpoint * fix: fix linting errors * fix: fix import * fix(v2_transformation.py): fix linting error * test: handle openai exception change	2025-04-14 19:51:01 -07:00
Ishaan Jaff	72c1f7e09a	ui new build	2025-04-12 20:42:43 -07:00
Ishaan Jaff	89dfb42697	[UI QA checklist] (#9957 ) * fix typo on UI * fix for edit user tab * fix for user spend * add /team/permissions_list to management routes * fix auth check for team member permissions * fix team endpoints test	2025-04-12 20:41:50 -07:00
Krish Dholakia	00e49380df	Litellm UI qa 04 12 2025 p1 (#9955 ) * fix(model_info_view.tsx): cleanup text * fix(key_management_endpoints.py): fix filtering litellm-dashboard keys for internal users * fix(proxy_track_cost_callback.py): prevent flooding spend logs with admin endpoint errors * test: add unit testing for logic * test(test_auth_exception_handler.py): add more unit testing * fix(router.py): correctly handle retrieving model info on get_model_group_info fixes issue where model hub was showing None prices * fix: fix linting errors	2025-04-12 19:30:48 -07:00
Ishaan Jaff	c86e678809	[Docs] v1.66.0-stable fixes (#9953 ) * add categories for spend tracking improvements * xai reasoning usage * docs tag management * docs tag based routing * [Beta] Routing based * docs tag based routing * docs tag routing * docs enterprise web search	2025-04-12 16:57:25 -07:00
Ishaan Jaff	4e81b2cab4	[Team Member permissions] - Fixes (#9945 ) * only load member permissions for non-admins * run member permission checks on update + regenerate endpoints * run check for /key/generate * working test_default_member_permissions * passing test with permissions on update delete endpoints * test_create_permissions * _team_key_generation_check * fix TeamBase * fix team endpoints * fix api docs check	2025-04-12 11:17:51 -07:00

1 2 3 4 5 ...

4702 commits