Ishaan Jaff
0c37051b98
supports_reasoning
2025-04-11 17:41:57 -07:00
Ishaan Jaff
0bc45749c8
fix test
2025-04-11 17:18:13 -07:00
Ishaan Jaff
0718110865
"supports_reasoning": false,
2025-04-11 17:04:47 -07:00
Ishaan Jaff
c574824b82
fix supports_reasoning test
2025-04-11 17:01:41 -07:00
Ishaan Jaff
45a5ee9cb4
docs supports reasoning
2025-04-11 16:55:19 -07:00
Ishaan Jaff
8a40fa0f56
add supports_reasoning
2025-04-11 16:47:38 -07:00
Ishaan Jaff
02c265181c
add supports reasoning to model group info
2025-04-11 16:46:59 -07:00
Ishaan Jaff
05540713fd
test_supports_reasoning
2025-04-11 16:02:35 -07:00
Ishaan Jaff
fd9c4d5e67
add deepseek as supports reasoning
2025-04-11 15:51:02 -07:00
Ishaan Jaff
6e45c55313
add supports reasoning for claude 3-7 models
2025-04-11 15:47:12 -07:00
Ishaan Jaff
b9db110449
add litellm.supports_reasoning
2025-04-11 15:42:12 -07:00
Ishaan Jaff
2dca5781ad
add supports_reasoning util
2025-04-11 15:41:47 -07:00
Ishaan Jaff
4367635fd7
add "supports_reasoning": true for o1 series models
2025-04-11 15:40:13 -07:00
Ishaan Jaff
d2955e4528
add supports_reasoning for xai models
2025-04-11 15:37:15 -07:00
Ishaan Jaff
2d6ad534bc
[Feat - PR1] Add xAI grok-3 models to LiteLLM ( #9920 )
...
* add xai/grok-3-mini-beta, xai/grok-3-beta
* add grok-3-fast-latest models
* supports_response_schema
* fix pricing
* docs xai
2025-04-11 15:12:12 -07:00
Marc Abramowitz
fc14931be9
Fix typo: Entrata -> Entra in docs ( #9921 )
2025-04-11 15:08:57 -07:00
Ishaan Jaff
8b1d2d6956
[Feat - UI] - Allow setting Default Team setting when LiteLLM SSO auto creates teams ( #9918 )
...
* endpoint for updating default team settings on ui
* add GET default team settings endpoint
* ui expose default team settings on UI
* update to use DefaultTeamSSOParams
* DefaultTeamSSOParams
* fix DefaultTeamSSOParams
* docs team management
* test_update_default_team_settings
2025-04-11 14:07:10 -07:00
Manuel Cañete
c4ea1ab61b
feat: add extraEnvVars to the helm deployment ( #9292 )
Read Version from pyproject.toml / read-version (push) Successful in 17s
Helm unit test / unit-test (push) Successful in 22s
2025-04-11 10:32:16 -07:00
Krish Dholakia
0415f1205e
Litellm dev 04 10 2025 p3 ( #9903 )
...
* feat(managed_files.py): encode file type in unified file id
simplify calling gemini models
* fix(common_utils.py): fix extracting file type from unified file id
* fix(litellm_logging.py): create standard logging payload for create file call
* fix: fix linting error
2025-04-11 09:29:42 -07:00
Ishaan Jaff
8ecd9ede81
docs clean up
Read Version from pyproject.toml / read-version (push) Successful in 17s
Helm unit test / unit-test (push) Successful in 22s
2025-04-10 21:35:08 -07:00
Krish Dholakia
9f27e8363f
Realtime API: Support 'base_model' cost tracking + show response in spend logs (if enabled) ( #9897 )
...
* refactor(litellm_logging.py): refactor realtime cost tracking to use common code as rest
Ensures basic features like base model just work
* feat(realtime/): support 'base_model' cost tracking on realtime api
Fixes issue where base model was not working on realtime
* fix: fix ruff linting error
* test: fix test
2025-04-10 21:24:45 -07:00
Krish Dholakia
78879c68a9
Revert avglogprobs change + Add azure/gpt-4o-realtime-audio cost tracking ( #9893 )
...
* test: initial commit fixing gemini logprobs
Fixes https://github.com/BerriAI/litellm/issues/9888
* fix(vertex_and_google_ai_studio.py): Revert avglogprobs change
Fixes https://github.com/BerriAI/litellm/issues/8890
* build(model_prices_and_context_window.json): add gpt-4o-realtime-preview cost to model cost map
Fixes https://github.com/BerriAI/litellm/issues/9814
* test: add cost calculation unit testing
* test: fix test
* test: update test
2025-04-10 21:23:55 -07:00
Ishaan Jaff
892964272f
docs msft SSO
Read Version from pyproject.toml / read-version (push) Successful in 21s
Helm unit test / unit-test (push) Successful in 25s
2025-04-10 20:33:14 -07:00
Ishaan Jaff
c71e98b05a
docs MSFT sso
2025-04-10 20:30:52 -07:00
Ishaan Jaff
9ebbf52249
docs self serve msft SSO
2025-04-10 20:25:43 -07:00
Ishaan Jaff
1197746ab3
bump: version 1.65.6 → 1.65.7
2025-04-10 20:23:08 -07:00
Ishaan Jaff
595c0cbb42
docs msft default team settings
2025-04-10 20:22:46 -07:00
Ishaan Jaff
34b1bf2c75
docs msft_default_settings
2025-04-10 20:21:14 -07:00
Ishaan Jaff
f5c5c79ea4
update docs
2025-04-10 20:18:54 -07:00
Ishaan Jaff
98e34cbf5d
[Docs] Tutorial using MSFT auto team assignment with LiteLLM ( #9898 )
...
* add default_team_params as a config.yaml setting
* create_litellm_team_from_sso_group
* test_default_team_params
* test_create_team_without_default_params
* docs default team settings
* docs msft entra id tutorial
* commit litellm docs msft group assignment
* litellm MSFT sso
* member, team assignment on litellm
* docs msft auto assignment
* bug fix default team setting
* docs litellm default team settings
* test_default_team_params
2025-04-10 20:07:55 -07:00
Ishaan Jaff
72a12e91c4
[Bug Fix MSFT SSO] Use correct field for user email when using MSFT SSO ( #9886 )
...
* fix openid_from_response
* test_microsoft_sso_handler_openid_from_response_user_principal_name
* test upsert_sso_user
2025-04-10 17:40:58 -07:00
Ishaan Jaff
94a553dbb2
[Feat] Emit Key, Team Budget metrics on a cron job schedule ( #9528 )
...
* _initialize_remaining_budget_metrics
* initialize_budget_metrics_cron_job
* initialize_budget_metrics_cron_job
* initialize_budget_metrics_cron_job
* test_initialize_budget_metrics_cron_job
* LITELLM_PROXY_ADMIN_NAME
* fix code qa checks
* test_initialize_budget_metrics_cron_job
* test_initialize_budget_metrics_cron_job
* pod lock manager allow dynamic cron job ID
* fix pod lock manager
* require cronjobid for PodLockManager
* fix DB_SPEND_UPDATE_JOB_NAME acquire / release lock
* add comment on prometheus logger
* add debug statements for emitting key, team budget metrics
* test_pod_lock_manager.py
* test_initialize_budget_metrics_cron_job
* initialize_budget_metrics_cron_job
* initialize_remaining_budget_metrics
* remove outdated test
2025-04-10 16:59:14 -07:00
Ishaan Jaff
90d862b041
[Feat SSO] - Allow admins to set default_team_params
to have default params for when litellm SSO creates default teams ( #9895 )
...
* add default_team_params as a config.yaml setting
* create_litellm_team_from_sso_group
* test_default_team_params
* test_create_team_without_default_params
* docs default team settings
2025-04-10 16:58:28 -07:00
Krrish Dholakia
7d383fc0c1
test: update testing
2025-04-10 14:15:58 -07:00
Krrish Dholakia
b168f8b744
test: update test
2025-04-10 14:04:57 -07:00
Krrish Dholakia
cd878bdd71
bump: version 1.65.5 → 1.65.6
Read Version from pyproject.toml / read-version (push) Successful in 17s
Helm unit test / unit-test (push) Successful in 24s
2025-04-09 22:32:50 -07:00
Krish Dholakia
0dbd663877
fix(cost_calculator.py): handle custom pricing at deployment level fo… ( #9855 )
...
* fix(cost_calculator.py): handle custom pricing at deployment level for router
* test: add unit tests
* fix(router.py): show custom pricing on UI
check correct model str
* fix: fix linting error
* docs(custom_pricing.md): clarify custom pricing for proxy
Fixes https://github.com/BerriAI/litellm/issues/8573#issuecomment-2790420740
* test: update code qa test
* fix: cleanup traceback
* fix: handle litellm param custom pricing
* test: update test
* fix(cost_calculator.py): add router model id to list of potential model names
* fix(cost_calculator.py): fix router model id check
* fix: router.py - maintain older model registry approach
* fix: fix ruff check
* fix(router.py): router get deployment info
add custom values to mapped dict
* test: update test
* fix(utils.py): update only if value is non-null
* test: add unit test
2025-04-09 22:13:10 -07:00
Krish Dholakia
0c5b4aa96d
feat(realtime/): add token tracking + log usage object in spend logs … ( #9843 )
...
* feat(realtime/): add token tracking + log usage object in spend logs metadata
* test: fix test
* test: update tests
* test: update testing
* test: update test
* test: update test
* test: update test
* test: update test
* test: update tesdt
* test: update test
2025-04-09 22:11:00 -07:00
Krish Dholakia
87733c8193
Fix anthropic prompt caching cost calc + trim logged message in db ( #9838 )
...
* fix(spend_tracking_utils.py): prevent logging entire mp4 files to db
Fixes https://github.com/BerriAI/litellm/issues/9732
* fix(anthropic/chat/transformation.py): Fix double counting cache creation input tokens
Fixes https://github.com/BerriAI/litellm/issues/9812
* refactor(anthropic/chat/transformation.py): refactor streaming to use same usage calculation block as non-streaming
reduce errors
* fix(bedrock/chat/converse_transformation.py): don't increment prompt tokens with cache_creation_input_tokens
* build: remove redisvl from requirements.txt (temporary)
* fix(spend_tracking_utils.py): handle circular references
* test: update code cov test
* test: update test
2025-04-09 21:26:43 -07:00
Ishaan Jaff
00c5c23d97
docs Microsoft Entra ID SSO group assignment
2025-04-09 21:07:47 -07:00
Ishaan Jaff
aed8d4ce21
bump: version 1.65.4 → 1.65.5
Read Version from pyproject.toml / read-version (push) Successful in 14s
Helm unit test / unit-test (push) Successful in 23s
2025-04-09 20:34:51 -07:00
Ishaan Jaff
1359e6d7a6
[SSO] Connect LiteLLM to Azure Entra ID Enterprise Application ( #9872 )
...
* refactor SSO handler
* render sso JWT on ui
* docs debug sso
* fix sso login flow use await
* fix ui sso debug JWT
* test ui sso
* remove redis vl
* fix redisvl==0.5.1
* fix ml dtypes
* fix redisvl
* fix redis vl
* fix debug_sso_callback
* fix linting error
* fix redis semantic caching dep
* working graph api assignment
* test msft sso handler openid
* testing for msft group assignment
* fix debug graph api sso flow
* fix linting errors
* add_user_to_teams_from_sso_response
* ui sso fix team assignments
* linting fix _get_group_ids_from_graph_api_response
* add MicrosoftServicePrincipalTeam
* create_litellm_teams_from_service_principal_team_ids
* create_litellm_teams_from_service_principal_team_ids
* docs MICROSOFT_SERVICE_PRINCIPAL_ID
* fix linting errors
2025-04-09 20:26:59 -07:00
Krish Dholakia
ac4f32fb1e
Cost tracking for gemini-2.5-pro
( #9837 )
...
* build(model_prices_and_context_window.json): add google/gemini-2.0-flash-lite-001 versioned pricing
Closes https://github.com/BerriAI/litellm/issues/9829
* build(model_prices_and_context_window.json): add initial support for 'supported_output_modalities' param
* build(model_prices_and_context_window.json): add initial support for 'supported_output_modalities' param
* build(model_prices_and_context_window.json): add supported endpoints to gemini-2.5-pro
* build(model_prices_and_context_window.json): add gemini 200k+ pricing
* feat(utils.py): support cost calculation for gemini-2.5-pro above 200k tokens
Fixes https://github.com/BerriAI/litellm/issues/9807
* build: test dockerfile change
* build: revert apk change
* ci(config.yml): pip install wheel
* ci: test problematic package first
* ci(config.yml): pip install only binary
* ci: try more things
* ci: test different ml_dtypes version
* ci(config.yml): check ml_dtypes==0.4.0
* ci: test
* ci: cleanup config.yml
* ci: specify ml dtypes in requirements.txt
* ci: remove redisvl depedency (temporary)
* fix: fix linting errors
* test: update test
* test: fix test
2025-04-09 18:48:43 -07:00
Ishaan Jaff
4c1bb74c3d
[Feat] - SSO - Use MSFT Graph API to assign users to teams ( #9865 )
...
* refactor SSO handler
* render sso JWT on ui
* docs debug sso
* fix sso login flow use await
* fix ui sso debug JWT
* test ui sso
* remove redis vl
* fix redisvl==0.5.1
* fix ml dtypes
* fix redisvl
* fix redis vl
* fix debug_sso_callback
* fix linting error
* fix redis semantic caching dep
* working graph api assignment
* test msft sso handler openid
* testing for msft group assignment
* fix debug graph api sso flow
* fix linting errors
* add_user_to_teams_from_sso_response
* fix linting error
2025-04-09 18:26:43 -07:00
Krrish Dholakia
a1433da4a7
fix: transform_request.tsx
...
don't hardcode to localhost
2025-04-09 17:50:13 -07:00
Krrish Dholakia
86bfb8cd66
Revert "docs: initial commit adding api playground to docs"
...
This reverts commit 9d68008152
.
2025-04-09 17:50:13 -07:00
Krrish Dholakia
5ca93a1950
docs: initial commit adding api playground to docs
...
makes it easy to see how litellm transforms your request
2025-04-09 17:50:13 -07:00
Krrish Dholakia
3f3afabda9
feat(leftnav.tsx): show api playground on UI
...
allows easy testing on UI
2025-04-09 17:50:13 -07:00
Krrish Dholakia
b11c08bde3
fix(new_usage.tsx): increase page size + iterate through all pages if multiple pages
2025-04-09 17:50:13 -07:00
Krrish Dholakia
9ec1972926
fix(internal_user_endpoints.py): increase default page size for /user/daily/activity
2025-04-09 17:50:13 -07:00