Ishaan Jaff
c2db33e457
Merge branch 'main' into litellm_docs_msft_entra_id
2025-04-10 20:07:27 -07:00
Ishaan Jaff
6d66e2ebf1
test_default_team_params
2025-04-10 20:04:42 -07:00
Ishaan Jaff
2162e092a5
docs litellm default team settings
2025-04-10 17:56:36 -07:00
Ishaan Jaff
019c179ab8
bug fix default team setting
2025-04-10 17:54:23 -07:00
Ishaan Jaff
e839cb1bdf
docs msft auto assignment
2025-04-10 17:43:55 -07:00
Ishaan Jaff
72a12e91c4
[Bug Fix MSFT SSO] Use correct field for user email when using MSFT SSO ( #9886 )
...
* fix openid_from_response
* test_microsoft_sso_handler_openid_from_response_user_principal_name
* test upsert_sso_user
2025-04-10 17:40:58 -07:00
Ishaan Jaff
eb9735000e
member, team assignment on litellm
2025-04-10 17:35:44 -07:00
Ishaan Jaff
284747c444
litellm MSFT sso
2025-04-10 17:25:26 -07:00
Ishaan Jaff
92205cad45
commit litellm docs msft group assignment
2025-04-10 17:14:14 -07:00
Ishaan Jaff
94a553dbb2
[Feat] Emit Key, Team Budget metrics on a cron job schedule ( #9528 )
...
* _initialize_remaining_budget_metrics
* initialize_budget_metrics_cron_job
* initialize_budget_metrics_cron_job
* initialize_budget_metrics_cron_job
* test_initialize_budget_metrics_cron_job
* LITELLM_PROXY_ADMIN_NAME
* fix code qa checks
* test_initialize_budget_metrics_cron_job
* test_initialize_budget_metrics_cron_job
* pod lock manager allow dynamic cron job ID
* fix pod lock manager
* require cronjobid for PodLockManager
* fix DB_SPEND_UPDATE_JOB_NAME acquire / release lock
* add comment on prometheus logger
* add debug statements for emitting key, team budget metrics
* test_pod_lock_manager.py
* test_initialize_budget_metrics_cron_job
* initialize_budget_metrics_cron_job
* initialize_remaining_budget_metrics
* remove outdated test
2025-04-10 16:59:14 -07:00
Ishaan Jaff
90d862b041
[Feat SSO] - Allow admins to set default_team_params
to have default params for when litellm SSO creates default teams ( #9895 )
...
* add default_team_params as a config.yaml setting
* create_litellm_team_from_sso_group
* test_default_team_params
* test_create_team_without_default_params
* docs default team settings
2025-04-10 16:58:28 -07:00
Ishaan Jaff
e4bf7eff82
docs msft entra id tutorial
2025-04-10 16:50:20 -07:00
Ishaan Jaff
51043bf66b
docs default team settings
2025-04-10 16:32:28 -07:00
Ishaan Jaff
d177943d00
test_create_team_without_default_params
2025-04-10 16:15:26 -07:00
Ishaan Jaff
d4d75397c6
test_default_team_params
2025-04-10 16:08:41 -07:00
Ishaan Jaff
2345a5e392
create_litellm_team_from_sso_group
2025-04-10 15:58:11 -07:00
Ishaan Jaff
02f8690735
add default_team_params as a config.yaml setting
2025-04-10 15:12:17 -07:00
Krrish Dholakia
7d383fc0c1
test: update testing
2025-04-10 14:15:58 -07:00
Krrish Dholakia
b168f8b744
test: update test
2025-04-10 14:04:57 -07:00
Krrish Dholakia
cd878bdd71
bump: version 1.65.5 → 1.65.6
Read Version from pyproject.toml / read-version (push) Successful in 17s
Helm unit test / unit-test (push) Successful in 24s
2025-04-09 22:32:50 -07:00
Krish Dholakia
0dbd663877
fix(cost_calculator.py): handle custom pricing at deployment level fo… ( #9855 )
...
* fix(cost_calculator.py): handle custom pricing at deployment level for router
* test: add unit tests
* fix(router.py): show custom pricing on UI
check correct model str
* fix: fix linting error
* docs(custom_pricing.md): clarify custom pricing for proxy
Fixes https://github.com/BerriAI/litellm/issues/8573#issuecomment-2790420740
* test: update code qa test
* fix: cleanup traceback
* fix: handle litellm param custom pricing
* test: update test
* fix(cost_calculator.py): add router model id to list of potential model names
* fix(cost_calculator.py): fix router model id check
* fix: router.py - maintain older model registry approach
* fix: fix ruff check
* fix(router.py): router get deployment info
add custom values to mapped dict
* test: update test
* fix(utils.py): update only if value is non-null
* test: add unit test
2025-04-09 22:13:10 -07:00
Krish Dholakia
0c5b4aa96d
feat(realtime/): add token tracking + log usage object in spend logs … ( #9843 )
...
* feat(realtime/): add token tracking + log usage object in spend logs metadata
* test: fix test
* test: update tests
* test: update testing
* test: update test
* test: update test
* test: update test
* test: update test
* test: update tesdt
* test: update test
2025-04-09 22:11:00 -07:00
Krish Dholakia
87733c8193
Fix anthropic prompt caching cost calc + trim logged message in db ( #9838 )
...
* fix(spend_tracking_utils.py): prevent logging entire mp4 files to db
Fixes https://github.com/BerriAI/litellm/issues/9732
* fix(anthropic/chat/transformation.py): Fix double counting cache creation input tokens
Fixes https://github.com/BerriAI/litellm/issues/9812
* refactor(anthropic/chat/transformation.py): refactor streaming to use same usage calculation block as non-streaming
reduce errors
* fix(bedrock/chat/converse_transformation.py): don't increment prompt tokens with cache_creation_input_tokens
* build: remove redisvl from requirements.txt (temporary)
* fix(spend_tracking_utils.py): handle circular references
* test: update code cov test
* test: update test
2025-04-09 21:26:43 -07:00
Ishaan Jaff
00c5c23d97
docs Microsoft Entra ID SSO group assignment
2025-04-09 21:07:47 -07:00
Ishaan Jaff
aed8d4ce21
bump: version 1.65.4 → 1.65.5
Read Version from pyproject.toml / read-version (push) Successful in 14s
Helm unit test / unit-test (push) Successful in 23s
2025-04-09 20:34:51 -07:00
Ishaan Jaff
1359e6d7a6
[SSO] Connect LiteLLM to Azure Entra ID Enterprise Application ( #9872 )
...
* refactor SSO handler
* render sso JWT on ui
* docs debug sso
* fix sso login flow use await
* fix ui sso debug JWT
* test ui sso
* remove redis vl
* fix redisvl==0.5.1
* fix ml dtypes
* fix redisvl
* fix redis vl
* fix debug_sso_callback
* fix linting error
* fix redis semantic caching dep
* working graph api assignment
* test msft sso handler openid
* testing for msft group assignment
* fix debug graph api sso flow
* fix linting errors
* add_user_to_teams_from_sso_response
* ui sso fix team assignments
* linting fix _get_group_ids_from_graph_api_response
* add MicrosoftServicePrincipalTeam
* create_litellm_teams_from_service_principal_team_ids
* create_litellm_teams_from_service_principal_team_ids
* docs MICROSOFT_SERVICE_PRINCIPAL_ID
* fix linting errors
2025-04-09 20:26:59 -07:00
Krish Dholakia
ac4f32fb1e
Cost tracking for gemini-2.5-pro
( #9837 )
...
* build(model_prices_and_context_window.json): add google/gemini-2.0-flash-lite-001 versioned pricing
Closes https://github.com/BerriAI/litellm/issues/9829
* build(model_prices_and_context_window.json): add initial support for 'supported_output_modalities' param
* build(model_prices_and_context_window.json): add initial support for 'supported_output_modalities' param
* build(model_prices_and_context_window.json): add supported endpoints to gemini-2.5-pro
* build(model_prices_and_context_window.json): add gemini 200k+ pricing
* feat(utils.py): support cost calculation for gemini-2.5-pro above 200k tokens
Fixes https://github.com/BerriAI/litellm/issues/9807
* build: test dockerfile change
* build: revert apk change
* ci(config.yml): pip install wheel
* ci: test problematic package first
* ci(config.yml): pip install only binary
* ci: try more things
* ci: test different ml_dtypes version
* ci(config.yml): check ml_dtypes==0.4.0
* ci: test
* ci: cleanup config.yml
* ci: specify ml dtypes in requirements.txt
* ci: remove redisvl depedency (temporary)
* fix: fix linting errors
* test: update test
* test: fix test
2025-04-09 18:48:43 -07:00
Ishaan Jaff
4c1bb74c3d
[Feat] - SSO - Use MSFT Graph API to assign users to teams ( #9865 )
...
* refactor SSO handler
* render sso JWT on ui
* docs debug sso
* fix sso login flow use await
* fix ui sso debug JWT
* test ui sso
* remove redis vl
* fix redisvl==0.5.1
* fix ml dtypes
* fix redisvl
* fix redis vl
* fix debug_sso_callback
* fix linting error
* fix redis semantic caching dep
* working graph api assignment
* test msft sso handler openid
* testing for msft group assignment
* fix debug graph api sso flow
* fix linting errors
* add_user_to_teams_from_sso_response
* fix linting error
2025-04-09 18:26:43 -07:00
Krrish Dholakia
a1433da4a7
fix: transform_request.tsx
...
don't hardcode to localhost
2025-04-09 17:50:13 -07:00
Krrish Dholakia
86bfb8cd66
Revert "docs: initial commit adding api playground to docs"
...
This reverts commit 9d68008152
.
2025-04-09 17:50:13 -07:00
Krrish Dholakia
5ca93a1950
docs: initial commit adding api playground to docs
...
makes it easy to see how litellm transforms your request
2025-04-09 17:50:13 -07:00
Krrish Dholakia
3f3afabda9
feat(leftnav.tsx): show api playground on UI
...
allows easy testing on UI
2025-04-09 17:50:13 -07:00
Krrish Dholakia
b11c08bde3
fix(new_usage.tsx): increase page size + iterate through all pages if multiple pages
2025-04-09 17:50:13 -07:00
Krrish Dholakia
9ec1972926
fix(internal_user_endpoints.py): increase default page size for /user/daily/activity
2025-04-09 17:50:13 -07:00
Ishaan Jaff
6f7e9b9728
[Feat SSO] Debug route - allow admins to debug SSO JWT fields ( #9835 )
...
* refactor SSO handler
* render sso JWT on ui
* docs debug sso
* fix sso login flow use await
* fix ui sso debug JWT
* test ui sso
* remove redis vl
* fix redisvl==0.5.1
* fix ml dtypes
* fix redisvl
* fix redis vl
* fix debug_sso_callback
* fix linting error
* fix redis semantic caching dep
2025-04-09 15:29:35 -07:00
Ishaan Jaff
08a3620414
[Bug Fix] Add support for UploadFile on LLM Pass through endpoints (OpenAI, Azure etc) ( #9853 )
...
* http passthrough file handling
* fix make_multipart_http_request
* test_pass_through_file_operations
* unit tests for file handling
2025-04-09 15:29:20 -07:00
Krish Dholakia
6ba3c4a4f8
VertexAI non-jsonl file storage support ( #9781 )
...
* test: add initial e2e test
* fix(vertex_ai/files): initial commit adding sync file create support
* refactor: initial commit of vertex ai non-jsonl files reaching gcp endpoint
* fix(vertex_ai/files/transformation.py): initial working commit of non-jsonl file call reaching backend endpoint
* fix(vertex_ai/files/transformation.py): working e2e non-jsonl file upload
* test: working e2e jsonl call
* test: unit testing for jsonl file creation
* fix(vertex_ai/transformation.py): reset file pointer after read
allow multiple reads on same file object
* fix: fix linting errors
* fix: fix ruff linting errors
* fix: fix import
* fix: fix linting error
* fix: fix linting error
* fix(vertex_ai/files/transformation.py): fix linting error
* test: update test
* test: update tests
* fix: fix linting errors
* fix: fix test
* fix: fix linting error
2025-04-09 14:01:48 -07:00
qvalentin
93532e00db
feat: add enterpriseWebSearch tool for vertex-ai ( #9856 )
2025-04-09 13:17:48 -07:00
Emerson Gomes
d5e362459c
Update Azure Phi-4 pricing ( #9862 )
...
Updates Phi-4 family model prices with recently published info
2025-04-09 13:17:00 -07:00
Jacob Hagstedt P Suorra
dc9bfae053
Add user alias to API endpoint ( #9859 )
...
Co-authored-by: Jacob Hagstedt <wcgs@novonordisk.com>
2025-04-09 13:16:35 -07:00
Christian Owusu
d4e5da87be
Reflect key and team update in UI ( #9825 )
...
Read Version from pyproject.toml / read-version (push) Successful in 16s
Helm unit test / unit-test (push) Successful in 23s
* Reflect updates to keys in UI instantly
* Reflect updates to teams in UI instantly
2025-04-09 07:47:16 -07:00
Marcus Hynfield
cc7d59a11e
Add service annotations to litellm-helm chart ( #9840 )
Read Version from pyproject.toml / read-version (push) Successful in 15s
Helm unit test / unit-test (push) Successful in 22s
2025-04-08 21:42:09 -07:00
Ishaan Jaff
357f081d1c
fix mldtypes dep
Read Version from pyproject.toml / read-version (push) Successful in 14s
Helm unit test / unit-test (push) Successful in 23s
2025-04-08 17:25:28 -07:00
Ishaan Jaff
9f33e9b3e8
pin ml-dtypes==0.4.0
2025-04-08 17:24:08 -07:00
Li Yang
11389535d5
chore: fix haiku cache read pricing per token ( #9834 )
2025-04-08 16:43:09 -07:00
Krrish Dholakia
a3ea079583
docs(gemini.md): show how to call google search via litellm
...
Addresses https://github.com/BerriAI/litellm/issues/361#issuecomment-2787497217
2025-04-08 16:41:24 -07:00
Ishaan Jaff
c403dfb615
pip install --upgrade pip wheel setuptools
2025-04-08 16:38:44 -07:00
Ishaan Jaff
8a596dbe8c
pip install wheel
2025-04-08 16:27:09 -07:00
dependabot[bot]
73356b3a9f
Bump next from 14.2.25 to 14.2.26 in /ui/litellm-dashboard ( #9716 )
...
Bumps [next](https://github.com/vercel/next.js ) from 14.2.25 to 14.2.26.
- [Release notes](https://github.com/vercel/next.js/releases )
- [Changelog](https://github.com/vercel/next.js/blob/canary/release.js )
- [Commits](https://github.com/vercel/next.js/compare/v14.2.25...v14.2.26 )
---
updated-dependencies:
- dependency-name: next
dependency-version: 14.2.26
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-04-08 14:15:19 -07:00
Ishaan Jaff
441c7275ed
test fix post call rules ( #9826 )
2025-04-08 13:55:37 -07:00