Ishaan Jaff
df51d8bcfa
Merge branch 'main' into litellm_metrics_pod_lock_manager
2025-04-04 21:11:39 -07:00
Ishaan Jaff
fc4c453cb9
test_no_auth_metrics_when_disabled
2025-04-04 21:02:29 -07:00
Krrish Dholakia
7cd7bdbd0f
build: fix model cost map
2025-04-04 20:48:29 -07:00
Krrish Dholakia
5826108c9a
build: bump
2025-04-04 20:45:27 -07:00
caramulrooney
3e9066e91d
Update model_prices_and_context_window.json ( #9620 )
Add watsonx/ibm/granite-3-8b-instruct
2025-04-04 20:44:06 -07:00
Hugo Liu
08f9e1447b
fix(asr-groq): add groq whisper models to model cost map ( #9648 )
Co-authored-by: liuhu <liuhu@huami.com>
2025-04-04 20:43:46 -07:00
Chaos Yu
001043ba05
make sure metadata available and have a value ( #9764 )
2025-04-04 20:39:12 -07:00
Ishaan Jaff
eaad3b2402
PrometheusAuthMiddleware
2025-04-04 20:37:53 -07:00
Krish Dholakia
af42e5855f
Gemini image generation output support ( #9646 )
* fix(gemini/transformation.py): make GET request to get uri details, if cannot be inferred
* fix: fix linting errors
* Revert "fix: fix linting errors"
This reverts commit 926a5a527f.
* fix(gemini/transformation.py): modalities param support
Partially resolves https://github.com/BerriAI/litellm/issues/9237
* feat(google_ai_studio/): add image generation support
Closes https://github.com/BerriAI/litellm/issues/9237
* fix: fix types
* fix: fix ruff check
2025-04-04 20:37:48 -07:00
Ishaan Jaff
86b473d267
allow adding auth on /metrics endpoint
2025-04-04 20:37:17 -07:00
Krish Dholakia
90a4dfab3c
fix(xai/chat/transformation.py): filter out 'name' param for xai non-… ( #9761 )
* fix(xai/chat/transformation.py): filter out 'name' param for xai non-user roles
Fixes https://github.com/BerriAI/litellm/issues/9720
* test fix test_hf_chat_template
---------
Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
2025-04-04 20:37:08 -07:00
Krish Dholakia
d66db2207b
Allow team members to see team models ( #9742 )
* fix(proxy_server.py): allow team member to see team models
* fix(model_dashboard.tsx): show edit + delete icons to be disabled if user is not admin and did not create models
* fix(proxy_server.py): fix ruff function size error
* fix(proxy_server.py): fix user model filter check
2025-04-04 20:36:48 -07:00
Ishaan Jaff
96ce5dbf7d
_should_run_auth_on_metrics_endpoint
2025-04-04 20:32:04 -07:00
Ishaan Jaff
c7523818b4
PrometheusAuthMiddleware
2025-04-04 20:27:17 -07:00
Krrish Dholakia
b5851769fc
fix: fix import
2025-04-04 20:26:11 -07:00
Krrish Dholakia
6395bd8d65
test: mark flaky test
2025-04-04 20:25:05 -07:00
Ishaan Jaff
f16c531002
_mount_metrics_endpoint
2025-04-04 19:54:20 -07:00
Krish Dholakia
c555c15ad7
fix(router.py): support reusable credentials via passthrough router ( #9758 )
* fix(router.py): support reusable credentials via passthrough router
enables reusable vertex credentials to be used in passthrough
* test: fix test
* test(test_router_adding_deployments.py): add unit testing
2025-04-04 18:40:14 -07:00
Ishaan Jaff
253060cb09
allow requiring auth for /metrics endpoint
2025-04-04 17:35:02 -07:00
Ishaan Jaff
8d76da03fe
Merge pull request #9759 from BerriAI/litellm_reliability_fix_db_txs
[Reliability] v2 DB Deadlock Reduction Architecture – Add Max Size for In-Memory Queue + Backpressure Mechanism
2025-04-04 17:12:20 -07:00
Ishaan Jaff
c402db9057
prometheus emit llm provider on failure metric
2025-04-04 17:07:43 -07:00
Ishaan Jaff
150e77cd7d
Merge branch 'main' into litellm_reliability_fix_db_txs
2025-04-04 16:46:46 -07:00
Ishaan Jaff
d3018a4c28
Merge branch 'main' into litellm_metrics_pod_lock_manager
2025-04-04 16:46:32 -07:00
Ishaan Jaff
5c2bc796b1
test fix test_hf_chat_template
2025-04-04 16:45:20 -07:00
Ishaan Jaff
901d6fe7b7
add operational metrics for pod lock manager v2 arch
2025-04-04 16:41:07 -07:00
Krish Dholakia
e1f7bcb47d
Fix VertexAI Credential Caching issue ( #9756 )
* refactor(vertex_llm_base.py): Prevent credential misrouting for projects
Fixes https://github.com/BerriAI/litellm/issues/7904
* fix: passing unit tests
* fix(vertex_llm_base.py): common auth logic across sync + async vertex ai calls
prevents credential caching issue across both flows
* test: fix test
* fix(vertex_llm_base.py): handle project id in default cause
* fix(factory.py): don't pass cache control if not set
bedrock invoke does not support this
* test: fix test
* fix(vertex_llm_base.py): add .exception message in load_auth
* fix: fix ruff error
2025-04-04 16:38:08 -07:00
Ishaan Jaff
bde88b3ba6
fix type error
2025-04-04 16:34:43 -07:00
Ishaan Jaff
1cdee4b331
Merge branch 'main' into litellm_metrics_pod_lock_manager
2025-04-04 16:33:16 -07:00
Ishaan Jaff
decb6649ec
test_queue_flush_limit
2025-04-04 16:29:06 -07:00
Ishaan Jaff
e77a178a37
test_queue_size_reduction_with_large_volume
2025-04-04 16:21:29 -07:00
Ishaan Jaff
dc063fdfec
test_queue_size_reduction_with_large_volume
2025-04-04 15:59:35 -07:00
Ishaan Jaff
5bed0b7557
aggregated values
2025-04-04 15:55:14 -07:00
Ishaan Jaff
eb48cbdec6
aggregate_queue_updates
2025-04-04 15:54:07 -07:00
Ishaan Jaff
cdd351a03b
Merge pull request #9745 from BerriAI/litellm_sso_fixes_dev
[Feat] Allow assigning SSO users to teams on MSFT SSO
2025-04-04 15:40:19 -07:00
Ishaan Jaff
888446256c
fix vertex failing test
2025-04-04 15:37:48 -07:00
Ishaan Jaff
93068cb142
flush_all_updates_from_in_memory_queue
2025-04-04 15:34:56 -07:00
Ishaan Jaff
065477abb4
add _get_aggregated_spend_update_queue_item
2025-04-04 15:32:27 -07:00
Ishaan Jaff
9abaefea62
add logic for max size in memory queue
2025-04-04 15:31:40 -07:00
Ishaan Jaff
363fb0c46f
add MAX_SIZE_IN_MEMORY_QUEUE
2025-04-04 15:31:09 -07:00
Ishaan Jaff
3374c54ba2
add MAX_SIZE_IN_MEMORY_QUEUE constant
2025-04-04 15:30:53 -07:00
Ishaan Jaff
12b3923579
update circle ci requirements
2025-04-04 14:45:14 -07:00
Ishaan Jaff
cba1dacc7d
ui new build
2025-04-04 14:39:55 -07:00
Krrish Dholakia
ad90871ad6
fix(factory.py): don't pass cache control if not set
bedrock invoke does not support this
2025-04-04 12:37:34 -07:00
Ishaan Jaff
e67d16d5bd
Merge pull request #9748 from BerriAI/litellm_ui_allow_testing_image_endpoints
[Feat] UI - Test Key v2 page - allow testing image endpoints + polish the page
2025-04-03 22:39:45 -07:00
Ishaan Jaff
c8468b71c8
fix linting ui
2025-04-03 22:32:56 -07:00
Ishaan Jaff
f6c2b86903
fix typo
2025-04-03 22:21:11 -07:00
Adrian Lyjak
d640bc0a00
fix #8425, passthrough kwargs during acompletion, and unwrap extra_body for openrouter ( #9747 )
2025-04-03 22:19:40 -07:00
Ishaan Jaff
0745f306c7
test_microsoft_sso_handler_with_empty_response
2025-04-03 22:17:06 -07:00
Ishaan Jaff
984114adf0
fix sso callback
2025-04-03 22:13:46 -07:00
Krrish Dholakia
bdad9961e3
docs: cleanup
2025-04-03 22:12:51 -07:00