Ishaan Jaff
5a722ef18f
Merge pull request #9715 from BerriAI/litellm_refactor_pod_lock_manager
[Reliability Fix] - Use Redis for PodLock Manager instead of PG (ensures no deadlocks occur)
2025-04-02 21:15:02 -07:00
Ishaan Jaff
0155b9f212
Merge branch 'main' into litellm_refactor_pod_lock_manager
2025-04-02 21:05:18 -07:00
Ishaan Jaff
5222cce510
Merge branch 'main' into litellm_metrics_pod_lock_manager
2025-04-02 21:04:44 -07:00
Ishaan Jaff
20d84ddef1
get_base_image_generation_call_args
2025-04-02 21:04:06 -07:00
Ishaan Jaff
da97fefc1f
Merge pull request #9694 from BerriAI/litellm_fix_azure_o_series
[Bug fix] Azure o-series tool calling
2025-04-02 20:59:00 -07:00
Ishaan Jaff
acf920a41a
Merge branch 'main' into litellm_fix_azure_o_series
2025-04-02 20:58:52 -07:00
Ishaan Jaff
3eb6c1f2f7
Merge pull request #9708 from BerriAI/dependabot/npm_and_yarn/docs/my-website/image-size-1.2.1
Bump image-size from 1.1.1 to 1.2.1 in /docs/my-website
2025-04-02 20:58:16 -07:00
Ishaan Jaff
c3341a1e18
test fixes - azure deprecated dall-e-2
2025-04-02 20:56:20 -07:00
Ishaan Jaff
74550df197
get_base_image_generation_call_args
2025-04-02 20:52:16 -07:00
Ishaan Jaff
4ed0ab5b1c
Revert "remove google dns for img tests"
This reverts commit d3fc8b563c.
2025-04-02 20:42:29 -07:00
Ishaan Jaff
d3fc8b563c
remove google dns for img tests
2025-04-02 20:34:47 -07:00
Ishaan Jaff
e1c444ad07
Merge pull request #9714 from BerriAI/litellm_docs_new_architecture
[Docs] High Availability Setup (Resolve DB Deadlocks)
2025-04-02 20:28:50 -07:00
Ishaan Jaff
c4e8b9607d
fix async_set_cache
2025-04-02 18:54:51 -07:00
Ishaan Jaff
07215e3f7a
fix async_set_cache
2025-04-02 18:51:41 -07:00
Ishaan Jaff
80fb4ece97
prom emit size of DB TX queues for observability
2025-04-02 18:39:29 -07:00
Ishaan Jaff
7b768ed909
doc fix sso login url
2025-04-02 18:38:33 -07:00
Ishaan Jaff
3256b6af6c
track service types on prom services
2025-04-02 18:03:09 -07:00
Ishaan Jaff
05b30e28db
clean up service metrics
2025-04-02 17:50:41 -07:00
Ishaan Jaff
73bbd0a446
emit lock acquired and released events
2025-04-02 17:40:25 -07:00
Ishaan Jaff
e09ef4afc7
use service logger for tracking pod lock status
2025-04-02 17:39:48 -07:00
Ishaan Jaff
8405fcb748
test pod lock manager
2025-04-02 15:06:31 -07:00
Ishaan Jaff
8b12a2e5dc
fix pod lock manager
2025-04-02 14:52:55 -07:00
Ishaan Jaff
a64631edfb
test pod lock manager
2025-04-02 14:39:40 -07:00
Ishaan Jaff
2e939a21b3
refactor pod lock manager to use redis
2025-04-02 14:37:39 -07:00
Ishaan Jaff
68ce0b111e
Setup on LiteLLM config
2025-04-02 13:41:16 -07:00
Ishaan Jaff
6ab1eba7b6
doc High Availability Setup
2025-04-02 13:38:49 -07:00
Ishaan Jaff
b48b8366c2
docs new deadlock fixing architecture
2025-04-02 13:24:53 -07:00
Ishaan Jaff
d4a20d4fb8
test azure o series
2025-04-02 09:46:45 -07:00
Ishaan Jaff
83e4c34e0a
test fix get_base_completion_call_args
2025-04-02 09:18:56 -07:00
Ishaan Jaff
3f52a4df32
docs allowed openai params
2025-04-02 09:08:11 -07:00
Krish Dholakia
053b0e741f
Add Google AI Studio `/v1/files` upload API support (#9645)
* test: fix import for test
* fix: fix bad error string
* docs: cleanup files docs
* fix(files/main.py): cleanup error string
* style: initial commit with a provider/config pattern for files api
google ai studio files api onboarding
* fix: test
* feat(gemini/files/transformation.py): support gemini files api response transformation
* fix(gemini/files/transformation.py): return file id as gemini uri
allows id to be passed in to chat completion request, just like openai
* feat(llm_http_handler.py): support async route for files api on llm_http_handler
* fix: fix linting errors
* fix: fix model info check
* fix: fix ruff errors
* fix: fix linting errors
* Revert "fix: fix linting errors"
This reverts commit 926a5a527f.
* fix: fix linting errors
* test: fix test
* test: fix tests
2025-04-02 08:56:58 -07:00
Krish Dholakia
453003c378
fix(gemini/): add gemini/ route optional param mapping support (#9677)
Fixes https://github.com/BerriAI/litellm/issues/9654
2025-04-02 08:56:32 -07:00
Krrish Dholakia
6e8d9860a5
ci(config.yml): remove `-s` from large tests
easier to track test progress
2025-04-02 08:50:47 -07:00
dependabot[bot]
7255c8e94a
Bump image-size from 1.1.1 to 1.2.1 in /docs/my-website
Bumps [image-size](https://github.com/image-size/image-size) from 1.1.1 to 1.2.1.
- [Release notes](https://github.com/image-size/image-size/releases)
- [Commits](https://github.com/image-size/image-size/compare/v1.1.1...v1.2.1)
---
updated-dependencies:
- dependency-name: image-size
  dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
2025-04-02 15:10:45 +00:00
Krrish Dholakia
d32cf141f5
docs: update docs
2025-04-02 07:58:45 -07:00
Pranav Simha
2e35f07e94
Add support for max_completion_tokens to the Cohere chat transformation config (#9701)
2025-04-02 07:50:44 -07:00
Ishaan Jaff
443b8ab93a
test_azure_o1_series_response_format_extra_params
2025-04-02 07:01:08 -07:00
Ishaan Jaff
58b4e4b206
add AzureOpenAIO1Config for tools
2025-04-02 06:55:03 -07:00
Ishaan Jaff
9e7c67805b
get_supported_openai_params
2025-04-02 06:52:07 -07:00
Ishaan Jaff
8f372ea243
test_completion_invalid_param_cohere
2025-04-02 06:49:11 -07:00
Krish Dholakia
6c69ad4c89
fix(model_management_endpoints.py): fix allowing team admins to update team models (#9697)
* fix(model_management_endpoints.py): fix allowing team admins to update their models
* test(test_models.py): add e2e test for team model flow
ensure team admin can always add / edit / delete team models
2025-04-01 22:28:15 -07:00
Krish Dholakia
3d0313b15b
Litellm user daily activity allow non admin usage (#9695)
* feat(internal_user_endpoints.py): allow non-admin to view their own usage via `/user/daily/activity` route
* fix(leftnav.tsx): allow users to view their own usage via new_usage.tsx
allows internal users to see their usage via new api
Handles 1m+ spend logs scenario
* fix(leftnav.tsx): allow all users to see new usage tab
2025-04-01 22:27:26 -07:00
Krish Dholakia
23051d89dd
fix(streaming_handler.py): fix completion start time tracking (#9688)
* fix(streaming_handler.py): fix completion start time tracking
Fixes https://github.com/BerriAI/litellm/issues/9210
* feat(anthropic/chat/transformation.py): map openai 'reasoning_effort' to anthropic 'thinking' param
Fixes https://github.com/BerriAI/litellm/issues/9022
* feat: map 'reasoning_effort' to 'thinking' param across bedrock + vertex
Closes https://github.com/BerriAI/litellm/issues/9022#issuecomment-2705260808
2025-04-01 22:00:56 -07:00
Tomer Bin
0690f7a3cb
Virtual key based policies in Aim Guardrails (#9499)
* report key alias to aim
* send litellm version to aim
* Update docs
* blacken
* add docs
* Add info part about virtual keys specific guards
* sort guardrails alphabetically
* fix ruff
2025-04-01 21:57:23 -07:00
Ishaan Jaff
63dd2934b7
test_supports_tool_choice
2025-04-01 21:43:46 -07:00
Ishaan Jaff
4b99f833bb
test_cohere_request_body_with_allowed_params
2025-04-01 21:30:24 -07:00
Ishaan Jaff
4080fe54d5
clean up o series
2025-04-01 21:21:41 -07:00
Ishaan Jaff
5f286fe147
fix _check_valid_arg
2025-04-01 21:20:31 -07:00
Ishaan Jaff
f7129e5e59
fix _apply_openai_param_overrides
2025-04-01 21:17:59 -07:00
Ishaan Jaff
ac3399238e
Merge pull request #9690 from BerriAI/litellm_fixes_for_deadlock_round_2
[Reliability Fixes] - Ensure no deadlocks occur when updating `DailyUserSpendTransaction`
2025-04-01 20:59:41 -07:00