Ishaan Jaff
475dfaa156
fixes for web search cost tracking
2025-03-22 16:56:32 -07:00
Ishaan Jaff
1910ed6027
WebSearchOptions
2025-03-22 15:39:04 -07:00
Ishaan Jaff
e19b82f202
add WebSearchOptions as supported chat completion param
2025-03-22 15:37:34 -07:00
Ishaan Jaff
cf22d31b2b
search_context_cost_per_query
2025-03-22 14:52:58 -07:00
Ishaan Jaff
7dd37a5b18
fix supports_web_search
2025-03-22 14:02:51 -07:00
Ishaan Jaff
44f4c623e2
Add annotations to the delta
2025-03-22 11:38:30 -07:00
Ishaan Jaff
dc5bf10009
feat - add openai web search
2025-03-22 10:43:35 -07:00
Krrish Dholakia
8ef9129556
fix(types/utils.py): support openai 'file' message type
...
Closes https://github.com/BerriAI/litellm/issues/9365
2025-03-19 23:13:51 -07:00
Ishaan Jaff
57e5c94360
Merge branch 'main' into litellm_arize_dynamic_logging
2025-03-18 22:13:35 -07:00
Krrish Dholakia
39ac9e3eca
fix(lowest_tpm_rpm_v2.py): fix updating limits
2025-03-18 17:10:17 -07:00
Ishaan Jaff
bd122f631e
fix arize config
2025-03-18 16:54:31 -07:00
Ishaan Jaff
57d08531a1
add arize_api_key to StandardCallbackDynamicParams
2025-03-18 14:46:55 -07:00
Krrish Dholakia
057c774c14
fix(http_handler.py): fix typing error
2025-03-17 16:42:32 -07:00
Krrish Dholakia
f089b1e23f
feat(endpoints.py): support adding credentials by model id
...
Allows user to reuse existing model credentials
2025-03-14 12:32:32 -07:00
Krrish Dholakia
605a4d1121
feat(endpoints.py): enable retrieving existing credentials by model name
...
Enables reusing existing credentials
2025-03-14 12:02:50 -07:00
Ishaan Jaff
241a36a74f
Merge pull request #9222 from BerriAI/litellm_snowflake_pr_mar_13
...
[Feat] Add Snowflake Cortex to LiteLLM
2025-03-13 21:35:39 -07:00
Krrish Dholakia
8a6e4715aa
feat(converse_transformation.py): fix type for bedrock cache usage block
2025-03-13 19:33:22 -07:00
Sunny Wan
f9a5109203
Merge branch 'BerriAI:main' into main
2025-03-13 19:37:22 -04:00
Krrish Dholakia
f99b1937db
feat(converse_transformation.py): translate converse usage block with cache creation values to openai format
2025-03-13 15:49:25 -07:00
Ishaan Jaff
d4ed384984
working spend tracking + logging for response api
2025-03-12 17:29:25 -07:00
Krish Dholakia
2d957a0ed9
Merge branch 'main' into litellm_dev_03_10_2025_p3
2025-03-12 14:56:01 -07:00
Ishaan Jaff
de473bee4b
fix mypy linting errors
2025-03-12 12:13:19 -07:00
Ishaan Jaff
342741ede1
Merge branch 'main' into litellm_responses_api_support
2025-03-12 12:04:12 -07:00
Ishaan Jaff
c6a9e8cafe
typing_extensions Annotated
2025-03-12 11:58:56 -07:00
Ishaan Jaff
e6b696370b
BaseLiteLLMOpenAIResponseObject
2025-03-12 11:39:51 -07:00
Ishaan Jaff
fde75a068a
working streaming logging
2025-03-12 00:02:39 -07:00
Ishaan Jaff
4ff6e41c15
ResponsesAPIStreamEvents
2025-03-11 23:42:35 -07:00
Ishaan Jaff
51dc24a405
_transform_response_api_usage_to_chat_usage
2025-03-11 22:26:44 -07:00
Ishaan Jaff
8ada5a469d
add responses api to call types
2025-03-11 22:02:48 -07:00
Ishaan Jaff
24cb83b0e4
Response API cost tracking
2025-03-11 22:02:14 -07:00
Ishaan Jaff
8da714104b
ResponsesAPIStreamingResponse
2025-03-11 17:48:15 -07:00
Ishaan Jaff
2ac5aa2477
ResponsesAPIOptionalRequestParams
2025-03-11 17:36:06 -07:00
Krrish Dholakia
cbc2e84044
refactor(azure.py): refactor to have client init work across all endpoints
2025-03-11 17:27:24 -07:00
Ishaan Jaff
eafc1be132
add ResponsesAPIResponse
2025-03-11 16:46:28 -07:00
Ishaan Jaff
0f8de3d0a5
add transform_request for OpenAI responses API
2025-03-11 16:33:26 -07:00
Ishaan Jaff
401a52e694
working transform
2025-03-11 15:24:42 -07:00
Ishaan Jaff
4b1b87eb67
openai reasoning initial types
2025-03-11 14:28:47 -07:00
Krrish Dholakia
a962a97fcb
feat(endpoints.py): support writing credentials to db
2025-03-10 18:27:43 -07:00
Krrish Dholakia
f1cdc26967
feat(endpoints.py): initial set of crud endpoints for reusable credentials on proxy
2025-03-10 17:48:02 -07:00
Krrish Dholakia
fdd5ba3084
feat(credential_accessor.py): support loading in credentials from credential_list
...
Resolves https://github.com/BerriAI/litellm/issues/9114
2025-03-10 17:15:58 -07:00
Krrish Dholakia
4bd4bb16fd
feat(proxy_server.py): move credential list to being a top-level param
2025-03-10 17:04:05 -07:00
omrishiv
0674491386
add support for Amazon Nova Canvas model ( #7838 )
...
* add initial support for Amazon Nova Canvas model
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* adjust name to AmazonNovaCanvas and map function variables to config
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* tighten model name check
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* fix quality mapping
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* add premium quality in config
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* support all Amazon Nova Canvas tasks
* remove unused import
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* add tests for image generation tasks and fix payload
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* add missing util file
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* update model prices backup file
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* remove image tasks other than text->image
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
---------
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>
2025-03-10 08:02:00 -07:00
Krrish Dholakia
574f5056c8
fix(utils.py): fix linting error
2025-03-09 20:47:12 -07:00
5aaee9
42b7921ca1
fix: perplexity return both delta and message cause OpenWebUI repect text ( #9081 )
2025-03-09 19:46:31 -07:00
Krish Dholakia
4330ef8e81
Fix batches api cost tracking + Log batch models in spend logs / standard logging payload ( #9077 )
...
Read Version from pyproject.toml / read-version (push) Successful in 42s
* feat(batches/): fix batch cost calculation - ensure it's accurate
use the correct cost value - prev. defaulting to non-batch cost
* feat(batch_utils.py): log batch models to spend logs + standard logging payload
makes it easy to understand how cost was calculated
* fix: fix stored payload for test
* test: fix test
2025-03-08 11:47:25 -08:00
Krish Dholakia
0e3caf92b9
UI - new API Playground for testing LiteLLM translation ( #9073 )
...
* feat: initial commit - enable dev to see translated request
* feat(utils.py): expose new endpoint - `/utils/transform_request` to see the raw request sent by litellm
* feat(transform_request.tsx): allow user to see their transformed request
* refactor(litellm_logging.py): return raw request in 3 parts - api_base, headers, request body
easier to render each individually on UI vs. extracting from combined string
* feat: transform_request.tsx
working e2e raw request viewing
* fix(litellm_logging.py): fix transform viewing for bedrock models
* fix(litellm_logging.py): don't return sensitive headers in raw request headers
prevent accidental leak
* feat(transform_request.tsx): style improvements
2025-03-07 19:39:31 -08:00
Ishaan Jaff
b02af305de
[Feat] - Display thinking
tokens on OpenWebUI (Bedrock, Anthropic, Deepseek) ( #9029 )
...
Read Version from pyproject.toml / read-version (push) Successful in 14s
* if merge_reasoning_content_in_choices
* _optional_combine_thinking_block_in_choices
* stash changes
* working merge_reasoning_content_in_choices with bedrock
* fix litellm_params accessor
* fix streaming handler
* merge_reasoning_content_in_choices
* _optional_combine_thinking_block_in_choices
* test_bedrock_stream_thinking_content_openwebui
* merge_reasoning_content_in_choices
* fix for _optional_combine_thinking_block_in_choices
* linting error fix
2025-03-06 18:32:58 -08:00
Ishaan Jaff
f47987e673
(Refactor) /v1/messages
to follow simpler logic for Anthropic API spec ( #9013 )
...
* anthropic_messages_handler v0
* fix /messages
* working messages with router methods
* test_anthropic_messages_handler_litellm_router_non_streaming
* test_anthropic_messages_litellm_router_non_streaming_with_logging
* AnthropicMessagesConfig
* _handle_anthropic_messages_response_logging
* working with /v1/messages endpoint
* working /v1/messages endpoint
* refactor to use router factory function
* use aanthropic_messages
* use BaseConfig for Anthropic /v1/messages
* track api key, team on /v1/messages endpoint
* fix get_logging_payload
* BaseAnthropicMessagesTest
* align test config
* test_anthropic_messages_with_thinking
* test_anthropic_streaming_with_thinking
* fix - display anthropic url for debugging
* test_bad_request_error_handling
* test_anthropic_messages_router_streaming_with_bad_request
* fix ProxyException
* test_bad_request_error_handling_streaming
* use provider_specific_header
* test_anthropic_messages_with_extra_headers
* test_anthropic_messages_to_wildcard_model
* fix gcs pub sub test
* standard_logging_payload
* fix unit testing for anthopic /v1/messages support
* fix pass through anthropic messages api
* delete dead code
* fix anthropic pass through response
* revert change to spend tracking utils
* fix get_litellm_metadata_from_kwargs
* fix spend logs payload json
* proxy_pass_through_endpoint_tests
* TestAnthropicPassthroughBasic
* fix pass through tests
* test_async_vertex_proxy_route_api_key_auth
* _handle_anthropic_messages_response_logging
* vertex_credentials
* test_set_default_vertex_config
* test_anthropic_messages_litellm_router_non_streaming_with_logging
* test_ageneric_api_call_with_fallbacks_basic
* test__aadapter_completion
2025-03-06 00:43:08 -08:00
Krish Dholakia
f6535ae6ad
Support format
param for specifying image type ( #9019 )
...
* fix(transformation.py): support a 'format' parameter for image's
allow user to specify mime type
* fix: pass mimetype via 'format' param
* feat(gemini/chat/transformation.py): support 'format' param for gemini
* fix(factory.py): support 'format' param on sync bedrock converse calls
* feat(bedrock/converse_transformation.py): support 'format' param for bedrock async calls
* refactor(factory.py): move to supporting 'format' param in base helper
ensures consistency in param support
* feat(gpt_transformation.py): filter out 'format' param
don't send invalid param to openai
* fix(gpt_transformation.py): fix translation
* fix: fix translation error
2025-03-05 19:52:53 -08:00
Krish Dholakia
ec4f665e29
Return signature
on anthropic streaming + migrate to signature
field instead of signature_delta
[MINOR bump] ( #9021 )
...
* Fix missing signature_delta in thinking blocks when streaming from Claude 3.7 (#8797 )
Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>
* test: update test to enforce signature found
* feat(refactor-signature-param-to-be-'signature'-instead-of-'signature_delta'): keeps it in sync with anthropic
* fix: fix linting error
---------
Co-authored-by: Martin Krasser <krasserm@googlemail.com>
2025-03-05 19:33:54 -08:00