litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 19:24:27 +00:00

Author	SHA1	Message	Date
Krish Dholakia	740bd7e9ce	(security fix) - Enforce model access restrictions on Azure OpenAI route (#8888 ) * fix(user_api_key_auth.py): Fixes https://github.com/BerriAI/litellm/issues/8780 security fix - enforce model access checks on azure routes * test(test_user_api_key_auth.py): add unit testing * test(test_openai_endpoints.py): add e2e test to ensure azure routes also run through model validation checks	2025-02-27 21:24:58 -08:00
Ishaan Jaff	2a3b70f2b6	move test openai error handling tests	2025-02-27 21:20:25 -08:00
Ishaan Jaff	35232b83b2	move test openai error handling	2025-02-27 20:50:15 -08:00
Ishaan Jaff	51a6a219cd	(Improvements) use `/openai/` pass through with OpenAI Ruby for Assistants API (#8884 ) * add ruby assistants testing * _join_url_paths * run ruby tests on ci/cd * TestBaseOpenAIPassThroughHandler * _join_url_paths * fix _join_url_paths * Install Ruby and Bundler * Install Ruby and Bundler	2025-02-27 20:01:16 -08:00
Ishaan Jaff	378e3d9e4d	(Proxy improvement) - Raise `BadRequestError` when unknown model passed in request (#8886 ) * fix safe access model in request body * litellm.BadRequestError * don't pass model in request body * test_chat_completion_bad_model	2025-02-27 19:30:57 -08:00
Ishaan Jaff	1e7b9cf767	(fix) Pass through spend tracking - ensure `custom_llm_provider` is tracked for Vertex, Google AI Studio, Anthropic (#8882 ) * fix track custom llm provider on pass through routes * fix use correct provider for google ai studio * fix tracking custom llm provider on pass through route * ui fix get provider logo * update tests to track custom llm provider * test_anthropic_streaming_with_headers * Potential fix for code scanning alert no. 2263: Incomplete URL substring sanitization Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2025-02-27 17:09:43 -08:00
Ishaan Jaff	047d1b1208	(Bug Fix) - Accurate token counting for `/anthropic/` API Routes on LiteLLM Proxy (#8880 ) * fix _create_anthropic_response_logging_payload * fix - pass through don't create standard logging payload * fix logged key hash * test_init_kwargs_for_pass_through_endpoint_basic * test_unit_test_anthropic_pass_through * fix anthropic pass through logging handler * test_stream_token_counting_anthropic_with_include_usage * convert_str_chunk_to_generic_chunk * _build_complete_streaming_response * test_anthropic_basic_completion_with_headers * test_anthropic_streaming_with_headers * improve test for pass through token counting	2025-02-27 15:43:03 -08:00
Ishaan Jaff	24df2331ec	(fix) Anthropic pass through cost tracking (#8874 ) * fix _create_anthropic_response_logging_payload * fix - pass through don't create standard logging payload * fix logged key hash * test_init_kwargs_for_pass_through_endpoint_basic * test_unit_test_anthropic_pass_through * fix anthropic pass through logging handler	2025-02-27 15:42:43 -08:00
Krish Dholakia	88eedb22b9	vertex ai anthropic thinking param support (#8853 ) * fix(vertex_llm_base.py): handle credentials passed in as dictionary * fix(router.py): support vertex credentials as json dict * test(test_vertex.py): allows easier testing mock anthropic thinking response for vertex ai * test(vertex_ai_partner_models/): don't remove "@" from model breaks anthropic cost calculation * test: move testing * fix: fix linting error * fix: fix linting error * fix(vertex_ai_partner_models/main.py): split @ for codestral model * test: fix test * fix: fix stripping "@" on mistral models * fix: fix test * test: fix test	2025-02-26 21:37:18 -08:00
Ishaan Jaff	6231052b18	[Bug]: Deepseek error on proxy after upgrading to 1.61.13-stable (#8860 ) * fix deepseek error * test_deepseek_provider_async_completion * fix get_complete_url	2025-02-26 21:11:06 -08:00
Krish Dholakia	3de4209569	fix caching on main branch (#8858 ) * fix(streaming_handler.py): fix is delta empty check to handle empty str * fix(streaming_handler.py): fix delta chunk on final response	2025-02-26 19:16:34 -08:00
Ishaan Jaff	c07dd16d88	(UI + Backend) Fix Adding Azure, Azure AI Studio models on LiteLLM (#8856 ) * fix Azure_AI_Studio * fix flow for adding custom model names * fix _should_use_api_key_header * handle custom model name change * test_azure_ai_request_format * Azure AI Foundry (Studio) * fix _should_use_api_key_header	2025-02-26 18:45:02 -08:00
Krish Dholakia	ab7c4d1a0e	Litellm dev bedrock anthropic 3 7 v2 (#8843 ) * feat(bedrock/converse/transformation.py): support claude-3-7-sonnet reasoning_Content transformation Closes https://github.com/BerriAI/litellm/issues/8777 * fix(bedrock/): support returning `reasoning_content` on streaming for claude-3-7 Resolves https://github.com/BerriAI/litellm/issues/8777 * feat(bedrock/): unify converse reasoning content blocks for consistency across anthropic and bedrock * fix(anthropic/chat/transformation.py): handle deepseek-style 'reasoning_content' extraction within transformation.py simpler logic * feat(bedrock/): fix streaming to return blocks in consistent format * fix: fix linting error * test: fix test * feat(factory.py): fix bedrock thinking block translation on tool calling allows passing the thinking blocks back to bedrock for tool calling * fix(types/utils.py): don't exclude provider_specific_fields on model dump ensures consistent responses * fix: fix linting errors * fix(convert_dict_to_response.py): pass reasoning_content on root * fix: test * fix(streaming_handler.py): add helper util for setting model id * fix(streaming_handler.py): fix setting model id on model response stream chunk * fix(streaming_handler.py): fix linting error * fix(streaming_handler.py): fix linting error * fix(types/utils.py): add provider_specific_fields to model stream response * fix(streaming_handler.py): copy provider specific fields and add them to the root of the streaming response * fix(streaming_handler.py): fix check * fix: fix test * fix(types/utils.py): ensure messages content is always openai compatible * fix(types/utils.py): fix delta object to always be openai compatible only introduce new params if variable exists * test: fix bedrock nova tests * test: skip flaky test * test: skip flaky test in ci/cd	2025-02-26 16:05:33 -08:00
Krish Dholakia	017c482d7b	fix(o_series_transformation.py): fix optional param check for o-serie… (#8787 ) * fix(o_series_transformation.py): fix optional param check for o-series models o3-mini and o-1 do not support parallel tool calling * fix(utils.py): support 'drop_params' for 'thinking' param across models allows switching to older claude versions (or non-anthropic models) and param to be safely dropped * fix: fix passing thinking param in optional params allows dropping thinking_param where not applicable * test: update old model * fix(utils.py): fix linting errors * fix(main.py): add param to acompletion	2025-02-26 12:26:55 -08:00
Krrish Dholakia	fcf4ea3608	build: merge squashed commit Squashed commit of the following: commit `6678e15381` Author: Ishaan Jaff <ishaanjaffer0324@gmail.com> Date: Wed Feb 26 09:29:15 2025 -0800 test_prompt_caching commit `bd86e0ac47` Author: Ishaan Jaff <ishaanjaffer0324@gmail.com> Date: Wed Feb 26 08:57:16 2025 -0800 test_prompt_caching commit `2fc21ad51e` Author: Ishaan Jaff <ishaanjaffer0324@gmail.com> Date: Wed Feb 26 08:13:45 2025 -0800 test_aprompt_caching commit `d94cff55ff` Author: Ishaan Jaff <ishaanjaffer0324@gmail.com> Date: Wed Feb 26 08:13:12 2025 -0800 test_prompt_caching commit `49c5e7811e` Author: Ishaan Jaff <ishaanjaffer0324@gmail.com> Date: Wed Feb 26 07:43:53 2025 -0800 ui new build commit `cb8d5e5917` Author: Ishaan Jaff <ishaanjaffer0324@gmail.com> Date: Wed Feb 26 07:38:56 2025 -0800 (UI) - Create Key flow for existing users (#8844) * working create user button * working create user for a key flow * allow searching users * working create user + key * use clear sections on create key * better search for users * fix create key * ui fix create key button - make it neater / cleaner * ui fix all keys table commit `335ba30467` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Wed Feb 26 08:53:17 2025 -0800 fix: fix file name commit `b8c5b31a4e` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Tue Feb 25 22:54:46 2025 -0800 fix: fix utils commit `ac6e503461` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Feb 24 10:43:31 2025 -0800 fix(main.py): fix openai message for assistant msg if role is missing - openai allows this Fixes https://github.com/BerriAI/litellm/issues/8661 commit `de3989dbc5` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Feb 24 21:19:25 2025 -0800 fix(get_litellm_params.py): handle no-log being passed in via kwargs Fixes https://github.com/BerriAI/litellm/issues/8380	2025-02-26 09:39:27 -08:00
Ishaan Jaff	da1fd9b25f	test_prompt_caching	2025-02-26 09:29:15 -08:00
Ishaan Jaff	4858417283	test_prompt_caching	2025-02-26 08:57:16 -08:00
Ishaan Jaff	2cf66f8267	test_aprompt_caching	2025-02-26 08:13:45 -08:00
Ishaan Jaff	56b2576979	test_prompt_caching	2025-02-26 08:13:12 -08:00
Ishaan Jaff	fe3a13ecf9	fix pass through tests vertex	2025-02-25 20:21:16 -08:00
Ishaan Jaff	89b5eeb345	fix test internal users	2025-02-25 20:13:38 -08:00
Ishaan Jaff	555067e207	test fix deprecated gemini-1.0-pro on vertex	2025-02-25 20:00:04 -08:00
Ishaan Jaff	7021f2f244	(Bug fix) dd-trace used by default on litellm proxy (#8817 ) * fix _should_use_dd_tracer * fix _should_use_dd_tracer * _should_use_dd_tracer * _should_use_dd_tracer * _should_use_dd_tracer * _init_dd_tracer * _should_use_dd_tracer * fix should use dd-tracer * fix dd tracer	2025-02-25 19:54:22 -08:00
Ishaan Jaff	81039d8faf	(Bug fix) - allow using Assistants GET, DELETE on `/openai` pass through routes (#8818 ) * test_openai_assistants_e2e_operations * test openai assistants pass through * fix GET request on pass through handler * _make_non_streaming_http_request * _is_assistants_api_request * test_openai_assistants_e2e_operations * test_openai_assistants_e2e_operations * openai_proxy_route * docs openai pass through * docs openai pass through * docs openai pass through * test pass through handler * Potential fix for code scanning alert no. 2240: Incomplete URL substring sanitization Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2025-02-25 19:19:00 -08:00
Ishaan Jaff	f9cee4c46b	(Bug Fix) Using LiteLLM Python SDK with model=`litellm_proxy/` for embedding, image_generation, transcription, speech, rerank (#8815 ) * test_litellm_gateway_from_sdk * fix embedding check for openai * test litellm proxy provider * fix image generation openai compatible models * fix litellm.transcription * test_litellm_gateway_from_sdk_rerank * docs litellm python sdk * docs litellm python sdk with proxy * test_litellm_gateway_from_sdk_rerank * ci/cd run again * test_litellm_gateway_from_sdk_image_generation * test_litellm_gateway_from_sdk_embedding * test_litellm_gateway_from_sdk_embedding	2025-02-25 16:22:37 -08:00
Ishaan Jaff	7eaf003919	expected_result	2025-02-25 15:25:51 -08:00
Krish Dholakia	b829475587	Litellm dev 02 25 2025 p1 (#8816 ) * build(model_prices_and_context_window.json): add bedrock cross-region inferencing model information Closes https://github.com/BerriAI/litellm/issues/8801#issuecomment-2683438528 * build(model_prices_and_context_window.json): add claude sonnet `-latest` models to model cost map Closes https://github.com/BerriAI/litellm/discussions/8770#discussioncomment-12318880 * build(model_prices_and_context_window.json): add remaining anthropic `-latest` models to model cost map Closes https://github.com/BerriAI/litellm/discussions/8770#discussioncomment-12318880 * test: update test with new model	2025-02-25 15:20:39 -08:00
Ishaan Jaff	d963568970	(Bug fix) - running litellm proxy on wndows (#8735 ) * fix running litellm on windows * fix importing litellm * _init_hypercorn_server * linting fix * TestProxyInitializationHelpers * ci/cd run again * ci/cd run again	2025-02-25 15:19:19 -08:00
Ishaan Jaff	c0aec0cc5d	(Bug fix) - reading /parsing request body when on hypercorn (#8734 ) * _safe_get_request_parsed_body * use scope on hypercorn * test http parsing utils * ci/cd run again	2025-02-25 15:18:04 -08:00
Ishaan Jaff	f6fa2399cc	(Router) - If `allowed_fails` or `allowed_fail_policy` set, use that for single deployment cooldown logic (#8668 ) * fix cooldown 1 deployment * test_single_deployment_cooldown_with_allowed_fail_policy * fix docstring test * test_single_deployment_no_cooldowns * ci/cd run again * test router cooldowns	2025-02-25 15:15:01 -08:00
Ishaan Jaff	3d0b56e8a3	test_can_team_access_model	2025-02-25 14:57:13 -08:00
Ishaan Jaff	55ae194912	test_user_email_metrics	2025-02-25 10:47:09 -08:00
Ishaan Jaff	1b2e3855cc	test_update_user_unit_test	2025-02-25 10:46:33 -08:00
Krrish Dholakia	de8497309b	docs(anthropic.md): add claude-3-7-sonnet support All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 15s Details	2025-02-25 00:06:30 -08:00
Krrish Dholakia	b7ec53aec1	test: handle index error	2025-02-24 22:11:08 -08:00
Krish Dholakia	142b195784	Add anthropic thinking + reasoning content support (#8778 ) * feat(anthropic/chat/transformation.py): add anthropic thinking param support * feat(anthropic/chat/transformation.py): support returning thinking content for anthropic on streaming responses * feat(anthropic/chat/transformation.py): return list of thinking blocks (include block signature) allows usage in tool call responses * fix(types/utils.py): extract and map reasoning_content from anthropic as content str * test: add testing to ensure thinking_blocks are returned at the root * fix(anthropic/chat/handler.py): return thinking blocks on streaming - include signature * feat(factory.py): handle anthropic thinking blocks translation if in assistant response * test: handle openai internal instability * test: handle openai audio instability * ci: pin anthropic dep * test: handle openai audio instability * fix: fix linting error * refactor(anthropic/chat/transformation.py): refactor function to remain <50 LOC * fix: fix linting error * fix: fix linting error * fix: fix linting error * fix: fix linting error	2025-02-24 21:54:30 -08:00
Krish Dholakia	9914c166b7	Litellm contributor prs 02 24 2025 (#8775 ) * Adding VertexAI Claude 3.7 Sonnet (#8774) Co-authored-by: Emerson Gomes <emerson.gomes@thalesgroup.com> * build(model_prices_and_context_window.json): add anthropic 3-7 models on vertex ai and bedrock * Support video_url (#8743) * Support video_url Support VLMs that works with video. Example implemenation in vllm: https://github.com/vllm-project/vllm/pull/10020 * llms openai.py: Add ChatCompletionVideoObject Add data structures to support `video_url` in chat completion * test test_completion.py: add test for video_url * Arize Phoenix - ensure correct endpoint/protocol are used; and default to phoenix cloud (#8750) * minor fixes to default to http and to ensure that the correct endpoint is used * Update test_arize_phoenix.py * prioritize http over grpc --------- Co-authored-by: Emerson Gomes <emerson.gomes@gmail.com> Co-authored-by: Emerson Gomes <emerson.gomes@thalesgroup.com> Co-authored-by: Pang Wu <104795337+pang-wu@users.noreply.github.com> Co-authored-by: Nate Mar <67926244+nate-mar@users.noreply.github.com>	2025-02-24 18:55:48 -08:00
Dragos Campean	2905ad98b3	Add anthropic3-7-sonnet (#8766 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 14s Details	2025-02-24 12:59:00 -08:00
Ishaan Jaff	b93889660a	fix: remove aws params from bedrock embedding request body (#8618 ) (#8696 ) * fix: remove aws params from bedrock embedding request body (#8618) * fix: remove aws params from bedrock embedding request body * fix-7548: handle aws params in base class * test: load request data from mock call * (Infra/DB) - Allow running older litellm version when out of sync with current state of DB (#8695) * fix check migration * clean up should_update_prisma_schema * update test * db_migration_disable_update_check * Check container logs for expected message * db_migration_disable_update_check * test_check_migration_out_of_sync * test_should_update_prisma_schema * db_migration_disable_update_check * pip install aiohttp * ui new build * delete deprecated code test * bump: version 1.61.12 → 1.61.13 * Add cost tracking for rerank via bedrock (#8691) * feat(bedrock/rerank): infer model region if model given as arn * test: add unit testing to ensure bedrock region name inferred from arn on rerank * feat(bedrock/rerank/transformation.py): include search units for bedrock rerank result Resolves https://github.com/BerriAI/litellm/issues/7258#issuecomment-2671557137 * test(test_bedrock_completion.py): add testing for bedrock cohere rerank * feat(cost_calculator.py): refactor rerank cost tracking to support bedrock cost tracking * build(model_prices_and_context_window.json): add amazon.rerank model to model cost map * fix(cost_calculator.py): bedrock/common_utils.py get base model from model w/ arn -> handles rerank model * build(model_prices_and_context_window.json): add bedrock cohere rerank pricing * feat(bedrock/rerank): migrate bedrock config to basererank config * Revert "feat(bedrock/rerank): migrate bedrock config to basererank config" This reverts commit `84fae1f167`. * test: add testing to ensure large doc / queries are correctly counted * Revert "test: add testing to ensure large doc / queries are correctly counted" This reverts commit `4337f1657e`. * fix(migrate-jina-ai-to-rerank-config): enables cost tracking * refactor(jina_ai/): finish migrating jina ai to base rerank config enables cost tracking * fix(jina_ai/rerank): e2e jina ai rerank cost tracking * fix: cleanup dead code * fix: fix python3.8 compatibility error * test: fix test * test: add e2e testing for azure ai rerank * fix: fix linting error * test: mark cohere as flaky * add bedrock llama vision support + cohere / infinity rerank - 'return_documents' support (#8684) * build(model_prices_and_context_window.json): mark bedrock llama as supporting vision based on docs * Add price for Cerebras llama3.3-70b (#8676) * docs(readme.md): fix contributing docs point people to new mock directory testing structure s/o @vibhavbhat * build: update contributing readme * docs(readme.md): improve docs * docs(readme.md): cleanup readme on tests/ * docs(README.md): cleanup doc * feat(infinity/): support returning documents when return_documents=True * test(test_rerank.py): add e2e testing for cohere rerank * fix: fix linting errors * fix(together_ai/): fix together ai transformation * fix: fix linting error * fix: fix linting errors * fix: fix linting errors * test: mark cohere as flaky * build: fix model supports check * test: fix test * test: mark flaky test * fix: fix test * test: fix test --------- Co-authored-by: Yury Koleda <fut.wrk@gmail.com> * test: fix test * fix: remove unused import * bump: version 1.61.13 → 1.61.14 * Correct spelling in user_management_heirarchy.md (#8716) Fixing irritating typo -- page and image names would also need to be updated * (Feat) - UI, Allow sorting models by Created_At and all other columns on the UI (#8725) * order models by created at * use existing table component on models page * sorting for created at * ui clean up models page * remove provider filter * fix columns sorting * decent switching * ui fix models page * (UI) Edit Model flow improvements (#8729) * order models by created at * use existing table component on models page * sorting for created at * ui clean up models page * remove provider filter * fix columns sorting * decent switching * ui fix models page * show edit / delete button on root of table * clean up columns * working edit model flow * decent working model edit page * fix edit model * show created at and created by * ui easy model edit flow * clean up columns * ui clean up updated at * fix model datatable * ui new build * bump: version 1.61.14 → 1.61.15 * Support arize phoenix on litellm proxy (#7756) (#8715) * Update opentelemetry.py wip * Update test_opentelemetry_unit_tests.py * fix a few paths and tests * fix path * Update litellm_logging.py * accidentally removed code * Add type for protocol * Add and update tests * minor changes * update and add additional arize phoenix test * update existing test * address feedback * use standard_logging_object * address feedback Co-authored-by: Nate Mar <67926244+nate-mar@users.noreply.github.com> * fix(amazon_deepseek_transformation.py): remove </think> from stream o… (#8717) * fix(amazon_deepseek_transformation.py): remove </think> from stream output - cleanup user facing stream * fix(key_managenet_endpoints.py): return `/key/list` sorted by created_at makes it easier to see created key * style: cleanup team table * feat(key_edit_view.tsx): support setting model specific tpm/rpm limits on keys * Add cohere v2/rerank support (#8421) (#8605) * Add cohere v2/rerank support (#8421) * Support v2 endpoint cohere rerank * Add tests and docs * Make v1 default if old params used * Update docs * Update docs pt 2 * Update tests * Add e2e test * Clean up code * Use inheritence for new config * Fix linting issues (#8608) * Fix cohere v2 failing test + linting (#8672) * Fix test and unused imports * Fix tests * fix: fix linting errors * test: handle tgai instability * fix: skip service unavailable err * test: print logs for unstable test * test: skip unreliable tests --------- Co-authored-by: vibhavbhat <vibhavb00@gmail.com> * fix(proxy/_types.py): fixes issue where internal user able to escalat… (#8740) * fix(proxy/_types.py): fixes issue where internal user able to escalate their role with ui key Fixes https://github.com/BerriAI/litellm/issues/8029 * style: cleanup * test: handle bedrock instability --------- Co-authored-by: Madhukar Holla <mholla8@gmail.com> Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com> Co-authored-by: Yury Koleda <fut.wrk@gmail.com> Co-authored-by: Oskar Austegard <oskar@austegard.com> Co-authored-by: Nate Mar <67926244+nate-mar@users.noreply.github.com> Co-authored-by: vibhavbhat <vibhavb00@gmail.com>	2025-02-24 10:04:58 -08:00
Krish Dholakia	566d9354aa	fix(proxy/_types.py): fixes issue where internal user able to escalat… (#8740 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 41s Details * fix(proxy/_types.py): fixes issue where internal user able to escalate their role with ui key Fixes https://github.com/BerriAI/litellm/issues/8029 * style: cleanup * test: handle bedrock instability	2025-02-22 22:59:58 -08:00
Krish Dholakia	09462ba80c	Add cohere v2/rerank support (#8421 ) (#8605 ) * Add cohere v2/rerank support (#8421) * Support v2 endpoint cohere rerank * Add tests and docs * Make v1 default if old params used * Update docs * Update docs pt 2 * Update tests * Add e2e test * Clean up code * Use inheritence for new config * Fix linting issues (#8608) * Fix cohere v2 failing test + linting (#8672) * Fix test and unused imports * Fix tests * fix: fix linting errors * test: handle tgai instability * fix: skip service unavailable err * test: print logs for unstable test * test: skip unreliable tests --------- Co-authored-by: vibhavbhat <vibhavb00@gmail.com>	2025-02-22 22:25:29 -08:00
Krish Dholakia	21ea52105a	Support arize phoenix on litellm proxy (#7756 ) (#8715 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 12s Details * Update opentelemetry.py wip * Update test_opentelemetry_unit_tests.py * fix a few paths and tests * fix path * Update litellm_logging.py * accidentally removed code * Add type for protocol * Add and update tests * minor changes * update and add additional arize phoenix test * update existing test * address feedback * use standard_logging_object * address feedback Co-authored-by: Nate Mar <67926244+nate-mar@users.noreply.github.com>	2025-02-22 20:55:11 -08:00
Krish Dholakia	251467a525	add bedrock llama vision support + cohere / infinity rerank - 'return_documents' support (#8684 ) * build(model_prices_and_context_window.json): mark bedrock llama as supporting vision based on docs * Add price for Cerebras llama3.3-70b (#8676) * docs(readme.md): fix contributing docs point people to new mock directory testing structure s/o @vibhavbhat * build: update contributing readme * docs(readme.md): improve docs * docs(readme.md): cleanup readme on tests/ * docs(README.md): cleanup doc * feat(infinity/): support returning documents when return_documents=True * test(test_rerank.py): add e2e testing for cohere rerank * fix: fix linting errors * fix(together_ai/): fix together ai transformation * fix: fix linting error * fix: fix linting errors * fix: fix linting errors * test: mark cohere as flaky * build: fix model supports check * test: fix test * test: mark flaky test * fix: fix test * test: fix test --------- Co-authored-by: Yury Koleda <fut.wrk@gmail.com>	2025-02-20 21:23:54 -08:00
Krish Dholakia	b682dc4ec8	Add cost tracking for rerank via bedrock (#8691 ) * feat(bedrock/rerank): infer model region if model given as arn * test: add unit testing to ensure bedrock region name inferred from arn on rerank * feat(bedrock/rerank/transformation.py): include search units for bedrock rerank result Resolves https://github.com/BerriAI/litellm/issues/7258#issuecomment-2671557137 * test(test_bedrock_completion.py): add testing for bedrock cohere rerank * feat(cost_calculator.py): refactor rerank cost tracking to support bedrock cost tracking * build(model_prices_and_context_window.json): add amazon.rerank model to model cost map * fix(cost_calculator.py): bedrock/common_utils.py get base model from model w/ arn -> handles rerank model * build(model_prices_and_context_window.json): add bedrock cohere rerank pricing * feat(bedrock/rerank): migrate bedrock config to basererank config * Revert "feat(bedrock/rerank): migrate bedrock config to basererank config" This reverts commit `84fae1f167`. * test: add testing to ensure large doc / queries are correctly counted * Revert "test: add testing to ensure large doc / queries are correctly counted" This reverts commit `4337f1657e`. * fix(migrate-jina-ai-to-rerank-config): enables cost tracking * refactor(jina_ai/): finish migrating jina ai to base rerank config enables cost tracking * fix(jina_ai/rerank): e2e jina ai rerank cost tracking * fix: cleanup dead code * fix: fix python3.8 compatibility error * test: fix test * test: add e2e testing for azure ai rerank * fix: fix linting error * test: mark cohere as flaky	2025-02-20 21:00:18 -08:00
Ishaan Jaff	da510651e0	delete deprecated code test	2025-02-20 18:53:59 -08:00
Ishaan Jaff	55b938dd6e	(Infra/DB) - Allow running older litellm version when out of sync with current state of DB (#8695 ) * fix check migration * clean up should_update_prisma_schema * update test * db_migration_disable_update_check * Check container logs for expected message * db_migration_disable_update_check * test_check_migration_out_of_sync * test_should_update_prisma_schema * db_migration_disable_update_check * pip install aiohttp	2025-02-20 18:30:23 -08:00
Ishaan Jaff	300d7825f5	(Observability) - Add more detailed dd tracing on Proxy Auth, Bedrock Auth (#8693 ) * add dd tracer * fix dd tracing * add @tracer.wrap() on def user_api_key_auth * add async_function_with_retries * remove dead code * add tracer.wrap on base aws llm * add tracer.wrap on base aws llm * fix print verbose * fix dd tracing * trace base aws llm * fix test base aws llm * fix converse transform * test base aws llm * BASE_AWS_LLM_PATH * BASE_AWS_LLM_PATH * test dd tracing	2025-02-20 18:00:41 -08:00
Ishaan Jaff	ccfbb77b73	(Redis fix) - use mget_non_atomic (#8682 ) * use mget_nonatomic * redis cluster override MGET op * fix redis cluster + MGET * test redis cluster	2025-02-20 17:51:31 -08:00
Ishaan Jaff	bb6f43d12e	(Bug fix) - Cache Health not working when configured with prometheus service logger (#8687 ) * fix serialize on safe json dumps * test_non_standard_dict_keys_complex * ui fix HealthCheckCacheParams * fix HealthCheckCacheParams * fix code qa * test_cache_ping_failure * test_cache_ping_health_check_includes_only_cache_attributes * test_cache_ping_health_check_includes_only_cache_attributes	2025-02-20 13:41:56 -08:00
Krish Dholakia	f9df01fbc6	fix(utils.py): handle token counter error when invalid message passed in (#8670 ) * fix(utils.py): handle token counter error * fix(utils.py): testing fixes * fix(utils.py): fix incr for num tokens from list * fix(utils.py): fix text str token counting	2025-02-19 22:21:34 -08:00

... 10 11 12 13 14 ...

1589 commits