litellm

Author	SHA1	Message	Date
Ishaan Jaff	7959dc9db3	(feat) add bedrock/stability.stable-image-ultra-v1:0 (#6723 ) * add stability.stable-image-ultra-v1:0 * add pricing for stability.stable-image-ultra-v1:0 * fix test_supports_response_schema * ci/cd run again	2024-11-14 14:47:15 -08:00
Krrish Dholakia	fc685c1f74	docs(logging.md): add 'trace_id' param to standard logging payload	2024-11-15 02:01:37 +05:30
Krrish Dholakia	9593fbe5c3	docs(reliability.md): add tutorial on disabling fallbacks per key	2024-11-15 01:49:17 +05:30
Krrish Dholakia	499780eff2	docs: add docs on jina ai rerank support	2024-11-15 01:45:57 +05:30
Krrish Dholakia	89678ace00	bump: version 1.52.7 → 1.52.8	2024-11-15 01:03:49 +05:30
Krish Dholakia	e9aa492af3	LiteLLM Minor Fixes & Improvement (11/14/2024) (#6730 ) * fix(ollama.py): fix get model info request Fixes https://github.com/BerriAI/litellm/issues/6703 * feat(anthropic/chat/transformation.py): support passing user id to anthropic via openai 'user' param * docs(anthropic.md): document all supported openai params for anthropic * test: fix tests * fix: fix tests * feat(jina_ai/): add rerank support Closes https://github.com/BerriAI/litellm/issues/6691 * test: handle service unavailable error * fix(handler.py): refactor together ai rerank call * test: update test to handle overloaded error * test: fix test * Litellm router trace (#6742) * feat(router.py): add trace_id to parent functions - allows tracking retry/fallbacks * feat(router.py): log trace id across retry/fallback logic allows grouping llm logs for the same request * test: fix tests * fix: fix test * fix(transformation.py): only set non-none stop_sequences * Litellm router disable fallbacks (#6743) * bump: version 1.52.6 → 1.52.7 * feat(router.py): enable dynamically disabling fallbacks Allows for enabling/disabling fallbacks per key * feat(litellm_pre_call_utils.py): support setting 'disable_fallbacks' on litellm key * test: fix test * fix(exception_mapping_utils.py): map 'model is overloaded' to internal server error * test: handle gemini error * test: fix test * fix: new run	2024-11-15 01:02:54 +05:30
Ishaan Jaff	f8e700064e	(Feat) Add support for storing virtual keys in AWS SecretManager (#6728 ) * add SecretManager to httpxSpecialProvider * fix importing AWSSecretsManagerV2 * add unit testing for writing keys to AWS secret manager * use KeyManagementEventHooks for key/generated events * us event hooks for key management endpoints * working AWSSecretsManagerV2 * fix write secret to AWS secret manager on /key/generate * fix KeyManagementSettings * use tasks for key management hooks * add async_delete_secret * add test for async_delete_secret * use _delete_virtual_keys_from_secret_manager * fix test secret manager * test_key_generate_with_secret_manager_call * fix check for key_management_settings * sync_read_secret * test_aws_secret_manager * fix sync_read_secret * use helper to check when _should_read_secret_from_secret_manager * test_get_secret_with_access_mode * test - handle eol model claude-2, use claude-2.1 instead * docs AWS secret manager * fix test_read_nonexistent_secret * fix test_supports_response_schema * ci/cd run again	2024-11-14 09:25:07 -08:00
Ishaan Jaff	da84056e59	mark Helm PreSyn as BETA	2024-11-13 22:18:12 -08:00
Ishaan Jaff	387c70c989	fix test_supports_response_schema	2024-11-13 21:59:24 -08:00
Camden Clark	b582efa3ce	Update prefix.md (#6734 )	2024-11-14 11:18:35 +05:30
Jongseob Jeon	f3914c87d3	Update code blocks huggingface.md (#6737 )	2024-11-14 11:17:57 +05:30
Ishaan Jaff	310669e3bc	(docs) add instructions on how to contribute to docker image	2024-11-13 20:52:17 -08:00
Ishaan Jaff	914cec3ab5	test - handle eol model claude-2, use claude-2.1 instead	2024-11-13 19:37:34 -08:00
Ishaan Jaff	f2e6025c65	fix prisma migration	2024-11-13 17:04:58 -08:00
Ishaan Jaff	0e2c16e948	fix migration job	2024-11-13 17:02:06 -08:00
Ishaan Jaff	b56b5dce7f	fix migrations-job.yaml	2024-11-13 16:59:34 -08:00
Ishaan Jaff	894b295658	update doc on pre sync hook	2024-11-13 16:56:55 -08:00
Ishaan Jaff	b5183ce31b	fix migration job	2024-11-13 16:56:09 -08:00
Ishaan Jaff	da5da64d27	fix yaml on migrations job	2024-11-13 16:48:22 -08:00
Ishaan Jaff	4dc23cf997	use existing spec for migrations job	2024-11-13 16:43:26 -08:00
Ishaan Jaff	aa82a88c5f	fix DATABASE_URL	2024-11-13 16:19:37 -08:00
Ishaan Jaff	db9d9dde0a	fix migration job.yaml	2024-11-13 16:18:11 -08:00
Ishaan Jaff	49cda71c55	docs helm pre sync hook	2024-11-13 15:33:43 -08:00
Ishaan Jaff	e77ceec949	helm run DISABLE_SCHEMA_UPDATE	2024-11-13 15:28:07 -08:00
Ishaan Jaff	b8b899f5d7	docs proxy_budget_rescheduler_min_time	2024-11-13 15:03:08 -08:00
Krrish Dholakia	44709dd31d	bump: version 1.52.6 → 1.52.7	2024-11-14 01:25:31 +05:30
Krish Dholakia	1c3dcd4b25	Litellm key update fix (#6710 ) * fix(caching): convert arg to equivalent kwargs in llm caching handler prevent unexpected errors * fix(caching_handler.py): don't pass args to caching * fix(caching): remove all args from caching.py fix(caching): consistent function signatures + abc method * test(caching_unit_tests.py): add unit tests for llm caching ensures coverage for common caching scenarios across different implementations * refactor(litellm_logging.py): move to using cache key from hidden params instead of regenerating one * fix(router.py): drop redis password requirement * fix(proxy_server.py): fix faulty slack alerting check * fix(langfuse.py): avoid copying functions/thread lock objects in metadata fixes metadata copy error when parent otel span in metadata * test: update test * fix(key_management_endpoints.py): fix /key/update with metadata update * fix(key_management_endpoints.py): fix key_prepare_update helper * fix(key_management_endpoints.py): reset value to none if set in key update * fix: update test ' * Litellm dev 11 11 2024 (#6693) * fix(__init__.py): add 'watsonx_text' as mapped llm api route Fixes https://github.com/BerriAI/litellm/issues/6663 * fix(opentelemetry.py): fix passing parallel tool calls to otel Fixes https://github.com/BerriAI/litellm/issues/6677 * refactor(test_opentelemetry_unit_tests.py): create a base set of unit tests for all logging integrations - test for parallel tool call handling reduces bugs in repo * fix(__init__.py): update provider-model mapping to include all known provider-model mappings Fixes https://github.com/BerriAI/litellm/issues/6669 * feat(anthropic): support passing document in llm api call * docs(anthropic.md): add pdf anthropic call to docs + expose new 'supports_pdf_input' function * fix(factory.py): fix linting error * add clear doc string for GCS bucket logging * Add docs to export logs to Laminar (#6674) * Add docs to export logs to Laminar * minor fix: newline at end of file * 
place laminar after http and grpc * (Feat) Add langsmith key based logging (#6682) * add langsmith_api_key to StandardCallbackDynamicParams * create a file for langsmith types * langsmith add key / team based logging * add key based logging for langsmith * fix langsmith key based logging * fix linting langsmith * remove NOQA violation * add unit test coverage for all helpers in test langsmith * test_langsmith_key_based_logging * docs langsmith key based logging * run langsmith tests in logging callback tests * fix logging testing * test_langsmith_key_based_logging * test_add_callback_via_key_litellm_pre_call_utils_langsmith * add debug statement langsmith key based logging * test_langsmith_key_based_logging * (fix) OpenAI's optional messages[].name does not work with Mistral API (#6701) * use helper for _transform_messages mistral * add test_message_with_name to base LLMChat test * fix linting * add xAI on Admin UI (#6680) * (docs) add benchmarks on 1K RPS (#6704) * docs litellm proxy benchmarks * docs GCS bucket * doc fix - reduce clutter on logging doc title * (feat) add cost tracking stable diffusion 3 on Bedrock (#6676) * add cost tracking for sd3 * test_image_generation_bedrock * fix get model info for image cost * add cost_calculator for stability 1 models * add unit testing for bedrock image cost calc * test_cost_calculator_with_no_optional_params * add test_cost_calculator_basic * correctly allow size Optional * fix cost_calculator * sd3 unit tests cost calc * fix raise correct error 404 when /key/info is called on non-existent key (#6653) * fix raise correct error on /key/info * add not_found_error error * fix key not found in DB error * use 1 helper for checking token hash * fix error code on key info * fix test key gen prisma * test_generate_and_call_key_info * test fix test_call_with_valid_model_using_all_models * fix key info tests * bump: version 1.52.4 → 1.52.5 * add defaults used for GCS logging * LiteLLM Minor Fixes & Improvements (11/12/2024) 
(#6705) * fix(caching): convert arg to equivalent kwargs in llm caching handler prevent unexpected errors * fix(caching_handler.py): don't pass args to caching * fix(caching): remove all args from caching.py fix(caching): consistent function signatures + abc method * test(caching_unit_tests.py): add unit tests for llm caching ensures coverage for common caching scenarios across different implementations * refactor(litellm_logging.py): move to using cache key from hidden params instead of regenerating one * fix(router.py): drop redis password requirement * fix(proxy_server.py): fix faulty slack alerting check * fix(langfuse.py): avoid copying functions/thread lock objects in metadata fixes metadata copy error when parent otel span in metadata * test: update test * bump: version 1.52.5 → 1.52.6 * (feat) helm hook to sync db schema (#6715) * v0 migration job * fix job * fix migrations job.yml * handle standalone DB on helm hook * fix argo cd annotations * fix db migration helm hook * fix migration job * doc fix Using Http/2 with Hypercorn * (fix proxy redis) Add redis sentinel support (#6154) * add sentinel_password support * add doc for setting redis sentinel password * fix redis sentinel - use sentinel password * Fix: Update gpt-4o costs to that of gpt-4o-2024-08-06 (#6714) Fixes #6713 * (fix) using Anthropic `response_format={"type": "json_object"}` (#6721) * add support for response_format=json anthropic * add test_json_response_format to baseLLM ChatTest * fix test_litellm_anthropic_prompt_caching_tools * fix test_anthropic_function_call_with_no_schema * test test_create_json_tool_call_for_response_format * (feat) Add cost tracking for Azure Dall-e-3 Image Generation + use base class to ensure basic image generation tests pass (#6716) * add BaseImageGenTest * use 1 class for unit testing * add debugging to BaseImageGenTest * TestAzureOpenAIDalle3 * fix response_cost_calculator * test_basic_image_generation * fix img gen basic test * fix 
_select_model_name_for_cost_calc * fix test_aimage_generation_bedrock_with_optional_params * fix undo changes cost tracking * fix response_cost_calculator * fix test_cost_azure_gpt_35 * fix remove dup test (#6718) * (build) update db helm hook * (build) helm db pre sync hook * (build) helm db sync hook * test: run test_team_logging firdst --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: Dinmukhamed Mailibay <47117969+dinmukhamedm@users.noreply.github.com> Co-authored-by: Kilian Lieret <kilian.lieret@posteo.de>	2024-11-14 00:42:37 +05:30
Ishaan Jaff	70c8be59d7	(build) helm db sync hook	2024-11-12 20:45:53 -08:00
Ishaan Jaff	ebb03098cb	(build) helm db pre sync hook	2024-11-12 20:26:08 -08:00
Ishaan Jaff	ac04e5f1e6	(build) update db helm hook	2024-11-12 20:22:08 -08:00
Ishaan Jaff	aa6fe6e317	fix remove dup test (#6718 )	2024-11-12 20:16:54 -08:00
Ishaan Jaff	73c7b73aa0	(feat) Add cost tracking for Azure Dall-e-3 Image Generation + use base class to ensure basic image generation tests pass (#6716 ) * add BaseImageGenTest * use 1 class for unit testing * add debugging to BaseImageGenTest * TestAzureOpenAIDalle3 * fix response_cost_calculator * test_basic_image_generation * fix img gen basic test * fix _select_model_name_for_cost_calc * fix test_aimage_generation_bedrock_with_optional_params * fix undo changes cost tracking * fix response_cost_calculator * fix test_cost_azure_gpt_35	2024-11-12 20:02:16 -08:00
Ishaan Jaff	6d4cf2d908	(fix) using Anthropic `response_format={"type": "json_object"}` (#6721 ) * add support for response_format=json anthropic * add test_json_response_format to baseLLM ChatTest * fix test_litellm_anthropic_prompt_caching_tools * fix test_anthropic_function_call_with_no_schema * test test_create_json_tool_call_for_response_format	2024-11-12 19:06:00 -08:00
Kilian Lieret	e7543378b8	Fix: Update gpt-4o costs to that of gpt-4o-2024-08-06 (#6714 ) Fixes #6713	2024-11-12 18:40:52 -08:00
Ishaan Jaff	d136641954	(fix proxy redis) Add redis sentinel support (#6154 ) * add sentinel_password support * add doc for setting redis sentinel password * fix redis sentinel - use sentinel password	2024-11-12 18:36:46 -08:00
Ishaan Jaff	86607a2018	doc fix Using Http/2 with Hypercorn	2024-11-12 18:33:07 -08:00
Ishaan Jaff	4192d7ec6f	fix migration job	2024-11-12 12:20:30 -08:00
Ishaan Jaff	07d7ac3ede	fix db migration helm hook	2024-11-12 12:13:42 -08:00
Ishaan Jaff	503e4a4ad5	fix argo cd annotations	2024-11-12 12:07:57 -08:00
Ishaan Jaff	b4f76556b6	handle standalone DB on helm hook	2024-11-12 12:06:13 -08:00
Ishaan Jaff	ccb6c42e86	fix migrations job.yml	2024-11-12 12:01:37 -08:00
Ishaan Jaff	688d513459	(feat) helm hook to sync db schema (#6715 ) * v0 migration job * fix job	2024-11-12 11:58:35 -08:00
Krrish Dholakia	5081b912eb	bump: version 1.52.5 → 1.52.6	2024-11-12 23:53:07 +05:30
Krish Dholakia	9160d80fa5	LiteLLM Minor Fixes & Improvements (11/12/2024) (#6705 ) * fix(caching): convert arg to equivalent kwargs in llm caching handler prevent unexpected errors * fix(caching_handler.py): don't pass args to caching * fix(caching): remove all args from caching.py fix(caching): consistent function signatures + abc method * test(caching_unit_tests.py): add unit tests for llm caching ensures coverage for common caching scenarios across different implementations * refactor(litellm_logging.py): move to using cache key from hidden params instead of regenerating one * fix(router.py): drop redis password requirement * fix(proxy_server.py): fix faulty slack alerting check * fix(langfuse.py): avoid copying functions/thread lock objects in metadata fixes metadata copy error when parent otel span in metadata * test: update test	2024-11-12 22:50:51 +05:30
Ishaan Jaff	d39fd60801	add defaults used for GCS logging	2024-11-12 07:12:51 -08:00
Ishaan Jaff	33ceb7ca1f	bump: version 1.52.4 → 1.52.5	2024-11-11 21:01:05 -08:00
Ishaan Jaff	de2f9aed3a	fix raise correct error 404 when /key/info is called on non-existent key (#6653 ) * fix raise correct error on /key/info * add not_found_error error * fix key not found in DB error * use 1 helper for checking token hash * fix error code on key info * fix test key gen prisma * test_generate_and_call_key_info * test fix test_call_with_valid_model_using_all_models * fix key info tests	2024-11-11 21:00:39 -08:00
Ishaan Jaff	25bae4cc23	(feat) add cost tracking stable diffusion 3 on Bedrock (#6676 ) * add cost tracking for sd3 * test_image_generation_bedrock * fix get model info for image cost * add cost_calculator for stability 1 models * add unit testing for bedrock image cost calc * test_cost_calculator_with_no_optional_params * add test_cost_calculator_basic * correctly allow size Optional * fix cost_calculator * sd3 unit tests cost calc	2024-11-11 20:21:44 -08:00
Ishaan Jaff	e5051a93a8	(docs) add benchmarks on 1K RPS (#6704 ) * docs litellm proxy benchmarks * docs GCS bucket * doc fix - reduce clutter on logging doc title	2024-11-11 19:25:53 -08:00
Ishaan Jaff	4fd0c6c8f2	add xAI on Admin UI (#6680 )	2024-11-11 18:05:36 -08:00

18337 commits