* test_openai_assistants_e2e_operations
* test openai assistants pass through
* fix GET request on pass through handler
* _make_non_streaming_http_request
* _is_assistants_api_request
* test_openai_assistants_e2e_operations
* test_openai_assistants_e2e_operations
* openai_proxy_route
* docs openai pass through
* docs openai pass through
* docs openai pass through
* test pass through handler
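The OpenAI pass-through flow above can be exercised roughly like this (a hedged sketch: the proxy URL, the `/openai` mount path, and the key are illustrative assumptions, not confirmed by these commits):

```python
# Minimal sketch of hitting the OpenAI Assistants pass-through via the proxy.
import requests

PROXY_BASE = "http://localhost:4000"                   # hypothetical proxy address
headers = {"Authorization": "Bearer sk-litellm-key"}   # hypothetical virtual key

# Create an assistant; the handler forwards the request to the upstream OpenAI API
resp = requests.post(
    f"{PROXY_BASE}/openai/v1/assistants",
    headers=headers,
    json={"model": "gpt-4o", "name": "demo-assistant"},
)
print(resp.json())

# GET requests are forwarded too (see the GET fix above)
resp = requests.get(f"{PROXY_BASE}/openai/v1/assistants", headers=headers)
print(resp.json())
```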
* Potential fix for code scanning alert no. 2240: Incomplete URL substring sanitization
---------
Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
* feat(bedrock/rerank): infer model region if model given as arn
* test: add unit testing to ensure bedrock region name inferred from arn on rerank
* feat(bedrock/rerank/transformation.py): include search units for bedrock rerank result
Resolves https://github.com/BerriAI/litellm/issues/7258#issuecomment-2671557137
* test(test_bedrock_completion.py): add testing for bedrock cohere rerank
* feat(cost_calculator.py): refactor rerank cost tracking to support bedrock cost tracking
* build(model_prices_and_context_window.json): add amazon.rerank model to model cost map
* fix(cost_calculator.py): use bedrock/common_utils.py
get base model from a model given as an ARN -> handles rerank models
* build(model_prices_and_context_window.json): add bedrock cohere rerank pricing
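A hedged sketch of the bedrock rerank flow these commits describe; the ARN, query, and documents are placeholders, and the region is meant to be inferred from the ARN:

```python
# Illustrative only -- model ARN and inputs are made up.
import litellm

response = litellm.rerank(
    model="bedrock/arn:aws:bedrock:us-west-2::foundation-model/amazon.rerank-v1:0",
    query="What is the capital of France?",
    documents=["Paris is the capital of France.", "Berlin is in Germany."],
    top_n=1,
)
# Search units returned by bedrock feed the new rerank cost tracking
print(response.results)
```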
* feat(bedrock/rerank): migrate bedrock config to basererank config
* Revert "feat(bedrock/rerank): migrate bedrock config to basererank config"
This reverts commit 84fae1f167.
* test: add testing to ensure large doc / queries are correctly counted
* Revert "test: add testing to ensure large doc / queries are correctly counted"
This reverts commit 4337f1657e.
* fix(migrate-jina-ai-to-rerank-config): enables cost tracking
* refactor(jina_ai/): finish migrating jina ai to base rerank config
enables cost tracking
* fix(jina_ai/rerank): e2e jina ai rerank cost tracking
* fix: cleanup dead code
* fix: fix python3.8 compatibility error
* test: fix test
* test: add e2e testing for azure ai rerank
* fix: fix linting error
* test: mark cohere as flaky
* feat(litellm_pre_call_utils.py): support `x-litellm-tags` request header
allow tag based routing + spend tracking via request headers
* docs(request_headers.md): document new `x-litellm-tags` for tag based routing and spend tracking
* docs(tag_routing.md): add to docs
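A minimal sketch of tag-based routing via the new header; the proxy URL, key, and tag values are illustrative assumptions:

```python
# Send `x-litellm-tags` on every request for tag-based routing + spend tracking.
import openai

client = openai.OpenAI(
    base_url="http://localhost:4000",                    # hypothetical LiteLLM proxy
    api_key="sk-litellm-key",                            # hypothetical virtual key
    default_headers={"x-litellm-tags": "teamA,prod"},    # made-up tag values
)
resp = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "hello"}],
)
```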
* fix(utils.py): only pass str values for openai metadata param
* fix(utils.py): drop non-str values for metadata param to openai
preview feature; an OTEL span was being sent in
* fix(azure/chat/gpt_transformation.py): add 'prediction' as a supported azure param
Closes https://github.com/BerriAI/litellm/issues/8500
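A hedged sketch of passing the newly supported `prediction` param through to Azure; the deployment name and content are placeholders:

```python
# Predicted-outputs style request (OpenAI `prediction` shape), routed to Azure.
import litellm

code = "def add(a, b):\n    return a + b\n"
resp = litellm.completion(
    model="azure/my-gpt-4o-deployment",                  # hypothetical deployment
    messages=[{"role": "user", "content": f"Rename add to sum_two:\n{code}"}],
    prediction={"type": "content", "content": code},
)
```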
* build(model_prices_and_context_window.json): add new 'gemini-2.0-pro-exp-02-05' model
* style: cleanup invalid json trailing comma
* feat(utils.py): support passing 'tokenizer_config' to register_prompt_template
enables passing complete tokenizer config of model to litellm
Allows calling deepseek on bedrock with the correct prompt template
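A minimal sketch, assuming `tokenizer_config` is accepted as a keyword and mirrors the model's Hugging Face tokenizer_config.json (model name and chat_template below are placeholders, and the template is truncated):

```python
# Register a complete tokenizer config so litellm applies the right prompt template.
import litellm

litellm.register_prompt_template(
    model="bedrock/my-imported-deepseek",  # hypothetical custom model name
    tokenizer_config={
        "bos_token": "<s>",
        "eos_token": "</s>",
        "chat_template": "{% for message in messages %}...{% endfor %}",  # truncated
    },
)
```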
* fix(utils.py): fix register_prompt_template for custom model names
* test(test_prompt_factory.py): fix test
* test(test_completion.py): add e2e test for bedrock invoke deepseek ft model
* feat(base_invoke_transformation.py): support hf_model_name param for bedrock invoke calls
enables proxy admin to set the base model for a fine-tuned bedrock deepseek model
* feat(bedrock/invoke): support deepseek_r1 route for bedrock
makes it easy to apply the right chat template to that call
* feat(constants.py): store deepseek r1 chat template - allow user to get correct response from deepseek r1 without extra work
* test(test_completion.py): add e2e mock test for bedrock deepseek
* docs(bedrock.md): document new deepseek_r1 route for bedrock
allows us to use the right config
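Calling the new route could look like this (a hedged sketch: the imported-model ARN is a placeholder, and the exact model-string format is an assumption based on the commits above):

```python
# The deepseek_r1 invoke route applies the stored R1 chat template automatically.
import litellm

resp = litellm.completion(
    model="bedrock/deepseek_r1/arn:aws:bedrock:us-east-1:000000000000:imported-model/example",
    messages=[{"role": "user", "content": "Explain quicksort."}],
)
```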
* fix(exception_mapping_utils.py): catch read operation timeout
* fix(client_initialization_utils.py): handle custom llm provider set with valid value not from model name
* fix(handle_jwt.py): handle groups not existing in jwt token
if the user is not in a group, this claim won't exist
* fix(handle_jwt.py): add new `enforce_team_based_model_access` flag to jwt auth
allows proxy admin to enforce user can only call model if team has access
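Conceptually, `enforce_team_based_model_access` enforces a check along these lines (a simplified sketch, not the actual handle_jwt.py code; the team shape is illustrative):

```python
# Simplified sketch only -- not LiteLLM's implementation.
# A user may call a model only if one of their teams grants access to it.
def team_grants_model(user_teams, requested_model):
    return any(requested_model in team.get("models", []) for team in user_teams)

teams = [{"team_id": "t1", "models": ["gpt-4o"]}]
assert team_grants_model(teams, "gpt-4o")
assert not team_grants_model(teams, "claude-3-5-sonnet")
```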
* feat(navbar.tsx): expose new dropdown in navbar - allow org admin to create teams within org context
* fix(navbar.tsx): remove non-functional CogIcon
* fix(proxy/utils.py): include user-org memberships in `/user/info` response
return orgs user is a member of and the user role within org
* feat(organization_endpoints.py): allow internal user to query `/organizations/list` and get all orgs they belong to
enables org admin to select org they belong to, to create teams
* fix(navbar.tsx): show change in ui when org switcher clicked
* feat(page.tsx): update user role based on org they're in
allows org admin to create teams in the org context
* feat(teams.tsx): working e2e flow for allowing org admin to add new teams
* style(navbar.tsx): clarify switching orgs on UI is in BETA
* fix(organization_endpoints.py): handle getting but not setting members
* test: fix test
* fix(client_initialization_utils.py): revert custom llm provider handling fix - causing unintended issues
* docs(token_auth.md): cleanup docs
* feat(handle_jwt.py): initial commit to allow scope based model access
* feat(handle_jwt.py): allow model access based on token scopes
allow admin to control model access from IDP
* test(test_jwt.py): add unit testing for scope based model access
* docs(token_auth.md): add scope based model access to docs
* docs(token_auth.md): update docs
* docs(token_auth.md): update docs
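Roughly, scope-based model access maps token scopes to models; a conceptual sketch only (not the actual handle_jwt.py logic, and the scope naming convention is made up):

```python
# Conceptual sketch; the scope-to-model mapping format is an assumption.
def models_from_scopes(scope_claim, scope_model_map):
    allowed = set()
    for scope in scope_claim.split():
        allowed.update(scope_model_map.get(scope, []))
    return allowed

scope_map = {"litellm.api.gpt_4o": ["gpt-4o"]}
print(models_from_scopes("openid profile litellm.api.gpt_4o", scope_map))  # {'gpt-4o'}
```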
* build: add gemini commercial rate limits
* fix: fix linting error
* fix(utils.py): handle key error in msg validation
* Support running Aim Guard during LLM call (#7918)
* support running Aim Guard during LLM call
* Rename header
* adjust docs and fix type annotations
* fix(timeout.md): doc fix for openai example on dynamic timeouts
---------
Co-authored-by: Tomer Bin <117278227+hxtomer@users.noreply.github.com>
* Added a guide for users who want to use LiteLLM with the AI/ML API provider.
* Minor changes
* Minor changes
* Fix sidebars.js
---------
Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>
* feat(proxy/_types.py): add new jwt field params
allows users + services to auth into proxy
* feat(handle_jwt.py): allow team role proxy access
allows proxy admin to set allowed team roles
* fix(proxy/_types.py): add 'routes' to role based permissions
allow proxy admin to restrict what routes a team can access easily
* feat(handle_jwt.py): support more flexible role based route access
v2 on role based 'allowed_routes'
* test(test_jwt.py): add unit test for rbac for proxy routes
* feat(handle_jwt.py): ensure cost tracking always works for any jwt request with `enforce_rbac=True`
* docs(token_auth.md): add documentation on controlling model access via OIDC Roles
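The role-based route access can be pictured like this (a conceptual sketch under an assumed config shape, not the proxy's actual code):

```python
# Conceptual sketch; the role-to-routes mapping is an illustrative assumption.
import fnmatch

ROLE_ROUTES = {"proxy_admin": ["*"], "team": ["/chat/completions", "/key/*"]}

def route_allowed(role, route):
    return any(fnmatch.fnmatch(route, pat) for pat in ROLE_ROUTES.get(role, []))

assert route_allowed("team", "/key/generate")
assert not route_allowed("team", "/user/delete")
```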
* test: increase time delay before retrying
* test: handle model overloaded for test
* add assembly ai pass through request
* fix assembly pass through
* fix test_assemblyai_basic_transcribe
* fix assemblyai auth check
* test_assemblyai_transcribe_with_non_admin_key
* working assembly ai test
* working assembly ai proxy route
* use helper func to pass through logging
* clean up logging assembly ai
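The AssemblyAI pass-through can be exercised roughly as follows (a hedged sketch: the proxy URL, the `/assemblyai` mount path, and the key are illustrative assumptions):

```python
# Submit a transcription job through the proxy, then fetch the transcript.
import requests

PROXY_BASE = "http://localhost:4000"                   # hypothetical proxy address
headers = {"Authorization": "Bearer sk-litellm-key"}   # hypothetical virtual key

resp = requests.post(
    f"{PROXY_BASE}/assemblyai/v2/transcript",
    headers=headers,
    json={"audio_url": "https://example.com/audio.mp3"},
)
transcript_id = resp.json().get("id")

# The GET path exercised by test_assemblyai_proxy_route_get_transcript
resp = requests.get(f"{PROXY_BASE}/assemblyai/v2/transcript/{transcript_id}", headers=headers)
```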
* test: update test to handle gemini token counter change
* fix(factory.py): fix bedrock http:// handling
* add unit testing for assembly pt handler
* docs assembly ai pass through endpoint
* fix proxy_pass_through_endpoint_tests
* fix standard_passthrough_logging_object
* fix ASSEMBLYAI_API_KEY
* test test_assemblyai_proxy_route_basic_post
* test_assemblyai_proxy_route_get_transcript
* fix is_assemblyai_route
* test_is_assemblyai_route
---------
Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>
* test(base_llm_unit_tests.py): add test to ensure drop params is respected
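A short sketch of the behavior under test: with `drop_params=True`, OpenAI params the provider doesn't support are dropped instead of raising (model and param choice here are illustrative):

```python
# Unsupported params are silently dropped rather than erroring.
import litellm

resp = litellm.completion(
    model="ollama/llama3",                   # hypothetical provider/model
    messages=[{"role": "user", "content": "hi"}],
    parallel_tool_calls=True,                # not supported by every provider
    drop_params=True,
)
```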
* fix(types/prometheus.py): use typing_extensions for python3.8 compatibility
* build: add cherry picked commits
* Refresh VoyageAI models and prices and context
* Refresh VoyageAI models and prices and context
* Refresh VoyageAI models and prices and context
* Updating the available VoyageAI models in the docs
* fix: support azure o3 model family for fake streaming workaround (#8162)
* fix: support azure o3 model family for fake streaming workaround
* refactor: rename helper to is_o_series_model for clarity
* update function calling parameters for o3 models (#8178)
* refactor(o1_transformation.py): refactor o1 config to be o series config, expand o series model check to o3
ensures max_tokens is correctly translated for o3
* feat(openai/): refactor o1 files to be 'o_series' files
expands naming to cover o3
* fix(azure/chat/o1_handler.py): azure openai is an instance of openai - was causing resets
* test(test_azure_o_series.py): assert stream faked for azure o3 mini
Resolves https://github.com/BerriAI/litellm/pull/8162
* fix(o1_transformation.py): fix o1 transformation logic to handle explicit o1_series routing
* docs(azure.md): update doc with `o_series/` model name
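A hedged sketch of the o-series behavior above: `stream=True` against an Azure o3 deployment still yields chunks, even though streaming is faked under the hood (the deployment name is a placeholder):

```python
# Streaming is faked for Azure o-series models per the fix above.
import litellm

for chunk in litellm.completion(
    model="azure/o3-mini",                   # hypothetical deployment name
    messages=[{"role": "user", "content": "hello"}],
    stream=True,
):
    print(chunk.choices[0].delta.content or "", end="")
```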
---------
Co-authored-by: byrongrogan <47910641+byrongrogan@users.noreply.github.com>
Co-authored-by: Low Jian Sheng <15527690+lowjiansheng@users.noreply.github.com>
* docs(token_auth.md): clarify title
* refactor(handle_jwt.py): add jwt auth manager + refactor to handle groups
allows user to call model if user belongs to group with model access
* refactor(handle_jwt.py): refactor to first check if service call then check user call
* feat(handle_jwt.py): new `enforce_team_access` param
only allows user to call model if a team they belong to has model access
allows controlling user model access by team
* fix(handle_jwt.py): fix error string, remove unnecessary param
* docs(token_auth.md): add controlling model access for jwt tokens via teams to docs
* test: fix tests post refactor
* fix: fix linting errors
* fix: fix linting error
* test: fix import error
* add support for using llama spec with bedrock
* fix get_bedrock_invoke_provider
* add support for using bedrock provider in mappings
* working request
* test_bedrock_custom_deepseek
* test_bedrock_custom_deepseek
* fix _get_model_id_for_llama_like_model
* test_bedrock_custom_deepseek
* doc DeepSeek-R1-Distill-Llama-70B
* test_bedrock_custom_deepseek
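Calling an imported DeepSeek distill model via the llama spec described above might look like this (a hedged sketch: the ARN is a placeholder and the `bedrock/llama/...` model-string format is an assumption from these commits):

```python
# Route an imported DeepSeek-R1-Distill-Llama model through the bedrock llama spec.
import litellm

resp = litellm.completion(
    model="bedrock/llama/arn:aws:bedrock:us-east-1:000000000000:imported-model/example",
    messages=[{"role": "user", "content": "hi"}],
)
```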