Commit graph

3061 commits

Author SHA1 Message Date
Ishaan Jaff
ffde1d75d5 docs fix order of logging integrations 2025-01-22 08:45:22 -08:00
Krish Dholakia
4d89da9c97 Deepseek r1 support + watsonx qa improvements (#7907)
* fix(types/utils.py): support returning 'reasoning_content' for deepseek models

Fixes https://github.com/BerriAI/litellm/issues/7877#issuecomment-2603813218

* fix(convert_dict_to_response.py): return deepseek response in provider_specific_field

allows for separating openai vs. non-openai params in model response

* fix(utils.py): support 'provider_specific_field' in delta chunk as well

allows deepseek reasoning content chunk to be returned to user from stream as well

Fixes https://github.com/BerriAI/litellm/issues/7877#issuecomment-2603813218

* fix(watsonx/chat/handler.py): fix passing space id to watsonx on chat route

* fix(watsonx/): fix watsonx_text/ route with space id

* fix(watsonx/): qa item - also adds better unit testing for watsonx embedding calls

* fix(utils.py): rename to '..fields'

* fix: fix linting errors

* fix(utils.py): fix typing - don't show provider-specific field if none or empty - prevents default response from being non-oai compatible

* fix: cleanup unused imports

* docs(deepseek.md): add docs for deepseek reasoning model
2025-01-21 23:13:15 -08:00
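The deepseek commit above surfaces `reasoning_content` through a provider-specific field on the model response, kept separate so the default response stays OpenAI-compatible. A minimal sketch of the shape this implies — the dict layout and field names here are assumptions read off the commit message, not litellm's actual types:

```python
from typing import Optional

def extract_reasoning(choice: dict) -> Optional[str]:
    """Pull deepseek 'reasoning_content' out of the provider-specific field,
    returning None when the field is absent or empty (matching the commit's
    typing fix: empty provider-specific data should not appear at all)."""
    provider_fields = choice.get("message", {}).get("provider_specific_fields") or {}
    return provider_fields.get("reasoning_content") or None

# Hypothetical choice dict shaped like the commit describes
choice = {
    "message": {
        "content": "The answer is 42.",
        "provider_specific_fields": {"reasoning_content": "First, consider..."},
    }
}
print(extract_reasoning(choice))                          # reasoning text
print(extract_reasoning({"message": {"content": "hi"}}))  # None
```

Per the streaming fix in the same commit, the same field would also appear on delta chunks, so a stream consumer could apply the same extraction per chunk.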
Ishaan Jaff
b39c2eb226 fix litellm_overhead_latency_metric 2025-01-21 21:33:31 -08:00
Yuki Watanabe
6bdc722007 Update MLflow callback and documentation (#7809)
* Update MLflow tracer

Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>

* doc update

Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>

* doc update

Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>

* image rename

Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>

---------

Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>
2025-01-21 20:56:48 -08:00
Krish Dholakia
94c9f76767 Litellm dev 01 20 2025 p3 (#7890)
* fix(router.py): pass stream timeout correctly for non openai / azure models

Fixes https://github.com/BerriAI/litellm/issues/7870

* test(test_router_timeout.py): add test for streaming

* test(test_router_timeout.py): add unit testing for new router functions

* docs(ollama.md): link to section on calling ollama within docker container

* test: remove redundant test

* test: fix test to include timeout value

* docs(config_settings.md): document new router settings param
2025-01-20 21:46:36 -08:00
Krish Dholakia
4c1d4acabc Litellm dev 01 20 2025 p1 (#7884)
* fix(initial-test-to-return-api-timeout-value-in-openai-timeout-exception): Makes it easier for user to debug why request timed out

* feat(openai.py): return timeout value + time taken on openai timeout errors

helps debug timeout errors

* fix(utils.py): fix num retries extraction logic when num_retries = 0

* fix(config_settings.md): litellm_logging.py

support printing payload to console if 'LITELLM_PRINT_STANDARD_LOGGING_PAYLOAD' is true

Enables easier debugging

* test(test_auth_checks.py): remove common checks userapikeyauth enforcement check

* fix(litellm_logging.py): fix linting error
2025-01-20 21:45:48 -08:00
King
1ede4c2077 Fix typo Update alerting.md (#7880) 2025-01-20 08:04:03 -08:00
Ishaan Jaff
e9289736f5 docs - Custom Retention Policies 2025-01-20 07:29:48 -08:00
Ishaan Jaff
4990001a50 docs Data Retention Policy 2025-01-20 07:00:38 -08:00
Krish Dholakia
f558f6ce7c JWT Auth - enforce_rbac support + UI team view, spend calc fix (#7863)
* fix(user_dashboard.tsx): fix spend calculation when team selected

sum all team keys, not user keys

* docs(admin_ui_sso.md): fix docs tabbing

* feat(user_api_key_auth.py): introduce new 'enforce_rbac' param on jwt auth

allows proxy admin to prevent any unmapped yet authenticated jwt tokens from calling proxy

Fixes https://github.com/BerriAI/litellm/issues/6793

* test: more unit testing + refactoring

* fix: fix returning id when obj not found in db

* fix(user_api_key_auth.py): add end user id tracking from jwt auth

* docs(token_auth.md): add doc on rbac with JWTs

* fix: fix unused params

* test: remove old test
2025-01-19 21:28:55 -08:00
Krish Dholakia
e3e1fe59da feat(health_check.py): set upperbound for api when making health check call (#7865)
* feat(health_check.py): set upperbound for api when making health check call

prevent bad model from health check to hang and cause pod restarts

* fix(health_check.py): cleanup task once completed

* fix(constants.py): bump default health check timeout to 1min

* docs(health.md): add 'health_check_timeout' to health docs on litellm

* build(proxy_server_config.yaml): add bad model to health check
2025-01-18 19:47:43 -08:00
Ishaan Jaff
62a5aa758a docs data sec 2025-01-18 17:44:02 -08:00
Ishaan Jaff
58016f0468 litellm security page 2025-01-18 17:24:39 -08:00
Ishaan Jaff
84dbb87418 docs Security Certifications 2025-01-18 17:12:42 -08:00
Ishaan Jaff
b6f8bd3cb6 docs data privacy 2025-01-18 17:01:57 -08:00
Ishaan Jaff
a23a36775f ui release note 2025-01-17 20:27:53 -08:00
Ishaan Jaff
1e5f3e2970 ui logs - view messages / responses 2025-01-17 20:20:49 -08:00
Ishaan Jaff
bc6a9cd29c [Hashicorp - secret manager] - use vault namespace for tls auth (#7834)
* hcorp - use x-vault-namespace

* _get_tls_cert_auth_body

* HCP_VAULT_CERT_ROLE

* test_hashicorp_secret_manager_tls_cert_auth

* HCP_VAULT_CERT_ROLE
2025-01-17 19:27:56 -08:00
Nikolaiev Dmytro
b51c46200c Update instructor tutorial (#7784) 2025-01-15 15:10:50 -08:00
Hugues Chocart
c919e3ca45 [integrations/lunary] Improve Lunary documentation (#7770)
* update lunary doc

* better title

* tweaks

* Update langchain.md

* Update lunary_integration.md
2025-01-15 15:00:25 -08:00
Ishaan Jaff
fadc224fcd docs iam role based access for bedrock (#7774) 2025-01-14 19:02:02 -08:00
Krish Dholakia
d7a13ad561 Support temporary budget increases on keys (#7754)
* fix(gpt_transformation.py): fix response_format translation check for 4o models

Fixes https://github.com/BerriAI/litellm/issues/7616

* feat(key_management_endpoints.py): support 'temp_budget_increase' and 'temp_budget_expiry' fields

Allow proxy admin to grant temporary budget increases to keys

* fix(proxy/_types.py): enforce temp_budget_increase and temp_budget_expiry are always passed together

* feat(user_api_key_auth.py): initial working temp budget increase logic

ensures key budget exceeded error checks for temp budget in key metadata

* feat(proxy_server.py): return the key max budget and key spend in the response headers

Allows clientside user to know their remaining limits

* test: add unit testing for new proxy utils

Ensures new key budget is correctly handled

* docs(temporary_budget_increase.md): add doc on temporary budget increase

* fix(utils.py): remove 3.5 from response_format check for now

not all azure 3.5 models support response_format

* fix(user_api_key_auth.py): return valid user api key auth object on all paths
2025-01-14 17:03:11 -08:00
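The temporary-budget commit above introduces two key fields, `temp_budget_increase` and `temp_budget_expiry`, enforces that they are always passed together, and has the budget-exceeded check consult them from key metadata. A hypothetical sketch of that check — the field names come from the commit, but the storage location and the exact lapse logic are assumptions for illustration:

```python
from datetime import datetime, timedelta, timezone

def effective_max_budget(key_max_budget: float, key_metadata: dict) -> float:
    """Return the key's budget including any unexpired temporary increase.
    Both fields must be present together, mirroring the commit's validation
    that temp_budget_increase and temp_budget_expiry are always paired."""
    increase = key_metadata.get("temp_budget_increase")
    expiry = key_metadata.get("temp_budget_expiry")
    if increase is None or expiry is None:
        return key_max_budget
    if datetime.now(timezone.utc) >= expiry:
        return key_max_budget  # temporary increase has lapsed
    return key_max_budget + increase

meta = {
    "temp_budget_increase": 50.0,
    "temp_budget_expiry": datetime.now(timezone.utc) + timedelta(days=1),
}
print(effective_max_budget(100.0, meta))  # -> 150.0
print(effective_max_budget(100.0, {}))    # -> 100.0
```

The same commit also returns the key's max budget and spend in response headers, so a client could compute its remaining limit from those values directly.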
Krish Dholakia
000d3152a8 Litellm dev 01 14 2025 p1 (#7771)
* First-class Aim Guardrails support (#7738)

* initial aim support

* add tests

* docs(langsmith_integration.md): cleanup

* style: cleanup unused imports

---------

Co-authored-by: Tomer Bin <117278227+hxtomer@users.noreply.github.com>
2025-01-14 16:18:21 -08:00
Ishaan Jaff
68c2c6ce7f docs benchmark 2025-01-14 10:48:43 -08:00
Ishaan Jaff
106b9a6439 update benchmarks 2025-01-14 10:45:28 -08:00
Ishaan Jaff
f30c87f4f0 (fix) health check - allow setting health_check_model (#7752)
* use _update_litellm_params_for_health_check

* fix Wildcard Routes

* test_update_litellm_params_for_health_check

* test_perform_health_check_with_health_check_model

* fix doc string

* huggingface/mistralai/Mistral-7B-Instruct-v0.3
2025-01-13 20:16:44 -08:00
Krish Dholakia
01e2e26bd1 add azure o1 pricing (#7715)
* build(model_prices_and_context_window.json): add azure o1 pricing

Closes https://github.com/BerriAI/litellm/issues/7712

* refactor: replace regex with string method for whitespace check in stop-sequences handling (#7713)

* Allows overriding keep_alive time in ollama (#7079)

* Allows overriding keep_alive time in ollama

* Also adds to ollama_chat

* Adds some info on the docs about this parameter

* fix: together ai warning (#7688)

Co-authored-by: Carl Senze <carl.senze@aleph-alpha.com>

* fix(proxy_server.py): handle config containing thread locked objects when using get_config_state

* fix(proxy_server.py): add exception to debug

* build(model_prices_and_context_window.json): update 'supports_vision' for azure o1

---------

Co-authored-by: Wolfram Ravenwolf <52386626+WolframRavenwolf@users.noreply.github.com>
Co-authored-by: Regis David Souza Mesquita <github@rdsm.dev>
Co-authored-by: Carl <45709281+capsenz@users.noreply.github.com>
Co-authored-by: Carl Senze <carl.senze@aleph-alpha.com>
2025-01-12 18:15:35 -08:00
Krrish Dholakia
9ebb8a8795 docs(enterprise.md): cleanup docs and add faq 2025-01-11 10:46:55 -08:00
Krrish Dholakia
34e10ba4da docs(enterprise.md): clarify sla for patching vulnerabilities 2025-01-11 10:42:32 -08:00
Krish Dholakia
513874858b fix(model_hub.tsx): clarify cost in model hub is per 1m tokens (#7687)
* fix(model_hub.tsx): clarify cost in model hub is per 1m tokens

* docs: test blog

* docs: improve release note docs

* docs(docs/): new stable release doc

* docs(docs/): specify date in all posts

* docs(docs/): add git diff to stable release docs
2025-01-11 09:57:09 -08:00
Krrish Dholakia
f39adc00e3 docs: new release notes 2025-01-10 22:49:20 -08:00
Krrish Dholakia
179507e5c3 docs(logging.md): add docs on s3 bucket logging with team alias prefix 2025-01-10 22:28:05 -08:00
Krish Dholakia
953c021aa7 Litellm dev 01 10 2025 p3 (#7682)
* feat(langfuse.py): log the used prompt when prompt management used

* test: fix test

* docs(self_serve.md): add doc on restricting personal key creation on ui

* feat(s3.py): support s3 logging with team alias prefixes (if available)

New preview feature

* fix(main.py): remove old if block - simplify to just await if coroutine returned

fixes lm_studio async embedding error

* fix(langfuse.py): handle get prompt check
2025-01-10 21:56:42 -08:00
Krish Dholakia
e54d23c919 Litellm dev 01 10 2025 p2 (#7679)
* test(test_basic_python_version.py): assert all optional dependencies are marked as extras on poetry

Fixes https://github.com/BerriAI/litellm/issues/7677

* docs(secret.md): clarify 'read_and_write' secret manager usage on aws

* docs(secret.md): fix doc

* build(ui/teams.tsx): add edit/delete button for updating user / team membership on ui

allows updating user role to admin on ui

* build(ui/teams.tsx): display edit member component on ui, when edit button on member clicked

* feat(team_endpoints.py): support updating team member role to admin via api endpoints

allows team member to become admin post-add

* build(ui/user_dashboard.tsx): if team admin - show all team keys

Fixes https://github.com/BerriAI/litellm/issues/7650

* test(config.yml): add tomli to ci/cd

* test: don't call python_basic_testing in local testing (covered by python 3.13 testing)
2025-01-10 21:50:53 -08:00
Ishaan Jaff
e0d5afbd3e fix showing release notes 2025-01-10 20:40:50 -08:00
Krish Dholakia
ebc66c1e1e LiteLLM Minor Fixes & Improvements (01/10/2025) - p1 (#7670)
* test(test_get_model_info.py): add unit test confirming router deployment updates global 'get_model_info'

* fix(get_supported_openai_params.py): fix custom llm provider 'get_supported_openai_params'

Fixes https://github.com/BerriAI/litellm/issues/7668

* docs(azure.md): clarify how azure ad token refresh on proxy works

Closes https://github.com/BerriAI/litellm/issues/7665
2025-01-10 17:49:05 -08:00
Krrish Dholakia
525f015af9 docs(config_settings.md): update docs to include new athina env var 2025-01-10 10:46:12 -08:00
vivek-athina
eba95acb2a Use environment variable for Athina logging URL (#7628)
* Use environment variable for Athina logging URL

* Added to docs as well

* Changed the env var name
2025-01-10 07:47:12 -08:00
Krish Dholakia
75c3ddfc9e fix(vertex_ai/gemini/transformation.py): handle 'http://' in gemini p… (#7660)
* fix(vertex_ai/gemini/transformation.py): handle 'http://' in gemini process url

* refactor(router.py): refactor '_prompt_management_factory' to use logging obj get_chat_completion logic

deduplicates code

* fix(litellm_logging.py): update 'get_chat_completion_prompt' to update logging object messages

* docs(prompt_management.md): update prompt management to be in beta

given feedback - this still needs to be revised (e.g. passing in user message, not ignoring)

* refactor(prompt_management_base.py): introduce base class for prompt management

allows consistent behaviour across prompt management integrations

* feat(prompt_management_base.py): support adding client message to template message + refactor langfuse prompt management to use prompt management base

* fix(litellm_logging.py): log prompt id + prompt variables to langfuse if set

allows tracking what prompt was used for what purpose

* feat(litellm_logging.py): log prompt management metadata in standard logging payload + use in langfuse

allows logging prompt id / prompt variables to langfuse

* test: fix test

* fix(router.py): cleanup unused imports

* fix: fix linting error

* fix: fix trace param typing

* fix: fix linting errors

* fix: fix code qa check
2025-01-10 07:31:59 -08:00
Krish Dholakia
afdcbe3d64 fix(main.py): fix lm_studio/ embedding routing (#7658)
* fix(main.py): fix lm_studio/ embedding routing

adds the mapping + updates docs with example

* docs(self_serve.md): update doc to show how to auto-add sso users to teams

* fix(streaming_handler.py): simplify async iterator check, to just check if streaming response is an async iterable
2025-01-09 23:03:24 -08:00
Ishaan Jaff
19cac744f8 (Feat - Batches API) add support for retrieving vertex api batch jobs (#7661)
* add _async_retrieve_batch

* fix aretrieve_batch

* fix _get_batch_id_from_vertex_ai_batch_response

* fix batches docs
2025-01-09 18:35:03 -08:00
Krrish Dholakia
3c62c2f068 docs(intro.md): add a section on 'why pass through endpoints'
helps proxy admin understand when these would be useful
2025-01-08 19:15:41 -08:00
Ishaan Jaff
74873317c2 (feat) - allow building litellm proxy from pip package (#7633)
* fix working build from pip

* add tests for proxy_build_from_pip_tests

* doc clean up for deployment

* docs cleanup

* docs build from pip

* fix cd docker/build_from_pip
2025-01-08 16:36:57 -08:00
Ishaan Jaff
0587a5dcfe fix docs 2025-01-08 12:51:59 -08:00
Ishaan Jaff
de11c1aa77 update load test docs 2025-01-08 12:48:21 -08:00
Ishaan Jaff
3090ce7a19 sort rn 2025-01-08 12:16:01 -08:00
Ishaan Jaff
f755ad4b08 docs v1.57.3 2025-01-08 12:08:19 -08:00
Krish Dholakia
d413f31fb7 Litellm dev 01 07 2025 p3 (#7635)
* fix(__init__.py): fix mistral large tool calling

map bedrock mistral large to converse endpoint

Fixes https://github.com/BerriAI/litellm/issues/7521

* braintrust logging: respect project_id, add more metrics + more (#7613)

* braintrust logging: respect project_id, add more metrics

* braintrust logger: improve json formatting

* braintrust logger: add test for passing specific project_id

* rm unneeded import

* braintrust logging: rm unneeded var in tests

* add project_name

* update docs

---------

Co-authored-by: H <no@email.com>

---------

Co-authored-by: hi019 <65871571+hi019@users.noreply.github.com>
Co-authored-by: H <no@email.com>
2025-01-08 11:46:24 -08:00
Ishaan Jaff
a6fd407e7c update docs 2025-01-07 22:35:07 -08:00
Krrish Dholakia
9146b4c7e7 docs: cleanup keys 2025-01-06 21:57:18 -08:00