litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 03:04:13 +00:00

Author	SHA1	Message	Date
Krish Dholakia	ec5a354eac	add azure o1 pricing (#7715 ) * build(model_prices_and_context_window.json): add azure o1 pricing Closes https://github.com/BerriAI/litellm/issues/7712 * refactor: replace regex with string method for whitespace check in stop-sequences handling (#7713) * Allows overriding keep_alive time in ollama (#7079) * Allows overriding keep_alive time in ollama * Also adds to ollama_chat * Adds some info on the docs about this parameter * fix: together ai warning (#7688) Co-authored-by: Carl Senze <carl.senze@aleph-alpha.com> * fix(proxy_server.py): handle config containing thread locked objects when using get_config_state * fix(proxy_server.py): add exception to debug * build(model_prices_and_context_window.json): update 'supports_vision' for azure o1 --------- Co-authored-by: Wolfram Ravenwolf <52386626+WolframRavenwolf@users.noreply.github.com> Co-authored-by: Regis David Souza Mesquita <github@rdsm.dev> Co-authored-by: Carl <45709281+capsenz@users.noreply.github.com> Co-authored-by: Carl Senze <carl.senze@aleph-alpha.com>	2025-01-12 18:15:35 -08:00
Krrish Dholakia	3062564488	docs(enterprise.md): cleanup docs and add faq All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 12s Details	2025-01-11 10:46:55 -08:00
Krrish Dholakia	d988bfb6f8	docs(enterprise.md): clarify sla for patching vulnerabilities	2025-01-11 10:42:32 -08:00
Krish Dholakia	5e537fbdb1	fix(model_hub.tsx): clarify cost in model hub is per 1m tokens (#7687 ) * fix(model_hub.tsx): clarify cost in model hub is per 1m tokens * docs: test blog * docs: improve release note docs * docs(docs/): new stable release doc * docs(docs/): specify date in all posts * docs(docs/): add git diff to stable release docs	2025-01-11 09:57:09 -08:00
Krrish Dholakia	9a1c050cf7	docs: new release notes All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 41s Details	2025-01-10 22:49:20 -08:00
Krrish Dholakia	f2ca244766	docs(logging.md): docs(logging.md): add docs on s3 bucket logging with team alias prefix	2025-01-10 22:28:05 -08:00
Krish Dholakia	27892acdfc	Litellm dev 01 10 2025 p3 (#7682 ) * feat(langfuse.py): log the used prompt when prompt management used * test: fix test * docs(self_serve.md): add doc on restricting personal key creation on ui * feat(s3.py): support s3 logging with team alias prefixes (if available) New preview feature * fix(main.py): remove old if block - simplify to just await if coroutine returned fixes lm_studio async embedding error * fix(langfuse.py): handle get prompt check	2025-01-10 21:56:42 -08:00
Krish Dholakia	c4780479a9	Litellm dev 01 10 2025 p2 (#7679 ) * test(test_basic_python_version.py): assert all optional dependencies are marked as extras on poetry Fixes https://github.com/BerriAI/litellm/issues/7677 * docs(secret.md): clarify 'read_and_write' secret manager usage on aws * docs(secret.md): fix doc * build(ui/teams.tsx): add edit/delete button for updating user / team membership on ui allows updating user role to admin on ui * build(ui/teams.tsx): display edit member component on ui, when edit button on member clicked * feat(team_endpoints.py): support updating team member role to admin via api endpoints allows team member to become admin post-add * build(ui/user_dashboard.tsx): if team admin - show all team keys Fixes https://github.com/BerriAI/litellm/issues/7650 * test(config.yml): add tomli to ci/cd * test: don't call python_basic_testing in local testing (covered by python 3.13 testing)	2025-01-10 21:50:53 -08:00
Ishaan Jaff	49d74748b0	fix showing release notes	2025-01-10 20:40:50 -08:00
Krish Dholakia	a3e65c9bcb	LiteLLM Minor Fixes & Improvements (01/10/2025) - p1 (#7670 ) * test(test_get_model_info.py): add unit test confirming router deployment updates global 'get_model_info' * fix(get_supported_openai_params.py): fix custom llm provider 'get_supported_openai_params' Fixes https://github.com/BerriAI/litellm/issues/7668 * docs(azure.md): clarify how azure ad token refresh on proxy works Closes https://github.com/BerriAI/litellm/issues/7665	2025-01-10 17:49:05 -08:00
Krrish Dholakia	e98c1b86f4	docs(config_settings.md): update docs to include new athina env var	2025-01-10 10:46:12 -08:00
vivek-athina	8e2653c609	Use environment variable for Athina logging URL (#7628 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 13s Details * Use environment variable for Athina logging URL * Added to docs as well * Changed the env var name	2025-01-10 07:47:12 -08:00
Krish Dholakia	c10ae8879e	fix(vertex_ai/gemini/transformation.py): handle 'http://' in gemini p… (#7660 ) * fix(vertex_ai/gemini/transformation.py): handle 'http://' in gemini process url * refactor(router.py): refactor '_prompt_management_factory' to use logging obj get_chat_completion logic deduplicates code * fix(litellm_logging.py): update 'get_chat_completion_prompt' to update logging object messages * docs(prompt_management.md): update prompt management to be in beta given feedback - this still needs to be revised (e.g. passing in user message, not ignoring) * refactor(prompt_management_base.py): introduce base class for prompt management allows consistent behaviour across prompt management integrations * feat(prompt_management_base.py): support adding client message to template message + refactor langfuse prompt management to use prompt management base * fix(litellm_logging.py): log prompt id + prompt variables to langfuse if set allows tracking what prompt was used for what purpose * feat(litellm_logging.py): log prompt management metadata in standard logging payload + use in langfuse allows logging prompt id / prompt variables to langfuse * test: fix test * fix(router.py): cleanup unused imports * fix: fix linting error * fix: fix trace param typing * fix: fix linting errors * fix: fix code qa check	2025-01-10 07:31:59 -08:00
Krish Dholakia	865e6d5bda	fix(main.py): fix lm_studio/ embedding routing (#7658 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 36s Details * fix(main.py): fix lm_studio/ embedding routing adds the mapping + updates docs with example * docs(self_serve.md): update doc to show how to auto-add sso users to teams * fix(streaming_handler.py): simplify async iterator check, to just check if streaming response is an async iterable	2025-01-09 23:03:24 -08:00
Ishaan Jaff	13f364682d	(Feat - Batches API) add support for retrieving vertex api batch jobs (#7661 ) * add _async_retrieve_batch * fix aretrieve_batch * fix _get_batch_id_from_vertex_ai_batch_response * fix batches docs	2025-01-09 18:35:03 -08:00
Krrish Dholakia	39ee4c6bb4	docs(intro.md): add a section on 'why pass through endpoints' helps proxy admin understand when these would be useful	2025-01-08 19:15:41 -08:00
Ishaan Jaff	fd0a03f719	(feat) - allow building litellm proxy from pip package (#7633 ) * fix working build from pip * add tests for proxy_build_from_pip_tests * doc clean up for deployment * docs cleanup * docs build from pip * fix cd docker/build_from_pip	2025-01-08 16:36:57 -08:00
Ishaan Jaff	43566e9842	fix docs All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 12s Details	2025-01-08 12:51:59 -08:00
Ishaan Jaff	e5717d2cb0	update load test docs	2025-01-08 12:48:21 -08:00
Ishaan Jaff	74b41d29d3	sort rn	2025-01-08 12:16:01 -08:00
Ishaan Jaff	f95439af26	docs v1.57.3	2025-01-08 12:08:19 -08:00
Krish Dholakia	a187cee538	Litellm dev 01 07 2025 p3 (#7635 ) * fix(__init__.py): fix mistral large tool calling map bedrock mistral large to converse endpoint Fixes https://github.com/BerriAI/litellm/issues/7521 * braintrust logging: respect project_id, add more metrics + more (#7613) * braintrust logging: respect project_id, add more metrics * braintrust logger: improve json formatting * braintrust logger: add test for passing specific project_id * rm unneeded import * braintrust logging: rm unneeded var in tets * add project_name * update docs --------- Co-authored-by: H <no@email.com> --------- Co-authored-by: hi019 <65871571+hi019@users.noreply.github.com> Co-authored-by: H <no@email.com>	2025-01-08 11:46:24 -08:00
Ishaan Jaff	04eb718f7a	update docs	2025-01-07 22:35:07 -08:00
Krrish Dholakia	d5a288e29e	docs: cleanup keys	2025-01-06 21:57:18 -08:00
Krrish Dholakia	16f13dd55c	docs(prompt_management.md): update docs to show how to point to load balanced model name	2025-01-06 21:09:09 -08:00
Ishaan Jaff	6125ba1e2b	(Feat) - allow including dd-trace in litellm base image (#7587 ) * introduce USE_DDTRACE=true * update dd tracer * update * bump dd trace * use og slim image * DD tracing * fix _init_dd_tracer	2025-01-06 17:27:09 -08:00
minpeter	f7931b659b	FriendliAI: Documentation Updates (#7517 ) * docs(friendliai.md): update FriendliAI documentation and model details * docs(friendliai.md): remove unused imports for cleaner documentation * feat: add support for parallel function calling, system messages, and response schema in model configuration	2025-01-04 22:44:24 -08:00
Ishaan Jaff	46d9d29bff	(Feat) Hashicorp Secret Manager - Allow storing virtual keys in secret manager (#7549 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 13s Details * use a base abstract class * async_write_secret for hcorp * fix hcorp * async_write_secret for hashicopr secret manager * store virtual keys in hcorp * add delete secret * test_hashicorp_secret_manager_write_secret * test_hashicorp_secret_manager_delete_secret * docs Supported Secret Managers * docs storing keys in hcorp * docs hcorp * docs secret managers * test_key_generate_with_secret_manager_call * fix unused imports	2025-01-04 11:35:59 -08:00
Krish Dholakia	d43d83f9ef	feat(router.py): support request prioritization for text completion c… (#7540 ) * feat(router.py): support request prioritization for text completion calls * fix(internal_user_endpoints.py): fix sql query to return all keys, including null team id keys on `/user/info` Fixes https://github.com/BerriAI/litellm/issues/7485 * fix: fix linting errors * fix: fix linting error * test(test_router_helper_utils.py): add direct test for '_schedule_factory' Fixes code qa test	2025-01-03 19:35:44 -08:00
Krish Dholakia	f770dd0c95	Support checking provider-specific `/models` endpoints for available models based on key (#7538 ) * test(test_utils.py): initial test for valid models Addresses https://github.com/BerriAI/litellm/issues/7525 * fix: test * feat(fireworks_ai/transformation.py): support retrieving valid models from fireworks ai endpoint * refactor(fireworks_ai/): support checking model info on `/v1/models` route * docs(set_keys.md): update docs to clarify check llm provider api usage * fix(watsonx/common_utils.py): support 'WATSONX_ZENAPIKEY' for iam auth * fix(watsonx): read in watsonx token from env var * fix: fix linting errors * fix(utils.py): fix provider config check * style: cleanup unused imports	2025-01-03 19:29:59 -08:00
Ishaan Jaff	1bb4941036	[Feature]: - allow print alert log to console (#7534 ) * update send_to_webhook * test_print_alerting_payload_warning * add alerting_args spec * test_alerting.py	2025-01-03 17:48:13 -08:00
Ishaan Jaff	fb59f20979	(Feat) - Hashicorp secret manager, use TLS cert authentication (#7532 ) * fix - don't print hcorp secrets in debug logs * hcorp - tls auth fixes * fix tls_ca_cert_path * test_hashicorp_secret_manager_tls_cert_auth * hcp secret docs	2025-01-03 14:23:53 -08:00
Ishaan Jaff	d3a3e45e5b	docs pass through routes All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 12s Details	2025-01-03 12:55:23 -08:00
Krish Dholakia	25e6f46910	Litellm dev 01 02 2025 p2 (#7512 ) * feat(deepgram/transformation.py): support reading in deepgram api base from env var * fix(litellm_logging.py): make skipping log message a .info easier to see * docs(logging.md): add doc on turn off all tracking/logging for a request	2025-01-02 21:57:51 -08:00
Ishaan Jaff	b9280528d3	docs enable_pre_call_checks All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 12s Details	2025-01-02 08:27:03 -08:00
Krrish Dholakia	c292f5805a	docs(humanloop.md): add humanloop docs All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 12s Details	2025-01-01 22:18:01 -08:00
Krish Dholakia	07fc394072	Litellm dev 01 01 2025 p1 (#7498 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 11s Details * refactor(prometheus.py): refactor to remove `_tag` metrics and incorporate in regular metrics * fix(prometheus.py): handle label values not set in enum values * feat(prometheus.py): working e2e custom metadata labels * docs(prometheus.md): update docs to clarify how custom metrics would work * test(test_prometheus_unit_tests.py): fix test * test: add unit testing	2025-01-01 18:59:28 -08:00
Ishaan Jaff	665fb59f48	doc update	2025-01-01 18:40:59 -08:00
Ishaan Jaff	cf60444916	(Feat) Add support for reading secrets from Hashicorp vault (#7497 ) * HashicorpSecretManager * test_hashicorp_secret_managerv * use 1 helper initialize_secret_manager * add HASHICORP_VAULT * working config * hcorp read_secret * HashicorpSecretManager * add secret_manager_testing * use 1 folder for secret manager testing * test_hashicorp_secret_manager_get_secret * HashicorpSecretManager * docs HCP secrets * update folder name * docs hcorp secret manager * remove unused imports * add conftest.py * fix tests * docs document env vars	2025-01-01 18:35:05 -08:00
Ishaan Jaff	e1fcd3ee43	(docs) Add docs on load testing benchmarks (#7499 ) * docs benchmarks * docs benchmarks	2025-01-01 18:33:20 -08:00
Ishaan Jaff	38bfefa6ef	(Feat) - LiteLLM Use `UsernamePasswordCredential` for Azure OpenAI (#7496 ) * add get_azure_ad_token_from_username_password * docs azure use username / password for auth * update doc * get_azure_ad_token_from_username_password * test test_get_azure_ad_token_from_username_password	2025-01-01 14:11:27 -08:00
Ishaan Jaff	2979b8301c	(feat) POST `/fine_tuning/jobs` support passing vertex specific hyper params (#7490 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 12s Details * update convert_openai_request_to_vertex * test_create_vertex_fine_tune_jobs_mocked * fix order of methods * update LiteLLMFineTuningJobCreate * update OpenAIFineTuningHyperparameters * update vertex hyper params in response * _transform_openai_hyperparameters_to_vertex_hyperparameters * supervised_tuning_spec["hyperParameters"] fix * fix mapping for ft params testing * docs fine tuning apis * fix test_convert_basic_openai_request_to_vertex_request * update hyperparams for create fine tuning * fix linting * test_create_vertex_fine_tune_jobs_mocked_with_hyperparameters * run ci/cd again * test_convert_basic_openai_request_to_vertex_request	2025-01-01 07:44:48 -08:00
Krish Dholakia	d984a9281a	Prometheus - custom metrics support + other improvements (#7489 ) * fix(prometheus.py): refactor litellm_input_tokens_metric to use label factory makes adding new metrics easier * feat(prometheus.py): add 'request_model' to 'litellm_input_tokens_metric' * refactor(prometheus.py): refactor 'litellm_output_tokens_metric' to use label factory makes adding new metrics easier * feat(prometheus.py): emit requested model in 'litellm_output_tokens_metric' * feat(prometheus.py): support tracking success events with custom metrics * refactor(prometheus.py): refactor '_set_latency_metrics' to just use the initially created enum values dictionary reduces scope for missing values * feat(prometheus.py): refactor all tags to support custom metadata tags enables metadata tags to be used across for e2e tracking * fix(prometheus.py): fix requested model on success event enum_values * test: fix test * test: fix test * test: handle filenotfound error * docs(prometheus.md): add new values to prometheus * docs(prometheus.md): document adding custom metrics on prometheus * bump: version 1.56.5 → 1.56.6	2025-01-01 07:41:50 -08:00
Ishaan Jaff	03b1db5a7d	(Feat) - Add PagerDuty Alerting Integration (#7478 ) * define basic types * fix verbose_logger.exception statement * fix basic alerting * test pager duty alerting * test_pagerduty_alerting_high_failure_rate * PagerDutyAlerting * async_log_failure_event * use pre_call_hook * add _request_is_completed helper util * update AlertingConfig * rename PagerDutyInternalEvent * _send_alert_if_thresholds_crossed * use pagerduty as _custom_logger_compatible_callbacks_literal * fix slack alerting imports * fix imports in slack alerting * PagerDutyAlerting * fix _load_alerting_settings * test_pagerduty_hanging_request_alerting * working pager duty alerting * fix linting * doc pager duty alerting * update hanging_response_handler * fix import location * update failure_threshold * update async_pre_call_hook * docs pagerduty * test - callback_class_str_to_classType * fix linting errors * fix linting + testing error * PagerDutyAlerting * test_pagerduty_hanging_request_alerting * fix unused imports * docs pager duty * @pytest.mark.flaky(retries=6, delay=2) * test_model_info_bedrock_converse_enforcement	2025-01-01 07:12:51 -08:00
Daniel Ko	01a108cf82	Added missing quote (#7481 )	2024-12-31 23:23:49 -08:00
Krish Dholakia	080de89cfb	Fix team-based logging to langfuse + allow custom tokenizer on `/token_counter` endpoint (#7493 ) * fix(langfuse_prompt_management.py): migrate dynamic logging to langfuse custom logger compatible class * fix(langfuse_prompt_management.py): support failure callback logging to langfuse as well * feat(proxy_server.py): support setting custom tokenizer on config.yaml Allows customizing value for `/utils/token_counter` * fix(proxy_server.py): fix linting errors * test: skip if file not found * style: cleanup unused import * docs(configs.md): add docs on setting custom tokenizer	2024-12-31 23:18:41 -08:00
Ishaan Jaff	6705e30d5d	(docs) Add docs on using Vertex with Fine Tuning APIs (#7491 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 12s Details * docs add Overview for vertex endpoints * docs add vertex ft api to docs * Advanced use case - Passing `adapter_size` to the Vertex AI API	2024-12-31 18:50:18 -08:00
Ishaan Jaff	60bdfb437f	doc on streaming usage litellm proxy	2024-12-30 21:06:34 -08:00
Ishaan Jaff	24dd6559a6	localeCompare All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 35s Details	2024-12-28 20:32:49 -08:00
Krrish Dholakia	192c3b2848	docs(index.md): fix doc link	2024-12-28 20:28:50 -08:00

... 7 8 9 10 11 ...

3401 commits