litellm

Author	SHA1	Message	Date
Krish Dholakia	60709a0753	LiteLLM Minor Fixes and Improvements (09/13/2024) (#5689 ) * refactor: cleanup unused variables + fix pyright errors * feat(health_check.py): Closes https://github.com/BerriAI/litellm/issues/5686 * fix(o1_reasoning.py): add stricter check for o-1 reasoning model * refactor(mistral/): make it easier to see mistral transformation logic * fix(openai.py): fix openai o-1 model param mapping Fixes https://github.com/BerriAI/litellm/issues/5685 * feat(main.py): infer finetuned gemini model from base model Fixes https://github.com/BerriAI/litellm/issues/5678 * docs(vertex.md): update docs to call finetuned gemini models * feat(proxy_server.py): allow admin to hide proxy model aliases Closes https://github.com/BerriAI/litellm/issues/5692 * docs(load_balancing.md): add docs on hiding alias models from proxy config * fix(base.py): don't raise notimplemented error * fix(user_api_key_auth.py): fix model max budget check * fix(router.py): fix elif * fix(user_api_key_auth.py): don't set team_id to empty str * fix(team_endpoints.py): fix response type * test(test_completion.py): handle predibase error * test(test_proxy_server.py): fix test * fix(o1_transformation.py): fix max_completion_token mapping * test(test_image_generation.py): mark flaky test	2024-09-14 10:02:55 -07:00
Krish Dholakia	4657a40ef1	LiteLLM Minor Fixes and Improvements (09/12/2024) (#5658 ) * fix(factory.py): handle tool call content as list Fixes https://github.com/BerriAI/litellm/issues/5652 * fix(factory.py): enforce stronger typing * fix(router.py): return model alias in /v1/model/info and /v1/model_group/info * fix(user_api_key_auth.py): move noisy warning message to debug cleanup logs * fix(types.py): cleanup pydantic v2 deprecated param Fixes https://github.com/BerriAI/litellm/issues/5649 * docs(gemini.md): show how to pass inline data to gemini api Fixes https://github.com/BerriAI/litellm/issues/5674	2024-09-12 23:04:06 -07:00
Krish Dholakia	98c34a7e27	LiteLLM Minor Fixes and Improvements (11/09/2024) (#5634 ) * fix(caching.py): set ttl for async_increment cache fixes issue where ttl for redis client was not being set on increment_cache Fixes https://github.com/BerriAI/litellm/issues/5609 * fix(caching.py): fix increment cache w/ ttl for sync increment cache on redis Fixes https://github.com/BerriAI/litellm/issues/5609 * fix(router.py): support adding retry policy + allowed fails policy via config.yaml * fix(router.py): don't cooldown single deployments No point, as there's no other deployment to loadbalance with. * fix(user_api_key_auth.py): support setting allowed email domains on jwt tokens Closes https://github.com/BerriAI/litellm/issues/5605 * docs(token_auth.md): add user upsert + allowed email domain to jwt auth docs * fix(litellm_pre_call_utils.py): fix dynamic key logging when team id is set Fixes issue where key logging would not be set if team metadata was not none * fix(secret_managers/main.py): load environment variables correctly Fixes issue where os.environ/ was not being loaded correctly * test(test_router.py): fix test * feat(spend_tracking_utils.py): support logging additional usage params - e.g. prompt caching values for deepseek * test: fix tests * test: fix test * test: fix test * test: fix test * test: fix test	2024-09-11 22:36:06 -07:00
Krish Dholakia	72e961af3c	LiteLLM Minor Fixes and Improvements (08/06/2024) (#5567 ) * fix(utils.py): return citations for perplexity streaming Fixes https://github.com/BerriAI/litellm/issues/5535 * fix(anthropic/chat.py): support fallbacks for anthropic streaming (#5542) * fix(anthropic/chat.py): support fallbacks for anthropic streaming Fixes https://github.com/BerriAI/litellm/issues/5512 * fix(anthropic/chat.py): use module level http client if none given (prevents early client closure) * fix: fix linting errors * fix(http_handler.py): fix raise_for_status error handling * test: retry flaky test * fix otel type * fix(bedrock/embed): fix error raising * test(test_openai_batches_and_files.py): skip azure batches test (for now) quota exceeded * fix(test_router.py): skip azure batch route test (for now) - hit batch quota limits --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> * All `model_group_alias` should show up in `/models`, `/model/info` , `/model_group/info` (#5539) * fix(router.py): support returning model_alias model names in `/v1/models` * fix(proxy_server.py): support returning model alias'es on `/model/info` * feat(router.py): support returning model group alias for `/model_group/info` * fix(proxy_server.py): fix linting errors * fix(proxy_server.py): fix linting errors * build(model_prices_and_context_window.json): add amazon titan text premier pricing information Closes https://github.com/BerriAI/litellm/issues/5560 * feat(litellm_logging.py): log standard logging response object for pass through endpoints. Allows bedrock /invoke agent calls to be correctly logged to langfuse + s3 * fix(success_handler.py): fix linting error * fix(success_handler.py): fix linting errors * fix(team_endpoints.py): Allows admin to update team member budgets --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2024-09-06 17:16:24 -07:00
Krish Dholakia	1e7e538261	LiteLLM Minor fixes + improvements (08/04/2024) (#5505 ) * Minor IAM AWS OIDC Improvements (#5246) * AWS IAM: Temporary tokens are valid across all regions after being issued, so it is wasteful to request one for each region. * AWS IAM: Include an inline policy, to help reduce misuse of overly permissive IAM roles. * (test_bedrock_completion.py): Ensure we are testing cross AWS region OIDC flow. * fix(router.py): log rejected requests Fixes https://github.com/BerriAI/litellm/issues/5498 * refactor: don't use verbose_logger.exception, if exception is raised User might already have handling for this. But alerting systems in prod will raise this as an unhandled error. * fix(datadog.py): support setting datadog source as an env var Fixes https://github.com/BerriAI/litellm/issues/5508 * docs(logging.md): add dd_source to datadog docs * fix(proxy_server.py): expose `/customer/list` endpoint for showing all customers * (bedrock): Fix usage with Cloudflare AI Gateway, and proxies in general. (#5509) * feat(anthropic.py): support 'cache_control' param for content when it is a string * Revert "(bedrock): Fix usage with Cloudflare AI Gateway, and proxies in gener…" (#5519) This reverts commit `3fac0349c2`. * refactor: ci/cd run again --------- Co-authored-by: David Manouchehri <david.manouchehri@ai.moda>	2024-09-04 22:16:55 -07:00
Krish Dholakia	f9e6507cd1	LiteLLM Minor Fixes + Improvements (#5474 ) * feat(proxy/_types.py): add lago billing to callbacks ui Closes https://github.com/BerriAI/litellm/issues/5472 * fix(anthropic.py): return anthropic prompt caching information Fixes https://github.com/BerriAI/litellm/issues/5364 * feat(bedrock/chat.py): support 'json_schema' for bedrock models Closes https://github.com/BerriAI/litellm/issues/5434 * fix(bedrock/embed/embeddings.py): support async embeddings for amazon titan models * fix: linting fixes * fix: handle key errors * fix(bedrock/chat.py): fix bedrock ai21 streaming object * feat(bedrock/embed): support bedrock embedding optional params * fix(databricks.py): fix usage chunk * fix(internal_user_endpoints.py): apply internal user defaults, if user role updated Fixes issue where user update wouldn't apply defaults * feat(slack_alerting.py): provide multiple slack channels for a given alert type multiple channels might be interested in receiving an alert for a given type * docs(alerting.md): add multiple channel alerting to docs	2024-09-02 14:29:57 -07:00
Krish Dholakia	37f9705d6e	Bedrock Embeddings refactor + model support (#5462 ) * refactor(bedrock): initial commit to refactor bedrock to a folder Improve code readability + maintainability * refactor: more refactor work * fix: fix imports * feat(bedrock/embeddings.py): support translating embedding into amazon embedding formats * fix: fix linting errors * test: skip test on end of life model * fix(cohere/embed.py): fix linting error * fix(cohere/embed.py): fix typing * fix(cohere/embed.py): fix post-call logging for cohere embedding call * test(test_embeddings.py): fix error message assertion in test	2024-09-01 13:29:58 -07:00
Krrish Dholakia	18731cf42b	fix: fix linting errors	2024-08-27 12:14:23 -07:00
Krrish Dholakia	068aafdff9	fix(utils.py): correctly re-raise the headers from an exception, if present Fixes issue where retry after on router was not using azure / openai numbers	2024-08-24 12:30:30 -07:00
Krrish Dholakia	5add6687cc	fix(types/utils.py): fix linting errors	2024-08-03 11:48:33 -07:00
Krrish Dholakia	ae4bcd8a41	fix(utils.py): fix trim_messages to handle tool calling Fixes https://github.com/BerriAI/litellm/issues/4931	2024-07-29 13:04:41 -07:00
Krrish Dholakia	dd2d61bfce	build(pre-commit.yaml): update	2024-07-29 12:29:56 -07:00
Krrish Dholakia	59384c84a5	fix(utils.py): correctly re-raise azure api connection error '	2024-07-29 12:28:25 -07:00
James Braza	2d661a1d2a	Added poetry-check to pre-commit	2024-07-02 13:02:26 -04:00
Ishaan Jaff	0810b5fcb4	ci/cd again	2024-06-20 20:06:52 -07:00
Ishaan Jaff	0f29207c48	test add flake8 check on cli.py	2024-06-20 20:05:32 -07:00
Krrish Dholakia	7dd0151f83	fix(bedrock.py): support custom prompt templates for all providers Fixes https://github.com/BerriAI/litellm/issues/4239	2024-06-17 08:28:46 -07:00
Krrish Dholakia	877b37c6de	ci(pre-commit-config.yaml): add isort as a pre-commit hook	2024-06-15 16:48:00 -07:00
Krrish Dholakia	4f91205530	refactor(utils.py): refactor Logging to it's own class. Cut down utils.py to <10k lines. Easier debugging Reference: https://github.com/BerriAI/litellm/issues/4206	2024-06-15 10:57:20 -07:00
Krrish Dholakia	290bcc09e0	refactor(check_file_length.py): add local pre-commit check for file length	2024-06-15 09:18:53 -07:00
Ishaan Jaff	58eb352ddb	feat - refactor /chat/completions to have a common helper	2024-06-07 12:18:53 -07:00
Ishaan Jaff	02b5c03739	(ci/cd) use ruff	2024-06-07 10:07:48 -07:00
Krrish Dholakia	9083d8e490	fix: fix linting errors	2024-05-09 17:55:27 -07:00
Krrish Dholakia	6575143460	feat(proxy_server.py): return litellm version in response headers	2024-05-08 16:00:08 -07:00
Ishaan Jaff	533117f4d9	fix pre commit hook should test integrations	2024-05-01 19:14:08 -07:00
Krrish Dholakia	cccd577e75	feat(proxy_server.py): expose new permissions field for keys	2024-02-15 20:03:32 -08:00
ishaan-jaff	8b571159fc	(feat) add pre-commit hook to check model_prices_and_context_window.json litellm/model_prices_and_context_window_backup.json	2024-02-05 15:00:13 -08:00
ishaan-jaff	a6836a0996	(feat) pre-commit hook to validate	2024-02-05 14:42:10 -08:00
ishaan-jaff	08d57a20a7	(feat) pre-commit check print on proxy_server.py	2024-01-01 13:51:27 +05:30
Krrish Dholakia	4905929de3	refactor: add black formatting	2023-12-25 14:11:20 +05:30
Krrish Dholakia	402b2e5733	build(test_streaming.py): fix linting issues	2023-12-25 07:34:54 +05:30
Krrish Dholakia	ed50522863	fix(proxy_server.py): fix pydantic version errors	2023-12-09 12:09:49 -08:00
Krrish Dholakia	5fa2b6e5ad	fix(proxy_server.py): enable pre+post-call hooks and max parallel request limits	2023-12-08 17:11:30 -08:00
Krrish Dholakia	8cc0e8e5c5	ci(pre-commit-config.yaml): adding mypy linting as a pre-commit hook	2023-12-06 22:57:14 -08:00
Krrish Dholakia	6b40546e59	refactor(all-files): removing all print statements; adding pre-commit + flake8 to prevent future regressions	2023-11-04 12:50:15 -07:00

35 commits