litellm

Author	SHA1	Message	Date
Ishaan Jaff	0c33b8dd12	docs	2024-09-14 19:13:45 -07:00
Ishaan Jaff	c220fc0e92	docs max_completion_tokens	2024-09-14 19:12:12 -07:00
Ishaan Jaff	c8eff2dc65	[Feat-Prometheus] Track exception status on `litellm_deployment_failure_responses` (#5706 ) * add litellm_deployment_cooled_down * track num cooldowns on prometheus * track exception status * fix linting * docs prom metrics * cleanup premium user checks * prom track deployment failure state * docs prometheus	2024-09-14 18:44:31 -07:00
Ishaan Jaff	7c2ddba6c6	sambanova support (#5547 ) (#5703 ) * add sambanova support * sambanova support * updated api endpoint for sambanova --------- Co-authored-by: Venu Anuganti <venu@venublog.com> Co-authored-by: Venu Anuganti <venu@vairmac2020>	2024-09-14 17:23:04 -07:00
Krish Dholakia	60709a0753	LiteLLM Minor Fixes and Improvements (09/13/2024) (#5689 ) * refactor: cleanup unused variables + fix pyright errors * feat(health_check.py): Closes https://github.com/BerriAI/litellm/issues/5686 * fix(o1_reasoning.py): add stricter check for o-1 reasoning model * refactor(mistral/): make it easier to see mistral transformation logic * fix(openai.py): fix openai o-1 model param mapping Fixes https://github.com/BerriAI/litellm/issues/5685 * feat(main.py): infer finetuned gemini model from base model Fixes https://github.com/BerriAI/litellm/issues/5678 * docs(vertex.md): update docs to call finetuned gemini models * feat(proxy_server.py): allow admin to hide proxy model aliases Closes https://github.com/BerriAI/litellm/issues/5692 * docs(load_balancing.md): add docs on hiding alias models from proxy config * fix(base.py): don't raise notimplemented error * fix(user_api_key_auth.py): fix model max budget check * fix(router.py): fix elif * fix(user_api_key_auth.py): don't set team_id to empty str * fix(team_endpoints.py): fix response type * test(test_completion.py): handle predibase error * test(test_proxy_server.py): fix test * fix(o1_transformation.py): fix max_completion_token mapping * test(test_image_generation.py): mark flaky test	2024-09-14 10:02:55 -07:00
Krish Dholakia	4657a40ef1	LiteLLM Minor Fixes and Improvements (09/12/2024) (#5658 ) * fix(factory.py): handle tool call content as list Fixes https://github.com/BerriAI/litellm/issues/5652 * fix(factory.py): enforce stronger typing * fix(router.py): return model alias in /v1/model/info and /v1/model_group/info * fix(user_api_key_auth.py): move noisy warning message to debug cleanup logs * fix(types.py): cleanup pydantic v2 deprecated param Fixes https://github.com/BerriAI/litellm/issues/5649 * docs(gemini.md): show how to pass inline data to gemini api Fixes https://github.com/BerriAI/litellm/issues/5674	2024-09-12 23:04:06 -07:00
Ishaan Jaff	13ba22d6fd	docs add o1 to docs	2024-09-12 19:06:13 -07:00
Krish Dholakia	98c34a7e27	LiteLLM Minor Fixes and Improvements (11/09/2024) (#5634 ) * fix(caching.py): set ttl for async_increment cache fixes issue where ttl for redis client was not being set on increment_cache Fixes https://github.com/BerriAI/litellm/issues/5609 * fix(caching.py): fix increment cache w/ ttl for sync increment cache on redis Fixes https://github.com/BerriAI/litellm/issues/5609 * fix(router.py): support adding retry policy + allowed fails policy via config.yaml * fix(router.py): don't cooldown single deployments No point, as there's no other deployment to loadbalance with. * fix(user_api_key_auth.py): support setting allowed email domains on jwt tokens Closes https://github.com/BerriAI/litellm/issues/5605 * docs(token_auth.md): add user upsert + allowed email domain to jwt auth docs * fix(litellm_pre_call_utils.py): fix dynamic key logging when team id is set Fixes issue where key logging would not be set if team metadata was not none * fix(secret_managers/main.py): load environment variables correctly Fixes issue where os.environ/ was not being loaded correctly * test(test_router.py): fix test * feat(spend_tracking_utils.py): support logging additional usage params - e.g. prompt caching values for deepseek * test: fix tests * test: fix test * test: fix test * test: fix test * test: fix test	2024-09-11 22:36:06 -07:00
steffen-sbt	de9a39e7c6	Add the option to specify a schema in the postgres DB, also modify docs (#5640 )	2024-09-11 14:53:52 -07:00
Miri Bar	ebf42d6764	docs: update ai21 docs	2024-09-11 13:35:40 +03:00
dependabot[bot]	e48459389c	Bump send and express in /docs/my-website Bumps [send](https://github.com/pillarjs/send) and [express](https://github.com/expressjs/express). These dependencies needed to be updated together. Updates `send` from 0.18.0 to 0.19.0 - [Release notes](https://github.com/pillarjs/send/releases) - [Changelog](https://github.com/pillarjs/send/blob/master/HISTORY.md) - [Commits](https://github.com/pillarjs/send/compare/0.18.0...0.19.0) Updates `express` from 4.19.2 to 4.20.0 - [Release notes](https://github.com/expressjs/express/releases) - [Changelog](https://github.com/expressjs/express/blob/master/History.md) - [Commits](https://github.com/expressjs/express/compare/4.19.2...4.20.0) --- updated-dependencies: - dependency-name: send dependency-type: indirect - dependency-name: express dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2024-09-11 02:11:46 +00:00
Ishaan Jaff	899eaa9566	Merge pull request #5571 from jalammar/cohere-updated-models Add Cohere refresh models and update pricing	2024-09-10 17:22:51 -07:00
Ishaan Jaff	87bac7c026	fix rps / rpm values on load testing	2024-09-10 11:22:19 -07:00
Jay Alammar	795b29dfc4	Updating Cohere models, prices, and documentation	2024-09-10 13:47:05 -04:00
Ishaan Jaff	479b12be09	Merge branch 'main' into litellm_allow_turning_off_message_logging_for_callbacks	2024-09-09 21:59:36 -07:00
Ishaan Jaff	a6d3bd0ab7	Merge branch 'main' into litellm_tag_routing_fixes	2024-09-09 17:45:18 -07:00
Ishaan Jaff	949af7be2e	fix team based logging doc	2024-09-09 16:49:26 -07:00
Ishaan Jaff	4592d80f43	add doc on redacting otel message / response	2024-09-09 16:10:13 -07:00
Ishaan Jaff	2fceeedd94	add "default" tag	2024-09-09 14:41:22 -07:00
Ishaan Jaff	3bf6589fab	docs architecture	2024-09-07 19:09:33 -07:00
Krrish Dholakia	8294e8793c	docs(deploy.md): add published non-root docker image to docs	2024-09-07 18:01:31 -07:00
Ishaan Jaff	ba41a72f92	High Level architecture	2024-09-07 16:29:22 -07:00
Ishaan Jaff	9eb59e3645	Merge pull request #5585 from BerriAI/litellm_docs_arch_diagram [Docs] - Add Lifecycle of a request through LiteLLM Gateway	2024-09-07 16:22:02 -07:00
Ishaan Jaff	c2c63e4dbe	docs add arch diagram	2024-09-07 16:21:29 -07:00
Ishaan Jaff	54db564529	add arch diagram	2024-09-07 15:49:51 -07:00
Ishaan Jaff	ecb774c3e8	add doc on spend report frequency	2024-09-07 11:54:33 -07:00
Ishaan Jaff	009a1f7f86	Merge pull request #5579 from BerriAI/litellm_set_redis_cluster_env [Feat] Allow setting up Redis Cluster using .env vars	2024-09-07 11:31:38 -07:00
Ishaan Jaff	05505903b2	docs better sidebar	2024-09-07 11:31:07 -07:00
Ishaan Jaff	3984b9080c	docs cleanup	2024-09-07 11:23:44 -07:00
Ishaan Jaff	2cf0714b0d	docs organize sidebar	2024-09-07 11:23:06 -07:00
Ishaan Jaff	808ba36b55	ui cleanup	2024-09-07 11:20:07 -07:00
Ishaan Jaff	3bf2c06e06	add config for setting up redis cluster	2024-09-07 09:37:23 -07:00
Pradyumna Singh Rathore	a4f5fb3c30	fix missing class object instantiation in custom_llm_server provider documentation's quick start (#5578 ) Co-authored-by: Pradyumna Singh Rathore <pradyumna.singhrathore@halliburton.com>	2024-09-07 08:22:18 -07:00
Ishaan Jaff	bf1325e898	Merge pull request #5411 from gary149/huggingface-update Update Hugging Face Doc	2024-09-06 16:18:31 -07:00
Krish Dholakia	7ced9c8c0e	Update lago.py to accomodate API change (#5495 ) (#5543 ) * Update lago.py to accomodate API change (#5495) external_customer_id is deprecated. external_subscription_id is the replacement. * fix(lago.py): fixes \ --------- Co-authored-by: Raymond Weitekamp <19483938+rawwerks@users.noreply.github.com>	2024-09-05 17:27:40 -07:00
Ishaan Jaff	d8ef8c133e	docs add video for key based logging	2024-09-05 17:10:12 -07:00
Krrish Dholakia	08976370a3	docs(configs.md): update to clarify you can use os.environ/ for any config value	2024-09-05 12:40:31 -07:00
Krrish Dholakia	a9be7dd79b	docs(pass_through/bedrock.md): add bedrock agents support	2024-09-04 22:30:14 -07:00
Krish Dholakia	1e7e538261	LiteLLM Minor fixes + improvements (08/04/2024) (#5505 ) * Minor IAM AWS OIDC Improvements (#5246) * AWS IAM: Temporary tokens are valid across all regions after being issued, so it is wasteful to request one for each region. * AWS IAM: Include an inline policy, to help reduce misuse of overly permissive IAM roles. * (test_bedrock_completion.py): Ensure we are testing cross AWS region OIDC flow. * fix(router.py): log rejected requests Fixes https://github.com/BerriAI/litellm/issues/5498 * refactor: don't use verbose_logger.exception, if exception is raised User might already have handling for this. But alerting systems in prod will raise this as an unhandled error. * fix(datadog.py): support setting datadog source as an env var Fixes https://github.com/BerriAI/litellm/issues/5508 * docs(logging.md): add dd_source to datadog docs * fix(proxy_server.py): expose `/customer/list` endpoint for showing all customers * (bedrock): Fix usage with Cloudflare AI Gateway, and proxies in general. (#5509) * feat(anthropic.py): support 'cache_control' param for content when it is a string * Revert "(bedrock): Fix usage with Cloudflare AI Gateway, and proxies in gener…" (#5519) This reverts commit `3fac0349c2`. * refactor: ci/cd run again --------- Co-authored-by: David Manouchehri <david.manouchehri@ai.moda>	2024-09-04 22:16:55 -07:00
Ishaan Jaff	4b9163c7dc	Merge pull request #5514 from BerriAI/litellm_add_presidio [Fix-Refactor] support presidio on new guardrails config	2024-09-04 16:09:54 -07:00
Krrish Dholakia	aefca63a8c	docs(logging.md): fix name in docs	2024-09-04 14:36:53 -07:00
Ishaan Jaff	528154764b	docs update presidio	2024-09-04 13:43:14 -07:00
Ishaan Jaff	36505058e0	doc setting language per request	2024-09-04 13:23:17 -07:00
Ishaan Jaff	6c30f18f8c	docs new presidio language controls	2024-09-04 13:04:19 -07:00
Victor Mustar (aider)	f65ceef46a	remove conversational-task deprecated	2024-09-04 19:21:10 +02:00
Krish Dholakia	be3c7b401e	LiteLLM Minor fixes + improvements (08/03/2024) (#5488 ) * fix(internal_user_endpoints.py): set budget_reset_at for /user/update * fix(vertex_and_google_ai_studio_gemini.py): handle accumulated json Fixes https://github.com/BerriAI/litellm/issues/5479 * fix(vertex_ai_and_gemini.py): fix assistant message function call when content is not None Fixes https://github.com/BerriAI/litellm/issues/5490 * fix(proxy_server.py): generic state uuid for okta sso * fix(lago.py): improve debug logs Debugging for https://github.com/BerriAI/litellm/issues/5477 * docs(bedrock.md): add bedrock cross-region inferencing to docs * fix(azure.py): return azure response headers on aembedding call * feat(azure.py): return azure response headers for `/audio/transcription` * fix(types/utils.py): standardize deepseek / anthropic prompt caching usage information Closes https://github.com/BerriAI/litellm/issues/5285 * docs(usage.md): add docs on litellm usage object * test(test_completion.py): mark flaky test	2024-09-03 21:21:34 -07:00
Ishaan Jaff	115de195f1	Merge branch 'main' into litellm_allow_setting_route_controls	2024-09-03 17:00:22 -07:00
Ishaan Jaff	814bc67c84	docs control routes on proxy	2024-09-03 16:58:11 -07:00
Ishaan Jaff	19dbfff620	Merge pull request #5489 from BerriAI/litellm_Add_secret_managers [Feat] Add Google Secret Manager Support	2024-09-03 14:51:32 -07:00
dependabot[bot]	ae37d60ba1	Bump pagefind from 1.1.0 to 1.1.1 in /docs/my-website Bumps [pagefind](https://github.com/CloudCannon/pagefind) from 1.1.0 to 1.1.1. - [Release notes](https://github.com/CloudCannon/pagefind/releases) - [Changelog](https://github.com/CloudCannon/pagefind/blob/main/CHANGELOG.md) - [Commits](https://github.com/CloudCannon/pagefind/compare/v1.1.0...v1.1.1) --- updated-dependencies: - dependency-name: pagefind dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2024-09-03 19:35:54 +00:00

... 3 4 5 6 7 ...

2844 commits