litellm

Author	SHA1	Message	Date
Ishaan Jaff	4399deab2e	docs fallback/login	2024-09-18 16:43:19 -07:00
Ishaan Jaff	5480563281	docs add info on `/fallback/login`	2024-09-18 16:41:19 -07:00
Ishaan Jaff	eba76377ca	[Chore-Proxy] enforce jwt auth as enterprise feature (#5770) * enforce prometheus as enterprise feature * show correct error on prometheus metric when not enterprise user * docs prometheus metrics enforced * docs enforce JWT auth * enforce JWT auth as enterprise feature * fix merge conflicts	2024-09-18 16:28:37 -07:00
Ishaan Jaff	50cc7c0353	[Chore LiteLLM Proxy] enforce prometheus metrics as enterprise feature (#5769) * enforce prometheus as enterprise feature * show correct error on prometheus metric when not enterprise user * docs prometheus metrics enforced * fix enforcing	2024-09-18 16:28:12 -07:00
Ishaan Jaff	7e07c37be7	[Feat-Proxy] Add Azure Assistants API - Create Assistant, Delete Assistant Support (#5777) * update docs to show providers * azure - move assistants in its own file * create new azure assistants file * add azure create assistants * add test for create / delete assistants * azure add delete assistants support * docs add Azure to support providers for assistants api * fix linting errors * fix standard logging merge conflict * docs azure create assistants * fix doc	2024-09-18 16:27:33 -07:00
Ishaan Jaff	a109853d21	[Prometheus] track requested model (#5774) * enforce prometheus as enterprise feature * show correct error on prometheus metric when not enterprise user * docs prometheus metrics enforced * track requested model on prometheus * docs prom metrics * fix prom tracking failures	2024-09-18 12:46:58 -07:00
Ishaan Jaff	a4549b5b6c	docs update what gets logged on gcs buckets	2024-09-18 10:18:57 -07:00
Ishaan Jaff	aa84bcebaf	docs update standard logging object	2024-09-18 10:17:09 -07:00
Ishaan Jaff	2987b14f3b	docs clarify how virtual key is read from cache / db	2024-09-18 09:39:54 -07:00
Krrish Dholakia	920280155b	docs(azure_ai.md): add rerank api endpoint to docs	2024-09-17 23:06:19 -07:00
Krish Dholakia	98c335acd0	LiteLLM Minor Fixes & Improvements (09/17/2024) (#5742 ) * fix(proxy_server.py): use default azure credentials to support azure non-client secret kms * fix(langsmith.py): raise error if credentials missing * feat(langsmith.py): support error logging for langsmith + standard logging payload Fixes https://github.com/BerriAI/litellm/issues/5738 * Fix hardcoding of schema in view check (#5749) * fix - deal with case when check view exists returns None (#5740) * Revert "fix - deal with case when check view exists returns None (#5740)" (#5741) This reverts commit `535228159b`. * test(test_router_debug_logs.py): move to mock response * Fix hardcoding of schema --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com> * fix(proxy_server.py): allow admin to disable ui via `DISABLE_ADMIN_UI` flag * fix(router.py): fix default model name value Fixes `55db19a1e4 (r1763712148)` * fix(utils.py): fix unbound variable error * feat(rerank/main.py): add azure ai rerank endpoints Closes https://github.com/BerriAI/litellm/issues/5667 * feat(secret_detection.py): Allow configuring secret detection params Allows admin to control what plugins to run for secret detection. Prevents overzealous secret detection. * docs(secret_detection.md): add secret detection guardrail docs * fix: fix linting errors * fix - deal with case when check view exists returns None (#5740) * Revert "fix - deal with case when check view exists returns None (#5740)" (#5741) This reverts commit `535228159b`. 
* Litellm fix router testing (#5748) * test: fix testing - azure changed content policy error logic * test: fix tests to use mock responses * test(test_image_generation.py): handle api instability * test(test_image_generation.py): handle azure api instability * fix(utils.py): fix unbounded variable error * fix(utils.py): fix unbounded variable error * test: refactor test to use mock response * test: mark flaky azure tests * Bump next from 14.1.1 to 14.2.10 in /ui/litellm-dashboard (#5753) Bumps [next](https://github.com/vercel/next.js) from 14.1.1 to 14.2.10. - [Release notes](https://github.com/vercel/next.js/releases) - [Changelog](https://github.com/vercel/next.js/blob/canary/release.js) - [Commits](https://github.com/vercel/next.js/compare/v14.1.1...v14.2.10) --- updated-dependencies: - dependency-name: next dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * [Fix] o1-mini causes pydantic warnings on `reasoning_tokens` (#5754) * add requester_metadata in standard logging payload * log requester_metadata in metadata * use StandardLoggingPayload for logging * docs StandardLoggingPayload * fix import * include standard logging object in failure * add test for requester metadata * handle completion_tokens_details * add test for completion_tokens_details * [Feat-Proxy-DataDog] Log Redis, Postgres Failure events on DataDog (#5750) * dd - start tracking redis status on dd * add async_service_succes_hook / failure hook in custom logger * add async_service_failure_hook * log service failures on dd * fix import error * add test for redis errors / warning * [Fix] Router/ Proxy - Tag Based routing, raise correct error when no deployments found and tag filtering is on (#5745) * fix tag routing - raise correct error when no model with tag based routing * fix error string from tag based routing * test router tag based routing * raise 401 error when no 
tags avialable for deploymen * linting fix * [Feat] Log Request metadata on gcs bucket logging (#5743) * add requester_metadata in standard logging payload * log requester_metadata in metadata * use StandardLoggingPayload for logging * docs StandardLoggingPayload * fix import * include standard logging object in failure * add test for requester metadata * fix(litellm_logging.py): fix logging message * fix(rerank_api/main.py): fix linting errors * fix(custom_guardrails.py): maintain backwards compatibility for older guardrails * fix(rerank_api/main.py): fix cost tracking for rerank endpoints --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: steffen-sbt <148480574+steffen-sbt@users.noreply.github.com> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-09-17 23:00:04 -07:00
Ishaan Jaff	be96c79b3c	update datadog docs	2024-09-17 20:42:36 -07:00
Ishaan Jaff	7f4dfe434a	[Fix] o1-mini causes pydantic warnings on `reasoning_tokens` (#5754) * add requester_metadata in standard logging payload * log requester_metadata in metadata * use StandardLoggingPayload for logging * docs StandardLoggingPayload * fix import * include standard logging object in failure * add test for requester metadata * handle completion_tokens_details * add test for completion_tokens_details	2024-09-17 20:23:14 -07:00
Krish Dholakia	234185ec13	LiteLLM Minor Fixes & Improvements (09/16/2024) (#5723 ) (#5731 ) * LiteLLM Minor Fixes & Improvements (09/16/2024) (#5723) * coverage (#5713) Signed-off-by: dbczumar <corey.zumar@databricks.com> * Move (#5714) Signed-off-by: dbczumar <corey.zumar@databricks.com> * fix(litellm_logging.py): fix logging client re-init (#5710) Fixes https://github.com/BerriAI/litellm/issues/5695 * fix(presidio.py): Fix logging_hook response and add support for additional presidio variables in guardrails config Fixes https://github.com/BerriAI/litellm/issues/5682 * feat(o1_handler.py): fake streaming for openai o1 models Fixes https://github.com/BerriAI/litellm/issues/5694 * docs: deprecated traceloop integration in favor of native otel (#5249) * fix: fix linting errors * fix: fix linting errors * fix(main.py): fix o1 import --------- Signed-off-by: dbczumar <corey.zumar@databricks.com> Co-authored-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com> Co-authored-by: Nir Gazit <nirga@users.noreply.github.com> * feat(spend_management_endpoints.py): expose `/global/spend/refresh` endpoint for updating material view (#5730) * feat(spend_management_endpoints.py): expose `/global/spend/refresh` endpoint for updating material view Supports having `MonthlyGlobalSpend` view be a material view, and exposes an endpoint to refresh it * fix(custom_logger.py): reset calltype * fix: fix linting errors * fix: fix linting error * fix: fix import * test(test_databricks.py): fix databricks tests --------- Signed-off-by: dbczumar <corey.zumar@databricks.com> Co-authored-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com> Co-authored-by: Nir Gazit <nirga@users.noreply.github.com>	2024-09-17 08:05:52 -07:00
Ishaan Jaff	b6ae2204a8	[Feat-Proxy] Slack Alerting - allow using os.environ/ vars for alert to webhook url (#5726) * allow using os.environ for slack urls * use env vars for webhook urls * fix types for get_secret * fix linting * fix linting * fix linting * linting fixes * linting fix * docs alerting slack * fix get data	2024-09-16 18:03:37 -07:00
Ishaan Jaff	8fbe2abb89	[Feat-Proxy] Add upperbound key duration param (#5727) * add upperbound key duration param * use upper bound values when None set * docs upperbound params	2024-09-16 16:28:36 -07:00
Krrish Dholakia	3c741b7beb	docs(docker_quick_start.md): update quick start with azure connection error	2024-09-16 07:31:32 -07:00
Ishaan Jaff	0c33b8dd12	docs	2024-09-14 19:13:45 -07:00
Ishaan Jaff	c220fc0e92	docs max_completion_tokens	2024-09-14 19:12:12 -07:00
Ishaan Jaff	c8eff2dc65	[Feat-Prometheus] Track exception status on `litellm_deployment_failure_responses` (#5706) * add litellm_deployment_cooled_down * track num cooldowns on prometheus * track exception status * fix linting * docs prom metrics * cleanup premium user checks * prom track deployment failure state * docs prometheus	2024-09-14 18:44:31 -07:00
Ishaan Jaff	7c2ddba6c6	sambanova support (#5547) (#5703) * add sambanova support * sambanova support * updated api endpoint for sambanova --------- Co-authored-by: Venu Anuganti <venu@venublog.com> Co-authored-by: Venu Anuganti <venu@vairmac2020>	2024-09-14 17:23:04 -07:00
Krish Dholakia	60709a0753	LiteLLM Minor Fixes and Improvements (09/13/2024) (#5689 ) * refactor: cleanup unused variables + fix pyright errors * feat(health_check.py): Closes https://github.com/BerriAI/litellm/issues/5686 * fix(o1_reasoning.py): add stricter check for o-1 reasoning model * refactor(mistral/): make it easier to see mistral transformation logic * fix(openai.py): fix openai o-1 model param mapping Fixes https://github.com/BerriAI/litellm/issues/5685 * feat(main.py): infer finetuned gemini model from base model Fixes https://github.com/BerriAI/litellm/issues/5678 * docs(vertex.md): update docs to call finetuned gemini models * feat(proxy_server.py): allow admin to hide proxy model aliases Closes https://github.com/BerriAI/litellm/issues/5692 * docs(load_balancing.md): add docs on hiding alias models from proxy config * fix(base.py): don't raise notimplemented error * fix(user_api_key_auth.py): fix model max budget check * fix(router.py): fix elif * fix(user_api_key_auth.py): don't set team_id to empty str * fix(team_endpoints.py): fix response type * test(test_completion.py): handle predibase error * test(test_proxy_server.py): fix test * fix(o1_transformation.py): fix max_completion_token mapping * test(test_image_generation.py): mark flaky test	2024-09-14 10:02:55 -07:00
Krish Dholakia	4657a40ef1	LiteLLM Minor Fixes and Improvements (09/12/2024) (#5658 ) * fix(factory.py): handle tool call content as list Fixes https://github.com/BerriAI/litellm/issues/5652 * fix(factory.py): enforce stronger typing * fix(router.py): return model alias in /v1/model/info and /v1/model_group/info * fix(user_api_key_auth.py): move noisy warning message to debug cleanup logs * fix(types.py): cleanup pydantic v2 deprecated param Fixes https://github.com/BerriAI/litellm/issues/5649 * docs(gemini.md): show how to pass inline data to gemini api Fixes https://github.com/BerriAI/litellm/issues/5674	2024-09-12 23:04:06 -07:00
Ishaan Jaff	13ba22d6fd	docs add o1 to docs	2024-09-12 19:06:13 -07:00
Krish Dholakia	98c34a7e27	LiteLLM Minor Fixes and Improvements (11/09/2024) (#5634 ) * fix(caching.py): set ttl for async_increment cache fixes issue where ttl for redis client was not being set on increment_cache Fixes https://github.com/BerriAI/litellm/issues/5609 * fix(caching.py): fix increment cache w/ ttl for sync increment cache on redis Fixes https://github.com/BerriAI/litellm/issues/5609 * fix(router.py): support adding retry policy + allowed fails policy via config.yaml * fix(router.py): don't cooldown single deployments No point, as there's no other deployment to loadbalance with. * fix(user_api_key_auth.py): support setting allowed email domains on jwt tokens Closes https://github.com/BerriAI/litellm/issues/5605 * docs(token_auth.md): add user upsert + allowed email domain to jwt auth docs * fix(litellm_pre_call_utils.py): fix dynamic key logging when team id is set Fixes issue where key logging would not be set if team metadata was not none * fix(secret_managers/main.py): load environment variables correctly Fixes issue where os.environ/ was not being loaded correctly * test(test_router.py): fix test * feat(spend_tracking_utils.py): support logging additional usage params - e.g. prompt caching values for deepseek * test: fix tests * test: fix test * test: fix test * test: fix test * test: fix test	2024-09-11 22:36:06 -07:00
steffen-sbt	de9a39e7c6	Add the option to specify a schema in the postgres DB, also modify docs (#5640)	2024-09-11 14:53:52 -07:00
Miri Bar	ebf42d6764	docs: update ai21 docs	2024-09-11 13:35:40 +03:00
dependabot[bot]	e48459389c	Bump send and express in /docs/my-website Bumps [send](https://github.com/pillarjs/send) and [express](https://github.com/expressjs/express). These dependencies needed to be updated together. Updates `send` from 0.18.0 to 0.19.0 - [Release notes](https://github.com/pillarjs/send/releases) - [Changelog](https://github.com/pillarjs/send/blob/master/HISTORY.md) - [Commits](https://github.com/pillarjs/send/compare/0.18.0...0.19.0) Updates `express` from 4.19.2 to 4.20.0 - [Release notes](https://github.com/expressjs/express/releases) - [Changelog](https://github.com/expressjs/express/blob/master/History.md) - [Commits](https://github.com/expressjs/express/compare/4.19.2...4.20.0) --- updated-dependencies: - dependency-name: send dependency-type: indirect - dependency-name: express dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2024-09-11 02:11:46 +00:00
Ishaan Jaff	899eaa9566	Merge pull request #5571 from jalammar/cohere-updated-models Add Cohere refresh models and update pricing	2024-09-10 17:22:51 -07:00
Ishaan Jaff	87bac7c026	fix rps / rpm values on load testing	2024-09-10 11:22:19 -07:00
Jay Alammar	795b29dfc4	Updating Cohere models, prices, and documentation	2024-09-10 13:47:05 -04:00
Ishaan Jaff	479b12be09	Merge branch 'main' into litellm_allow_turning_off_message_logging_for_callbacks	2024-09-09 21:59:36 -07:00
Ishaan Jaff	a6d3bd0ab7	Merge branch 'main' into litellm_tag_routing_fixes	2024-09-09 17:45:18 -07:00
Ishaan Jaff	949af7be2e	fix team based logging doc	2024-09-09 16:49:26 -07:00
Ishaan Jaff	4592d80f43	add doc on redacting otel message / response	2024-09-09 16:10:13 -07:00
Ishaan Jaff	2fceeedd94	add "default" tag	2024-09-09 14:41:22 -07:00
Ishaan Jaff	3bf6589fab	docs architecture	2024-09-07 19:09:33 -07:00
Krrish Dholakia	8294e8793c	docs(deploy.md): add published non-root docker image to docs	2024-09-07 18:01:31 -07:00
Ishaan Jaff	ba41a72f92	High Level architecture	2024-09-07 16:29:22 -07:00
Ishaan Jaff	9eb59e3645	Merge pull request #5585 from BerriAI/litellm_docs_arch_diagram [Docs] - Add Lifecycle of a request through LiteLLM Gateway	2024-09-07 16:22:02 -07:00
Ishaan Jaff	c2c63e4dbe	docs add arch diagram	2024-09-07 16:21:29 -07:00
Ishaan Jaff	54db564529	add arch diagram	2024-09-07 15:49:51 -07:00
Ishaan Jaff	ecb774c3e8	add doc on spend report frequency	2024-09-07 11:54:33 -07:00
Ishaan Jaff	009a1f7f86	Merge pull request #5579 from BerriAI/litellm_set_redis_cluster_env [Feat] Allow setting up Redis Cluster using .env vars	2024-09-07 11:31:38 -07:00
Ishaan Jaff	05505903b2	docs better sidebar	2024-09-07 11:31:07 -07:00
Ishaan Jaff	3984b9080c	docs cleanup	2024-09-07 11:23:44 -07:00
Ishaan Jaff	2cf0714b0d	docs organize sidebar	2024-09-07 11:23:06 -07:00
Ishaan Jaff	808ba36b55	ui cleanup	2024-09-07 11:20:07 -07:00
Ishaan Jaff	3bf2c06e06	add config for setting up redis cluster	2024-09-07 09:37:23 -07:00
Pradyumna Singh Rathore	a4f5fb3c30	fix missing class object instantiation in custom_llm_server provider documentation's quick start (#5578) Co-authored-by: Pradyumna Singh Rathore <pradyumna.singhrathore@halliburton.com>	2024-09-07 08:22:18 -07:00


2811 commits