Commit graph

11479 commits

Author SHA1 Message Date
Ishaan Jaff
791ec2e26a ui - new build 2024-05-06 18:06:25 -07:00
Ishaan Jaff
0012463a9b
Merge pull request #3489 from BerriAI/litellm_send_slack_alerts_model_new
Feat - send slack alerts on `model/new`
2024-05-06 18:01:08 -07:00
Ishaan Jaff
7c789fe0af
Merge pull request #3492 from BerriAI/litellm_fixes_retry_policy
[UI] Fixes for getting/setting Retry Policy by Model Group
2024-05-06 18:00:21 -07:00
Ishaan Jaff
6a056c0c17 fix support InternalServerErrorRetries 2024-05-06 17:59:38 -07:00
Ishaan Jaff
f04f45a80d ui - fix show num retries for 500 error 2024-05-06 17:58:56 -07:00
Ishaan Jaff
eb5dc38fc9 fix - set retry policy on ui 2024-05-06 17:57:07 -07:00
Ishaan Jaff
4fdf1147d3 ui - get/set model retry policy 2024-05-06 17:44:55 -07:00
Ishaan Jaff
116e555a03 fix - setting model_group_retry_policy in db 2024-05-06 17:43:43 -07:00
Ishaan Jaff
57a7870348 ui - show retry policy per model group 2024-05-06 17:23:51 -07:00
Krrish Dholakia
718f423d7d feat(slack_alerting.py): support sending daily reports on deployments
allow admin to easily know slow + failing deployments

Closes  https://github.com/BerriAI/litellm/issues/3483
2024-05-06 17:18:42 -07:00
Ishaan Jaff
040dc1f23e fix retry policy 2024-05-06 16:43:42 -07:00
Ishaan Jaff
1de50d62f7 fix test router debug logs 2024-05-06 16:38:16 -07:00
Ishaan Jaff
1725b7a9e1
Merge pull request #3461 from BerriAI/litellm_set_router_retry_policy_ui
[UI] Set Router Retry Policy
2024-05-06 16:36:09 -07:00
Ishaan Jaff
02f0ef7158 feat - format model price in slack alerts for new model 2024-05-06 15:57:22 -07:00
Ishaan Jaff
c600371e6e feat - send alert on adding new model 2024-05-06 15:45:07 -07:00
Ishaan Jaff
62adaa0fd4 feat - send slack alerts on adding new model 2024-05-06 15:43:02 -07:00
Ishaan Jaff
448c3827e6
Merge pull request #3488 from BerriAI/litellm_show_key_name_team_slack_alerts
[Feat] show `key alias` and  `team  alias` in slack alerts
2024-05-06 14:36:01 -07:00
Ishaan Jaff
a9ec5789e7 feat - add key name and team alias to slack alerts 2024-05-06 14:30:10 -07:00
Ishaan Jaff
6ff37aabb0 fix add key name + team name in alerting messages 2024-05-06 14:29:04 -07:00
Ishaan Jaff
24b9fbe19f fix - show api key name and alias in alerts 2024-05-06 14:15:34 -07:00
dependabot[bot]
6fc625b54d
build(deps): bump jinja2 from 3.1.3 to 3.1.4
Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.3 to 3.1.4.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/3.1.3...3.1.4)

---
updated-dependencies:
- dependency-name: jinja2
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-05-06 20:55:32 +00:00
Ishaan Jaff
562ef2d2e1 fix - add better debugging on num_callbacks test 2024-05-06 13:42:20 -07:00
Krrish Dholakia
863f9c60a2 refactor: trigger new build 2024-05-06 11:46:30 -07:00
Krrish Dholakia
bbd5ba3ea1 build(config.yml): update env variable 2024-05-06 11:45:25 -07:00
Krrish Dholakia
c6d830d377 fix(anthropic.py): remove raise error on 'empty content'
Fixes https://github.com/BerriAI/litellm/issues/3453
2024-05-06 11:42:09 -07:00
Ishaan Jaff
31c4f5f811 (ci/cd) add detailed debugging info 2024-05-06 11:40:16 -07:00
Ishaan Jaff
4f8ee64e8a ci/cd run again 2024-05-06 11:32:14 -07:00
Ishaan Jaff
c83348b7a8 fix build and test 2024-05-06 11:24:31 -07:00
Krrish Dholakia
b014a72f7a test(test_openai_endpoints.py): change key 2024-05-06 11:19:47 -07:00
Ishaan Jaff
6a06aba443 (ci/cd) use db connection limit 2024-05-06 11:15:22 -07:00
Ishaan Jaff
44c1dedf64 test fix - test_async_chat_openai_stream 2024-05-06 11:13:07 -07:00
Ishaan Jaff
4bd3967a1a (ci/cd) run again 2024-05-06 11:04:43 -07:00
Krrish Dholakia
4b5cf26c1b fix(utils.py): handle gemini chunk no parts error
Fixes https://github.com/BerriAI/litellm/issues/3468
2024-05-06 10:59:53 -07:00
Ishaan Jaff
e8d3dd475a fix fake endpoint used on ci/cd 2024-05-06 10:37:39 -07:00
Krrish Dholakia
cad049b6a8 build(bump-helm-chart-app-version): bump helm chart app version to latest 2024-05-06 10:26:01 -07:00
Krrish Dholakia
7ebe00599b build(host-helm-chart-on-root): allows helm repo add to work with litellm 2024-05-06 10:11:56 -07:00
Simon Sanchez Viloria
6181d1eaad Merge branch 'main' into feature/watsonx-integration 2024-05-06 17:27:14 +02:00
Ishaan Jaff
ec63a30095 docs - deploy litellm on gcp cloud run 2024-05-06 08:10:35 -07:00
Simon Sanchez Viloria
83a274b54b (feat) support for async stream to watsonx provider 2024-05-06 17:08:40 +02:00
Simon Sanchez Viloria
62b3f25398 (fix) fix ibm_granite_pt and use hf_chat_template for llama-3 instruct prompts 2024-05-06 17:08:40 +02:00
Ishaan Jaff
e0001a9121 docs - add using vertex embedding models 2024-05-06 07:56:17 -07:00
Ishaan Jaff
c762e60ae5 stash ui changes 2024-05-06 07:53:55 -07:00
Rajan Paneru
2347cb2247 bring back the to be depricated mistral-medium 2024-05-07 00:19:19 +09:30
Krish Dholakia
9f58583888
Merge pull request #3299 from themrzmaster/main
Allowing extra headers for bedrock
2024-05-06 07:45:53 -07:00
Krrish Dholakia
b5f3f198f2 fix(utils.py): anthropic error handling 2024-05-06 07:25:12 -07:00
Krrish Dholakia
d83f0b02da test: fix local tests 2024-05-06 07:14:33 -07:00
Krish Dholakia
5f119f2abb
Merge pull request #3469 from jackmpcollins/fix-ollama-streamed-tool-calls
Fix Ollama streamed tool calls. Set finish_reason to tool_calls for all tool_calls responses
2024-05-06 07:13:37 -07:00
Ishaan Jaff
817a77b23f
Merge pull request #3463 from RoniGurvichCycode/main
gunicorn version bump
2024-05-06 07:11:01 -07:00
Lucca Zenóbio
b22517845e
Merge branch 'main' into main 2024-05-06 09:40:23 -03:00
Jack Collins
07b13ff7c5 Remove unused ModelResponse import 2024-05-06 00:16:58 -07:00