Commit graph

11876 commits

Author SHA1 Message Date
Ishaan Jaff
57a7870348 ui - show retry policy per model group 2024-05-06 17:23:51 -07:00
Krrish Dholakia
718f423d7d feat(slack_alerting.py): support sending daily reports on deployments
allow admin to easily know slow + failing deployments

Closes https://github.com/BerriAI/litellm/issues/3483
2024-05-06 17:18:42 -07:00
Ishaan Jaff
040dc1f23e fix retry policy 2024-05-06 16:43:42 -07:00
Ishaan Jaff
1de50d62f7 fix test router debug logs 2024-05-06 16:38:16 -07:00
Ishaan Jaff
1725b7a9e1
Merge pull request #3461 from BerriAI/litellm_set_router_retry_policy_ui
[UI] Set Router Retry Policy
2024-05-06 16:36:09 -07:00
Ishaan Jaff
02f0ef7158 feat - format model price in slack alerts for new model 2024-05-06 15:57:22 -07:00
Ishaan Jaff
c600371e6e feat - send alert on adding new model 2024-05-06 15:45:07 -07:00
Ishaan Jaff
62adaa0fd4 feat - send slack alerts on adding new model 2024-05-06 15:43:02 -07:00
Ishaan Jaff
448c3827e6
Merge pull request #3488 from BerriAI/litellm_show_key_name_team_slack_alerts
[Feat] show `key alias` and `team alias` in slack alerts
2024-05-06 14:36:01 -07:00
Ishaan Jaff
a9ec5789e7 feat - add key name and team alias to slack alerts 2024-05-06 14:30:10 -07:00
Ishaan Jaff
6ff37aabb0 fix add key name + team name in alerting messages 2024-05-06 14:29:04 -07:00
Ishaan Jaff
24b9fbe19f fix - show api key name and alias in alerts 2024-05-06 14:15:34 -07:00
dependabot[bot]
6fc625b54d
build(deps): bump jinja2 from 3.1.3 to 3.1.4
Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.3 to 3.1.4.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/3.1.3...3.1.4)

---
updated-dependencies:
- dependency-name: jinja2
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-05-06 20:55:32 +00:00
Ishaan Jaff
562ef2d2e1 fix - add better debugging on num_callbacks test 2024-05-06 13:42:20 -07:00
Krrish Dholakia
863f9c60a2 refactor: trigger new build 2024-05-06 11:46:30 -07:00
Krrish Dholakia
bbd5ba3ea1 build(config.yml): update env variable 2024-05-06 11:45:25 -07:00
Krrish Dholakia
c6d830d377 fix(anthropic.py): remove raise error on 'empty content'
Fixes https://github.com/BerriAI/litellm/issues/3453
2024-05-06 11:42:09 -07:00
Ishaan Jaff
31c4f5f811 (ci/cd) add detailed debugging info 2024-05-06 11:40:16 -07:00
Ishaan Jaff
4f8ee64e8a ci/cd run again 2024-05-06 11:32:14 -07:00
Ishaan Jaff
c83348b7a8 fix build and test 2024-05-06 11:24:31 -07:00
Krrish Dholakia
b014a72f7a test(test_openai_endpoints.py): change key 2024-05-06 11:19:47 -07:00
Ishaan Jaff
6a06aba443 (ci/cd) use db connection limit 2024-05-06 11:15:22 -07:00
Ishaan Jaff
44c1dedf64 test fix - test_async_chat_openai_stream 2024-05-06 11:13:07 -07:00
Ishaan Jaff
4bd3967a1a (ci/cd) run again 2024-05-06 11:04:43 -07:00
Krrish Dholakia
4b5cf26c1b fix(utils.py): handle gemini chunk no parts error
Fixes https://github.com/BerriAI/litellm/issues/3468
2024-05-06 10:59:53 -07:00
Ishaan Jaff
e8d3dd475a fix fake endpoint used on ci/cd 2024-05-06 10:37:39 -07:00
Krrish Dholakia
cad049b6a8 build(bump-helm-chart-app-version): bump helm chart app version to latest 2024-05-06 10:26:01 -07:00
Krrish Dholakia
7ebe00599b build(host-helm-chart-on-root): allows helm repo add to work with litellm 2024-05-06 10:11:56 -07:00
Simon Sanchez Viloria
6181d1eaad Merge branch 'main' into feature/watsonx-integration 2024-05-06 17:27:14 +02:00
Ishaan Jaff
ec63a30095 docs - deploy litellm on gcp cloud run 2024-05-06 08:10:35 -07:00
Simon Sanchez Viloria
83a274b54b (feat) support for async stream to watsonx provider 2024-05-06 17:08:40 +02:00
Simon Sanchez Viloria
62b3f25398 (fix) fix ibm_granite_pt and use hf_chat_template for llama-3 instruct prompts 2024-05-06 17:08:40 +02:00
Ishaan Jaff
e0001a9121 docs - add using vertex embedding models 2024-05-06 07:56:17 -07:00
Ishaan Jaff
c762e60ae5 stash ui changes 2024-05-06 07:53:55 -07:00
Rajan Paneru
2347cb2247 bring back the to-be-deprecated mistral-medium 2024-05-07 00:19:19 +09:30
Krish Dholakia
9f58583888
Merge pull request #3299 from themrzmaster/main
Allowing extra headers for bedrock
2024-05-06 07:45:53 -07:00
Krrish Dholakia
b5f3f198f2 fix(utils.py): anthropic error handling 2024-05-06 07:25:12 -07:00
Krrish Dholakia
d83f0b02da test: fix local tests 2024-05-06 07:14:33 -07:00
Krish Dholakia
5f119f2abb
Merge pull request #3469 from jackmpcollins/fix-ollama-streamed-tool-calls
Fix Ollama streamed tool calls. Set finish_reason to tool_calls for all tool_calls responses
2024-05-06 07:13:37 -07:00
Ishaan Jaff
817a77b23f
Merge pull request #3463 from RoniGurvichCycode/main
gunicorn version bump
2024-05-06 07:11:01 -07:00
Lucca Zenóbio
b22517845e
Merge branch 'main' into main 2024-05-06 09:40:23 -03:00
Jack Collins
07b13ff7c5 Remove unused ModelResponse import 2024-05-06 00:16:58 -07:00
Jack Collins
51c02fdadf Add tests for ollama + ollama chat tool calls +/- stream 2024-05-06 00:13:42 -07:00
Rajan Paneru
d0d063e25a Synced the model with the Mistral
* Ordered them based on previous order
* Excluded to-be-deprecated models
* Use the model name, instead of the slug in the first column as referred by the name of the column
2024-05-06 13:16:24 +09:30
Mehmet Bektas
64a64c68c5 update ollama test file 2024-05-05 20:25:43 -07:00
Mehmet Bektas
3acad270e5 support sync ollama embeddings 2024-05-05 19:44:25 -07:00
Jack Collins
bb6132eee1 Fix: get format from data not optional_params ollama non-stream completion 2024-05-05 18:59:26 -07:00
Jack Collins
81b1c46c6f Add missing import itertools.chain 2024-05-05 18:54:08 -07:00
Jack Collins
03b82b78c1 Fix: Set finish_reason to tool_calls for non-stream responses in ollama 2024-05-05 18:52:31 -07:00
Jack Collins
297543e3e5 Parse streamed function calls as single delta in ollama 2024-05-05 18:52:20 -07:00