Ishaan Jaff
57a7870348
ui - show retry policy per model group
2024-05-06 17:23:51 -07:00
Krrish Dholakia
718f423d7d
feat(slack_alerting.py): support sending daily reports on deployments
...
allow admin to easily know slow + failing deployments
Closes https://github.com/BerriAI/litellm/issues/3483
2024-05-06 17:18:42 -07:00
Ishaan Jaff
040dc1f23e
fix retry policy
2024-05-06 16:43:42 -07:00
Ishaan Jaff
1de50d62f7
fix test router debug logs
2024-05-06 16:38:16 -07:00
Ishaan Jaff
1725b7a9e1
Merge pull request #3461 from BerriAI/litellm_set_router_retry_policy_ui
...
[UI] Set Router Retry Policy
2024-05-06 16:36:09 -07:00
Ishaan Jaff
02f0ef7158
feat - format model price in slack alerts for new model
2024-05-06 15:57:22 -07:00
Ishaan Jaff
c600371e6e
feat - send alert on adding new model
2024-05-06 15:45:07 -07:00
Ishaan Jaff
62adaa0fd4
feat - send slack alerts on adding new model
2024-05-06 15:43:02 -07:00
Ishaan Jaff
448c3827e6
Merge pull request #3488 from BerriAI/litellm_show_key_name_team_slack_alerts
...
[Feat] show `key alias` and `team alias` in slack alerts
2024-05-06 14:36:01 -07:00
Ishaan Jaff
a9ec5789e7
feat - add key name and team alias to slack alerts
2024-05-06 14:30:10 -07:00
Ishaan Jaff
6ff37aabb0
fix add key name + team name in alerting messages
2024-05-06 14:29:04 -07:00
Ishaan Jaff
24b9fbe19f
fix - show api key name and alias in alerts
2024-05-06 14:15:34 -07:00
dependabot[bot]
6fc625b54d
build(deps): bump jinja2 from 3.1.3 to 3.1.4
...
Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.3 to 3.1.4.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/3.1.3...3.1.4)
---
updated-dependencies:
- dependency-name: jinja2
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
2024-05-06 20:55:32 +00:00
Ishaan Jaff
562ef2d2e1
fix - add better debugging on num_callbacks test
2024-05-06 13:42:20 -07:00
Krrish Dholakia
863f9c60a2
refactor: trigger new build
2024-05-06 11:46:30 -07:00
Krrish Dholakia
bbd5ba3ea1
build(config.yml): update env variable
2024-05-06 11:45:25 -07:00
Krrish Dholakia
c6d830d377
fix(anthropic.py): remove raise error on 'empty content'
...
Fixes https://github.com/BerriAI/litellm/issues/3453
2024-05-06 11:42:09 -07:00
Ishaan Jaff
31c4f5f811
(ci/cd) add detailed debugging info
2024-05-06 11:40:16 -07:00
Ishaan Jaff
4f8ee64e8a
ci/cd run again
2024-05-06 11:32:14 -07:00
Ishaan Jaff
c83348b7a8
fix build and test
2024-05-06 11:24:31 -07:00
Krrish Dholakia
b014a72f7a
test(test_openai_endpoints.py): change key
2024-05-06 11:19:47 -07:00
Ishaan Jaff
6a06aba443
(ci/cd) use db connection limit
2024-05-06 11:15:22 -07:00
Ishaan Jaff
44c1dedf64
test fix - test_async_chat_openai_stream
2024-05-06 11:13:07 -07:00
Ishaan Jaff
4bd3967a1a
(ci/cd) run again
2024-05-06 11:04:43 -07:00
Krrish Dholakia
4b5cf26c1b
fix(utils.py): handle gemini chunk no parts error
...
Fixes https://github.com/BerriAI/litellm/issues/3468
2024-05-06 10:59:53 -07:00
Ishaan Jaff
e8d3dd475a
fix fake endpoint used on ci/cd
2024-05-06 10:37:39 -07:00
Krrish Dholakia
cad049b6a8
build(bump-helm-chart-app-version): bump helm chart app version to latest
2024-05-06 10:26:01 -07:00
Krrish Dholakia
7ebe00599b
build(host-helm-chart-on-root): allows helm repo add to work with litellm
2024-05-06 10:11:56 -07:00
Simon Sanchez Viloria
6181d1eaad
Merge branch 'main' into feature/watsonx-integration
2024-05-06 17:27:14 +02:00
Ishaan Jaff
ec63a30095
docs - deploy litellm on gcp cloud run
2024-05-06 08:10:35 -07:00
Simon Sanchez Viloria
83a274b54b
(feat) support for async stream to watsonx provider
2024-05-06 17:08:40 +02:00
Simon Sanchez Viloria
62b3f25398
(fix) fix ibm_granite_pt and use hf_chat_template for llama-3 instruct prompts
2024-05-06 17:08:40 +02:00
Ishaan Jaff
e0001a9121
docs - add using vertex embedding models
2024-05-06 07:56:17 -07:00
Ishaan Jaff
c762e60ae5
stash ui changes
2024-05-06 07:53:55 -07:00
Rajan Paneru
2347cb2247
bring back the to-be-deprecated mistral-medium
2024-05-07 00:19:19 +09:30
Krish Dholakia
9f58583888
Merge pull request #3299 from themrzmaster/main
...
Allowing extra headers for bedrock
2024-05-06 07:45:53 -07:00
Krrish Dholakia
b5f3f198f2
fix(utils.py): anthropic error handling
2024-05-06 07:25:12 -07:00
Krrish Dholakia
d83f0b02da
test: fix local tests
2024-05-06 07:14:33 -07:00
Krish Dholakia
5f119f2abb
Merge pull request #3469 from jackmpcollins/fix-ollama-streamed-tool-calls
...
Fix Ollama streamed tool calls. Set finish_reason to tool_calls for all tool_calls responses
2024-05-06 07:13:37 -07:00
Ishaan Jaff
817a77b23f
Merge pull request #3463 from RoniGurvichCycode/main
...
gunicorn version bump
2024-05-06 07:11:01 -07:00
Lucca Zenóbio
b22517845e
Merge branch 'main' into main
2024-05-06 09:40:23 -03:00
Jack Collins
07b13ff7c5
Remove unused ModelResponse import
2024-05-06 00:16:58 -07:00
Jack Collins
51c02fdadf
Add tests for ollama + ollama chat tool calls +/- stream
2024-05-06 00:13:42 -07:00
Rajan Paneru
d0d063e25a
Synced the model with the Mistral
...
* Ordered them based on previous order
* Excluded to-be-deprecated models
* Use the model name instead of the slug in the first column, as the column name indicates
2024-05-06 13:16:24 +09:30
Mehmet Bektas
64a64c68c5
update ollama test file
2024-05-05 20:25:43 -07:00
Mehmet Bektas
3acad270e5
support sync ollama embeddings
2024-05-05 19:44:25 -07:00
Jack Collins
bb6132eee1
Fix: get format from data not optional_params ollama non-stream completion
2024-05-05 18:59:26 -07:00
Jack Collins
81b1c46c6f
Add missing import itertools.chain
2024-05-05 18:54:08 -07:00
Jack Collins
03b82b78c1
Fix: Set finish_reason to tool_calls for non-stream responses in ollama
2024-05-05 18:52:31 -07:00
Jack Collins
297543e3e5
Parse streamed function calls as single delta in ollama
2024-05-05 18:52:20 -07:00