Commit graph

11166 commits

Author SHA1 Message Date
Krrish Dholakia
80378966a0 build: add azure resource template 2024-05-08 15:24:58 -07:00
Krish Dholakia
91bb7cd261
Merge pull request #3437 from msabramo/add-engines-model-chat-completions-endpoint
Add `/engines/{model}/chat/completions` endpoint
2024-05-08 14:30:39 -07:00
Ishaan Jaff
ba08a82885
Merge pull request #3532 from BerriAI/litellm_send_alert_on_cooling_down_deploymeny
[Feat] send alert on cooling down deployment
2024-05-08 14:30:31 -07:00
Ishaan Jaff
597b09598c feat - send alert on cooling down a deploymeny 2024-05-08 14:14:14 -07:00
Ishaan Jaff
aef3d89f0c fix add cooldown_deployment alert_type 2024-05-08 14:13:51 -07:00
Krrish Dholakia
51b6c3bdbc test(test_function_call_parsing.py): add test for function call parsing
Closes https://github.com/BerriAI/litellm/issues/2654
2024-05-08 10:54:26 -07:00
Ishaan Jaff
c60f12a70b ui - show guardrails 2024-05-08 10:51:34 -07:00
Krrish Dholakia
c5897543c8 docs(hosted.md): add feature list 2024-05-08 09:53:13 -07:00
Krrish Dholakia
6179675b09 build(config.yml): bump wait time for docker 2024-05-08 09:38:31 -07:00
Krrish Dholakia
0ab57026f3 bump: version 1.36.2 → 1.36.3 2024-05-08 09:24:43 -07:00
Krrish Dholakia
6ce13ab364 refactor(main.py): trigger new build 2024-05-08 09:24:01 -07:00
Krrish Dholakia
a854824c02 fix(main.py): fix together ai text completion call 2024-05-08 09:10:45 -07:00
Krrish Dholakia
59080431b8 docs(users.md): simplify doc with end-user vs. internal user 2024-05-08 08:34:40 -07:00
Ishaan Jaff
190e2fc3d5
Merge pull request #3519 from CyanideByte/pydantic-warning
Pydantic warning conflict with protected namespace
2024-05-08 08:20:39 -07:00
CyanideByte
b7c60031ca Pydantic warning snuck in here 2024-05-07 23:59:18 -07:00
Krish Dholakia
cec17d61a4
Merge pull request #3512 from BerriAI/litellm_ui_fixes_6
Edit cost per input + cost per output token on UI
2024-05-07 22:01:11 -07:00
Krish Dholakia
0e709fdc21
Merge branch 'main' into litellm_ui_fixes_6 2024-05-07 22:01:04 -07:00
Krish Dholakia
b6be2f61ee
Merge pull request #3518 from BerriAI/revert-3478-Issue-#3474-anthropic-roles-alternation-issue
Revert "* feat(factory.py): add support for merging consecutive messages of one role when separated with empty message of another role"
2024-05-07 21:42:26 -07:00
Krish Dholakia
303e0c6226
Revert "* feat(factory.py): add support for merging consecutive messages of one role when separated with empty message of another role" 2024-05-07 21:42:18 -07:00
Krrish Dholakia
67b4aa28bd docs(routing.md): make clear lowest cost routing is async 2024-05-07 21:34:18 -07:00
Krrish Dholakia
fbcda918de feat(ui/model_dashboard.tsx): show if model is config or db model 2024-05-07 21:34:18 -07:00
Krrish Dholakia
f45954d154 test(test_config.py): fix test 2024-05-07 21:31:03 -07:00
Krish Dholakia
a325bf2fb8
Merge pull request #3478 from nkvch/Issue-#3474-anthropic-roles-alternation-issue
* feat(factory.py): add support for merging consecutive messages of one role when separated with empty message of another role
2024-05-07 21:24:47 -07:00
Krrish Dholakia
8d04f4686f test: fix test 2024-05-07 21:14:33 -07:00
Krrish Dholakia
dc8239ecc6 feat(model_dashboard.tsx): allow adding output cost per token on ui 2024-05-07 21:09:49 -07:00
Krrish Dholakia
5a16bec6a1 feat(model_dashboard.tsx): allow user to edit input cost per token for model on ui
also contains fixes for `/model/update`
2024-05-07 20:57:21 -07:00
Ishaan Jaff
21d3407b95 fix replicate test 2024-05-07 19:48:46 -07:00
Ishaan Jaff
4e7b5aa9d7
Merge pull request #3439 from phact/patch-3
add_function_to_prompt bug fix
2024-05-07 19:31:19 -07:00
Ishaan Jaff
2725a55e7a
Merge pull request #3470 from mbektas/fix-ollama-embeddings
support sync ollama embeddings
2024-05-07 19:21:37 -07:00
Ishaan Jaff
f45feff13c bump: version 1.36.1 → 1.36.2 2024-05-07 19:16:32 -07:00
Ishaan Jaff
6e72857cf7 fix model cost map 2024-05-07 19:15:58 -07:00
Ishaan Jaff
223e386737
Merge pull request #3511 from BerriAI/litellm_router_send_exceptions_slack
[Feat] litellm.Router / litellm.completion - send llm exceptions to slack
2024-05-07 19:09:44 -07:00
Ishaan Jaff
596adf6e2f test - slack alerting on litellm router 2024-05-07 19:04:25 -07:00
Krrish Dholakia
312249ca44 feat(ui/model_dashboard.tsx): show if model is config or db model 2024-05-07 18:29:14 -07:00
Ishaan Jaff
dc74204427 fix typo 2024-05-07 18:27:49 -07:00
Ishaan Jaff
d46544d2bc docs setup alerting on router 2024-05-07 18:26:45 -07:00
Ishaan Jaff
e8053c3d0b fix slack alerting 2024-05-07 18:17:12 -07:00
Ishaan Jaff
c08352a0ce router- initialize alerting 2024-05-07 18:03:04 -07:00
Ishaan Jaff
5fd3b12d34 add router alerting type 2024-05-07 17:46:18 -07:00
Ishaan Jaff
17787db973
Merge pull request #3503 from paul-gauthier/deepseek
Added "deepseek/" as a supported provider (openai compatible)
2024-05-07 15:15:47 -07:00
Ishaan Jaff
b1230dd919 test - slack alerts on router 2024-05-07 15:12:21 -07:00
Ishaan Jaff
32f3e032e9 feat - send slack alerts litellm.router 2024-05-07 15:10:47 -07:00
Krish Dholakia
2aaaa5e1b4
Merge pull request #3506 from BerriAI/litellm_reintegrate_langfuse_url_slack_alert
feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
2024-05-07 15:03:29 -07:00
Ishaan Jaff
84055c0546
Merge pull request #3510 from BerriAI/litellm_make_lowest_cost_async
[Feat] Make lowest cost routing Async
2024-05-07 14:14:04 -07:00
Ishaan Jaff
8644aec8d3 test - lowest cost router 2024-05-07 13:52:34 -07:00
Ishaan Jaff
6983e7a84f feat - make lowest_cost pure async 2024-05-07 13:51:50 -07:00
Krrish Dholakia
f210318bf1 fix(proxy_server.py): return budget duration in user response object 2024-05-07 13:47:32 -07:00
Krrish Dholakia
f2766fddbf fix(proxy_server.py): fix /v1/models bug where it would return empty list
handle 'all-team-models' being set for a given key
2024-05-07 13:43:15 -07:00
Ishaan Jaff
8e5437c8e9
Merge pull request #3504 from BerriAI/litellm_add_lowest_cost_routing
[Feat + Test] Add lowest cost routing - litellm.Router
2024-05-07 13:22:58 -07:00
Krrish Dholakia
b872da4e6f test: fix linting error 2024-05-07 13:18:49 -07:00