Commit graph

11363 commits

Author SHA1 Message Date
Ishaan Jaff
2725a55e7a Merge pull request #3470 from mbektas/fix-ollama-embeddings
support sync ollama embeddings
2024-05-07 19:21:37 -07:00
Ishaan Jaff
f45feff13c bump: version 1.36.1 → 1.36.2 2024-05-07 19:16:32 -07:00
Ishaan Jaff
6e72857cf7 fix model cost map 2024-05-07 19:15:58 -07:00
Ishaan Jaff
223e386737 Merge pull request #3511 from BerriAI/litellm_router_send_exceptions_slack
[Feat] litellm.Router / litellm.completion - send llm exceptions to slack
2024-05-07 19:09:44 -07:00
Ishaan Jaff
596adf6e2f test - slack alerting on litellm router 2024-05-07 19:04:25 -07:00
Krrish Dholakia
312249ca44 feat(ui/model_dashboard.tsx): show if model is config or db model 2024-05-07 18:29:14 -07:00
Ishaan Jaff
dc74204427 fix typo 2024-05-07 18:27:49 -07:00
Ishaan Jaff
d46544d2bc docs setup alerting on router 2024-05-07 18:26:45 -07:00
Ishaan Jaff
e8053c3d0b fix slack alerting 2024-05-07 18:17:12 -07:00
Ishaan Jaff
c08352a0ce router- initialize alerting 2024-05-07 18:03:04 -07:00
Ishaan Jaff
5fd3b12d34 add router alerting type 2024-05-07 17:46:18 -07:00
Ishaan Jaff
17787db973 Merge pull request #3503 from paul-gauthier/deepseek
Added "deepseek/" as a supported provider (openai compatible)
2024-05-07 15:15:47 -07:00
Ishaan Jaff
b1230dd919 test - slack alerts on router 2024-05-07 15:12:21 -07:00
Ishaan Jaff
32f3e032e9 feat - send slack alerts litellm.router 2024-05-07 15:10:47 -07:00
Krish Dholakia
2aaaa5e1b4 Merge pull request #3506 from BerriAI/litellm_reintegrate_langfuse_url_slack_alert
feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
2024-05-07 15:03:29 -07:00
David Manouchehri
44b1b21911 feat(utils.py) - Add OIDC caching for Google Cloud Run and GitHub Actions. 2024-05-07 21:24:55 +00:00
Ishaan Jaff
84055c0546 Merge pull request #3510 from BerriAI/litellm_make_lowest_cost_async
[Feat] Make lowest cost routing Async
2024-05-07 14:14:04 -07:00
Ishaan Jaff
8644aec8d3 test - lowest cost router 2024-05-07 13:52:34 -07:00
Ishaan Jaff
6983e7a84f feat - make lowest_cost pure async 2024-05-07 13:51:50 -07:00
Krrish Dholakia
f210318bf1 fix(proxy_server.py): return budget duration in user response object 2024-05-07 13:47:32 -07:00
Krrish Dholakia
f2766fddbf fix(proxy_server.py): fix /v1/models bug where it would return empty list
handle 'all-team-models' being set for a given key
2024-05-07 13:43:15 -07:00
Ishaan Jaff
8e5437c8e9 Merge pull request #3504 from BerriAI/litellm_add_lowest_cost_routing
[Feat + Test] Add lowest cost routing - litellm.Router
2024-05-07 13:22:58 -07:00
Krrish Dholakia
b872da4e6f test: fix linting error 2024-05-07 13:18:49 -07:00
Ishaan Jaff
429f569360 test - lowest cost with custom pricing 2024-05-07 13:17:32 -07:00
Ishaan Jaff
d5f93048cc docs - lowest cost routing 2024-05-07 13:15:30 -07:00
Krrish Dholakia
e85468badb test: fix linting error 2024-05-07 13:12:06 -07:00
Jean-Luc Duckworth
d60aa8282e Fixed typo. test_jwt.py tests pass 2024-05-07 16:08:36 -04:00
Ishaan Jaff
486cbb990c fix allow user to pass input_cost and output_cost 2024-05-07 13:08:16 -07:00
Ishaan Jaff
71a92b4fef test - lowest cost router 2024-05-07 13:04:12 -07:00
David Manouchehri
cb49fb004d fix(azure.py): Correct invalid .get to a .post for OIDC 2024-05-07 20:01:46 +00:00
David Manouchehri
9a0bb36865 fix+feat(router.py): Fix missing azure_ad_token, and allow use OIDC auth 2024-05-07 20:01:40 +00:00
David Manouchehri
e268354acc feat(azure.py): Support OIDC auth 2024-05-07 20:01:33 +00:00
Krrish Dholakia
872470ff1f feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
this ensures langfuse trace url returned in llm api exception err
2024-05-07 12:58:49 -07:00
Ishaan Jaff
690d7b10a6 fix - default value for cost 2024-05-07 12:51:52 -07:00
Ishaan Jaff
245960708d fix - lowest cost routing 2024-05-07 12:49:20 -07:00
Ishaan Jaff
6cb059cce8 fix - use cost-based-routing 2024-05-07 12:48:53 -07:00
Ishaan Jaff
41ffaee821 test - basic lowest cost routing 2024-05-07 12:48:20 -07:00
Jean-Luc Duckworth
d5767e9403 Expanding jwt access to other RS and PS algos. Updated to resolve merge conflicts. 2024-05-07 15:45:07 -04:00
Ishaan Jaff
4c909194c7 docs - lowest - latency routing 2024-05-07 12:43:44 -07:00
Ishaan Jaff
e5e477d7f5 test - lowest cost routing 2024-05-07 12:19:44 -07:00
Ishaan Jaff
1ba4440096 feat add lowest cost router 2024-05-07 12:12:39 -07:00
Ishaan Jaff
31ac43bfdc feat - add lowst cost router 2024-05-07 12:12:09 -07:00
phact
4c64e3da10 locals().copy() 2024-05-07 14:58:35 -04:00
Paul Gauthier
82a4c68e60 Added deepseek completion test 2024-05-07 11:58:05 -07:00
Krrish Dholakia
724660606a fix(slack_alerting.py): fix storing + reading datetime object from cache
this converts the dt object to isoformat before storing, and loads it back to dt when comparing
2024-05-07 11:44:55 -07:00
Paul Gauthier
9162f9c2c5 Added costs & context json 2024-05-07 11:44:55 -07:00
Paul Gauthier
90eb0ea022 Added support for the deepseek api 2024-05-07 11:44:03 -07:00
Krish Dholakia
93e5fb49d3 Merge pull request #3500 from ghaemisr/main
Added support for JWT auth with PEM cert public keys
2024-05-07 11:07:30 -07:00
phact
7c5c9a8152 looks like cohere does support function calling 2024-05-07 13:41:05 -04:00
Sara Ghaemi
86e0dd68c3 updated tests 2024-05-07 13:28:57 -04:00