Ishaan Jaff | 2725a55e7a | Merge pull request #3470 from mbektas/fix-ollama-embeddings; support sync ollama embeddings | 2024-05-07 19:21:37 -07:00
Ishaan Jaff | f45feff13c | bump: version 1.36.1 → 1.36.2 | 2024-05-07 19:16:32 -07:00
Ishaan Jaff | 6e72857cf7 | fix model cost map | 2024-05-07 19:15:58 -07:00
Ishaan Jaff | 223e386737 | Merge pull request #3511 from BerriAI/litellm_router_send_exceptions_slack; [Feat] litellm.Router / litellm.completion - send llm exceptions to slack | 2024-05-07 19:09:44 -07:00
Ishaan Jaff | 596adf6e2f | test - slack alerting on litellm router | 2024-05-07 19:04:25 -07:00
Krrish Dholakia | 312249ca44 | feat(ui/model_dashboard.tsx): show if model is config or db model | 2024-05-07 18:29:14 -07:00
Ishaan Jaff | dc74204427 | fix typo | 2024-05-07 18:27:49 -07:00
Ishaan Jaff | d46544d2bc | docs setup alerting on router | 2024-05-07 18:26:45 -07:00
Ishaan Jaff | e8053c3d0b | fix slack alerting | 2024-05-07 18:17:12 -07:00
Ishaan Jaff | c08352a0ce | router- initialize alerting | 2024-05-07 18:03:04 -07:00
Ishaan Jaff | 5fd3b12d34 | add router alerting type | 2024-05-07 17:46:18 -07:00
Ishaan Jaff | 17787db973 | Merge pull request #3503 from paul-gauthier/deepseek; Added "deepseek/" as a supported provider (openai compatible) | 2024-05-07 15:15:47 -07:00
Ishaan Jaff | b1230dd919 | test - slack alerts on router | 2024-05-07 15:12:21 -07:00
Ishaan Jaff | 32f3e032e9 | feat - send slack alerts litellm.router | 2024-05-07 15:10:47 -07:00
Krish Dholakia | 2aaaa5e1b4 | Merge pull request #3506 from BerriAI/litellm_reintegrate_langfuse_url_slack_alert; feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts | 2024-05-07 15:03:29 -07:00
David Manouchehri | 44b1b21911 | feat(utils.py) - Add OIDC caching for Google Cloud Run and GitHub Actions. | 2024-05-07 21:24:55 +00:00
Ishaan Jaff | 84055c0546 | Merge pull request #3510 from BerriAI/litellm_make_lowest_cost_async; [Feat] Make lowest cost routing Async | 2024-05-07 14:14:04 -07:00
Ishaan Jaff | 8644aec8d3 | test - lowest cost router | 2024-05-07 13:52:34 -07:00
Ishaan Jaff | 6983e7a84f | feat - make lowest_cost pure async | 2024-05-07 13:51:50 -07:00
Krrish Dholakia | f210318bf1 | fix(proxy_server.py): return budget duration in user response object | 2024-05-07 13:47:32 -07:00
Krrish Dholakia | f2766fddbf | fix(proxy_server.py): fix /v1/models bug where it would return empty list; handle 'all-team-models' being set for a given key | 2024-05-07 13:43:15 -07:00
Ishaan Jaff | 8e5437c8e9 | Merge pull request #3504 from BerriAI/litellm_add_lowest_cost_routing; [Feat + Test] Add lowest cost routing - litellm.Router | 2024-05-07 13:22:58 -07:00
Krrish Dholakia | b872da4e6f | test: fix linting error | 2024-05-07 13:18:49 -07:00
Ishaan Jaff | 429f569360 | test - lowest cost with custom pricing | 2024-05-07 13:17:32 -07:00
Ishaan Jaff | d5f93048cc | docs - lowest cost routing | 2024-05-07 13:15:30 -07:00
Krrish Dholakia | e85468badb | test: fix linting error | 2024-05-07 13:12:06 -07:00
Jean-Luc Duckworth | d60aa8282e | Fixed typo. test_jwt.py tests pass | 2024-05-07 16:08:36 -04:00
Ishaan Jaff | 486cbb990c | fix allow user to pass input_cost and output_cost | 2024-05-07 13:08:16 -07:00
Ishaan Jaff | 71a92b4fef | test - lowest cost router | 2024-05-07 13:04:12 -07:00
David Manouchehri | cb49fb004d | fix(azure.py): Correct invalid .get to a .post for OIDC | 2024-05-07 20:01:46 +00:00
David Manouchehri | 9a0bb36865 | fix+feat(router.py): Fix missing azure_ad_token, and allow use OIDC auth | 2024-05-07 20:01:40 +00:00
David Manouchehri | e268354acc | feat(azure.py): Support OIDC auth | 2024-05-07 20:01:33 +00:00
Krrish Dholakia | 872470ff1f | feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts; this ensures langfuse trace url returned in llm api exception err | 2024-05-07 12:58:49 -07:00
Ishaan Jaff | 690d7b10a6 | fix - default value for cost | 2024-05-07 12:51:52 -07:00
Ishaan Jaff | 245960708d | fix - lowest cost routing | 2024-05-07 12:49:20 -07:00
Ishaan Jaff | 6cb059cce8 | fix - use cost-based-routing | 2024-05-07 12:48:53 -07:00
Ishaan Jaff | 41ffaee821 | test - basic lowest cost routing | 2024-05-07 12:48:20 -07:00
Jean-Luc Duckworth | d5767e9403 | Expanding jwt access to other RS and PS algos. Updated to resolve merge conflicts. | 2024-05-07 15:45:07 -04:00
Ishaan Jaff | 4c909194c7 | docs - lowest - latency routing | 2024-05-07 12:43:44 -07:00
Ishaan Jaff | e5e477d7f5 | test - lowest cost routing | 2024-05-07 12:19:44 -07:00
Ishaan Jaff | 1ba4440096 | feat add lowest cost router | 2024-05-07 12:12:39 -07:00
Ishaan Jaff | 31ac43bfdc | feat - add lowst cost router | 2024-05-07 12:12:09 -07:00
phact | 4c64e3da10 | locals().copy() | 2024-05-07 14:58:35 -04:00
Paul Gauthier | 82a4c68e60 | Added deepseek completion test | 2024-05-07 11:58:05 -07:00
Krrish Dholakia | 724660606a | fix(slack_alerting.py): fix storing + reading datetime object from cache; this converts the dt object to isoformat before storing, and loads it back to dt when comparing | 2024-05-07 11:44:55 -07:00
Paul Gauthier | 9162f9c2c5 | Added costs & context json | 2024-05-07 11:44:55 -07:00
Paul Gauthier | 90eb0ea022 | Added support for the deepseek api | 2024-05-07 11:44:03 -07:00
Krish Dholakia | 93e5fb49d3 | Merge pull request #3500 from ghaemisr/main; Added support for JWT auth with PEM cert public keys | 2024-05-07 11:07:30 -07:00
phact | 7c5c9a8152 | looks like cohere does support function calling | 2024-05-07 13:41:05 -04:00
Sara Ghaemi | 86e0dd68c3 | updated tests | 2024-05-07 13:28:57 -04:00