Commit graph

10408 commits

Author SHA1 Message Date
Simon Sanchez Viloria
72cbe369be (docs) updated watsonx cookbook 2024-04-24 17:19:02 +02:00
Simon Sanchez Viloria
777b4b2bbc (feat) make manage_response work with request.request instead of httpx.Request 2024-04-24 12:55:25 +02:00
Simon Sanchez Viloria
9fc30e8b31 (test) Added completion and embedding tests for watsonx provider 2024-04-24 12:52:29 +02:00
Simon Sanchez Viloria
f9a7456eaa (docs) updated cookbook 2024-04-23 16:22:41 +02:00
Simon Sanchez Viloria
d72b725273 Fixed bugs in prompt factory for ibm-mistral and llama 3 models. 2024-04-23 16:20:49 +02:00
Simon S. Viloria
2ef4fb2efa
Merge branch 'BerriAI:main' into feature/watsonx-integration 2024-04-23 12:18:34 +02:00
Simon Sanchez Viloria
e64aceea91 (feat) Update WatsonX credentials and variable names 2024-04-23 12:16:04 +02:00
Simon Sanchez Viloria
7cbe9835c9 (docs) updated litellm watsonx cookbook 2024-04-23 12:01:13 +02:00
Simon Sanchez Viloria
74d2ba0a23 feat - watsonx refractoring, removed dependency, and added support for embedding calls 2024-04-23 12:01:13 +02:00
Ishaan Jaff
774fb33f28
Merge pull request #3231 from Manouchehri/fix-groq-1
(utils.py) - Fix response_format typo for Groq
2024-04-22 22:06:48 -07:00
David Manouchehri
6d61607ee3
(utils.py) - Fix response_format typo for Groq 2024-04-23 04:26:26 +00:00
Krrish Dholakia
ec2c70e362 fix(vertex_ai.py): fix streaming logic 2024-04-22 19:15:20 -07:00
Krrish Dholakia
0bb8a4434e fix(vertex_ai.py): remove ExtendedGenerationConfig usage 2024-04-22 18:23:21 -07:00
Ishaan Jaff
7e9587c102 ui - new build 2024-04-22 18:16:54 -07:00
Ishaan Jaff
fdf432798e
Merge pull request #3228 from BerriAI/litellm_ui_polish
[Fix] Non-Admin SSO Login
2024-04-22 18:15:10 -07:00
Ishaan Jaff
9250f61a4c fix - sso login for non admins 2024-04-22 17:57:47 -07:00
Krish Dholakia
14d92aa944
Merge pull request #3214 from Manouchehri/add-p-and-f-pen-gemmini-1
(Vertex AI) - Add `frequency_penalty` and `presence_penalty` support
2024-04-22 16:46:49 -07:00
Krish Dholakia
b1512d23be
Merge pull request #3211 from Manouchehri/simplify-gemini-config-1
improve(vertex_ai.py): Switch to simpler dict type for supporting JSON mode
2024-04-22 16:46:33 -07:00
Ishaan Jaff
9886392694
Merge pull request #3226 from BerriAI/litellm_alerting_fix
[Bug-Fix] Alerting - don't send hanging request alert on failed request
2024-04-22 16:32:35 -07:00
Ishaan Jaff
bd0d6bce0f fix models displayed when logging in 2024-04-22 16:31:47 -07:00
Ishaan Jaff
8874eaa0b3 fix - track litellm_status=fail 2024-04-22 16:11:04 -07:00
Ishaan Jaff
517f577292 fix - dont send alert on fail request 2024-04-22 16:07:58 -07:00
Krish Dholakia
9a49065b0b
Merge pull request #3224 from BerriAI/litellm_prometheus_key_owner
fix(prometheus.py): add user tracking to prometheus
2024-04-22 15:45:26 -07:00
Ishaan Jaff
dbb06141a3
Merge pull request #3219 from BerriAI/litellm_ui_show_teams_ui
[UI-Fix] Show all teams on Admin UI
2024-04-22 15:22:14 -07:00
Ishaan Jaff
c777e5d9d2
Merge pull request #3223 from paul-gauthier/main
Added openrouter/meta-llama/llama-3-70b-instruct context and cost metrics
2024-04-22 15:18:07 -07:00
Krrish Dholakia
6ac0dba5c2 fix(prometheus.py): add user tracking to prometheus 2024-04-22 15:14:38 -07:00
Paul Gauthier
0a021a6fa2 Added openrouter/meta-llama/llama-3-70b-instruct context and cost metadata
Per https://openrouter.ai/models/meta-llama/llama-3-70b-instruct

Meta: Llama 3 70B Instruct
meta-llama/llama-3-70b-instruct

Updated Apr 18
8,192 context
$0.8/M input tkns
$0.8/M output tkns
2024-04-22 15:07:58 -07:00
Ishaan Jaff
aa73c115f6
Merge pull request #3205 from bllchmbrs/patch-1
Update langsmith_integration.md
2024-04-22 14:21:25 -07:00
Ishaan Jaff
4fbb92e9e9
Merge pull request #3218 from BerriAI/ui_cleanup_text_input
[UI-Polish] Cleanup Inputing Key Name, Team Name, User Email
2024-04-22 14:20:29 -07:00
Ishaan Jaff
50bbd188fb ui - show all teams on ui 2024-04-22 14:15:50 -07:00
Ishaan Jaff
cd3b2a21c1 ui - find all teams 2024-04-22 14:15:09 -07:00
Ishaan Jaff
877c4e27f4
Merge pull request #3212 from BerriAI/ui_increase_default_session_time
UI - increase default session time to 2 hours
2024-04-22 13:46:18 -07:00
Ishaan Jaff
1a945092de
Merge pull request #3210 from BerriAI/litellm_ui_round_up_team_spend_2_decimals
[UI] round up team spend to 2 decimals + diversify legend for team spend
2024-04-22 13:45:52 -07:00
Ishaan Jaff
753bed86e5 ui - clean up order 2024-04-22 13:44:37 -07:00
Krrish Dholakia
d4bca6707b ci(proxy_server_config.yaml): use redis for usage-based-routing-v2 2024-04-22 13:34:36 -07:00
Ishaan Jaff
40ae951634 ui - cleanup input text boxes 2024-04-22 13:34:23 -07:00
Ishaan Jaff
69121360ba fix text input box on ui 2024-04-22 13:29:48 -07:00
Ishaan Jaff
860f20d1ab ui - cleanup litellm logo 2024-04-22 13:27:42 -07:00
Krrish Dholakia
ff30dc3cf9 bump: version 1.35.19 → 1.35.20 2024-04-22 13:02:06 -07:00
Krrish Dholakia
a520e1bd6f fix(router.py): add random shuffle and tpm-based shuffle for async shuffle logic 2024-04-22 12:58:59 -07:00
David Manouchehri
c643e04ada
improve(vertex_ai.py): Add frequency_penalty and presence_penalty. 2024-04-22 18:02:59 +00:00
Krrish Dholakia
c015e5e2c6 bump: version 1.35.18 → 1.35.19 2024-04-22 10:54:52 -07:00
Krrish Dholakia
1e9487f639 refactor(main.py): trigger new build 2024-04-22 10:54:35 -07:00
Krrish Dholakia
be4a3de27c fix(utils.py): support deepinfra response object 2024-04-22 10:51:11 -07:00
Ishaan Jaff
bb065f64c6 increase ui default session time to 2 hours 2024-04-22 10:00:53 -07:00
David Manouchehri
1a7eec5786
improve(vertex_ai.py): Switch to simpler dict type. 2024-04-22 17:00:37 +00:00
Ishaan Jaff
127a030a5f ui - simplify team spend color scheme 2024-04-22 09:53:05 -07:00
Ishaan Jaff
f54982a560 fix - round spend to 2 decimals 2024-04-22 09:17:40 -07:00
Ishaan Jaff
b82dd29c99
Merge pull request #3209 from BerriAI/litellm_show_langfuse_link_slack_alerts
[Feat]- show langfuse trace in slack alerts
2024-04-22 08:55:08 -07:00
Ishaan Jaff
094583f18e feat - show langfuse trace in alerts 2024-04-22 08:51:46 -07:00