Simon Sanchez Viloria
|
72cbe369be
|
(docs) updated watsonx cookbook
|
2024-04-24 17:19:02 +02:00 |
|
Simon Sanchez Viloria
|
777b4b2bbc
|
(feat) make manage_response work with request.request instead of httpx.Request
|
2024-04-24 12:55:25 +02:00 |
|
Simon Sanchez Viloria
|
9fc30e8b31
|
(test) Added completion and embedding tests for watsonx provider
|
2024-04-24 12:52:29 +02:00 |
|
Simon Sanchez Viloria
|
f9a7456eaa
|
(docs) updated cookbook
|
2024-04-23 16:22:41 +02:00 |
|
Simon Sanchez Viloria
|
d72b725273
|
Fixed bugs in prompt factory for ibm-mistral and llama 3 models.
|
2024-04-23 16:20:49 +02:00 |
|
Simon S. Viloria
|
2ef4fb2efa
|
Merge branch 'BerriAI:main' into feature/watsonx-integration
|
2024-04-23 12:18:34 +02:00 |
|
Simon Sanchez Viloria
|
e64aceea91
|
(feat) Update WatsonX credentials and variable names
|
2024-04-23 12:16:04 +02:00 |
|
Simon Sanchez Viloria
|
7cbe9835c9
|
(docs) updated litellm watsonx cookbook
|
2024-04-23 12:01:13 +02:00 |
|
Simon Sanchez Viloria
|
74d2ba0a23
|
feat - watsonx refactoring, removed dependency, and added support for embedding calls
|
2024-04-23 12:01:13 +02:00 |
|
Ishaan Jaff
|
774fb33f28
|
Merge pull request #3231 from Manouchehri/fix-groq-1
(utils.py) - Fix response_format typo for Groq
|
2024-04-22 22:06:48 -07:00 |
|
David Manouchehri
|
6d61607ee3
|
(utils.py) - Fix response_format typo for Groq
|
2024-04-23 04:26:26 +00:00 |
|
Krrish Dholakia
|
ec2c70e362
|
fix(vertex_ai.py): fix streaming logic
|
2024-04-22 19:15:20 -07:00 |
|
Krrish Dholakia
|
0bb8a4434e
|
fix(vertex_ai.py): remove ExtendedGenerationConfig usage
|
2024-04-22 18:23:21 -07:00 |
|
Ishaan Jaff
|
7e9587c102
|
ui - new build
|
2024-04-22 18:16:54 -07:00 |
|
Ishaan Jaff
|
fdf432798e
|
Merge pull request #3228 from BerriAI/litellm_ui_polish
[Fix] Non-Admin SSO Login
|
2024-04-22 18:15:10 -07:00 |
|
Ishaan Jaff
|
9250f61a4c
|
fix - sso login for non admins
|
2024-04-22 17:57:47 -07:00 |
|
Krish Dholakia
|
14d92aa944
|
Merge pull request #3214 from Manouchehri/add-p-and-f-pen-gemmini-1
(Vertex AI) - Add `frequency_penalty` and `presence_penalty` support
|
2024-04-22 16:46:49 -07:00 |
|
Krish Dholakia
|
b1512d23be
|
Merge pull request #3211 from Manouchehri/simplify-gemini-config-1
improve(vertex_ai.py): Switch to simpler dict type for supporting JSON mode
|
2024-04-22 16:46:33 -07:00 |
|
Ishaan Jaff
|
9886392694
|
Merge pull request #3226 from BerriAI/litellm_alerting_fix
[Bug-Fix] Alerting - don't send hanging request alert on failed request
|
2024-04-22 16:32:35 -07:00 |
|
Ishaan Jaff
|
bd0d6bce0f
|
fix models displayed when logging in
|
2024-04-22 16:31:47 -07:00 |
|
Ishaan Jaff
|
8874eaa0b3
|
fix - track litellm_status=fail
|
2024-04-22 16:11:04 -07:00 |
|
Ishaan Jaff
|
517f577292
|
fix - dont send alert on fail request
|
2024-04-22 16:07:58 -07:00 |
|
Krish Dholakia
|
9a49065b0b
|
Merge pull request #3224 from BerriAI/litellm_prometheus_key_owner
fix(prometheus.py): add user tracking to prometheus
|
2024-04-22 15:45:26 -07:00 |
|
Ishaan Jaff
|
dbb06141a3
|
Merge pull request #3219 from BerriAI/litellm_ui_show_teams_ui
[UI-Fix] Show all teams on Admin UI
|
2024-04-22 15:22:14 -07:00 |
|
Ishaan Jaff
|
c777e5d9d2
|
Merge pull request #3223 from paul-gauthier/main
Added openrouter/meta-llama/llama-3-70b-instruct context and cost metrics
|
2024-04-22 15:18:07 -07:00 |
|
Krrish Dholakia
|
6ac0dba5c2
|
fix(prometheus.py): add user tracking to prometheus
|
2024-04-22 15:14:38 -07:00 |
|
Paul Gauthier
|
0a021a6fa2
|
Added openrouter/meta-llama/llama-3-70b-instruct context and cost metadata
Per https://openrouter.ai/models/meta-llama/llama-3-70b-instruct
Meta: Llama 3 70B Instruct
meta-llama/llama-3-70b-instruct
Updated Apr 18
8,192 context
$0.8/M input tkns
$0.8/M output tkns
|
2024-04-22 15:07:58 -07:00 |
|
Ishaan Jaff
|
aa73c115f6
|
Merge pull request #3205 from bllchmbrs/patch-1
Update langsmith_integration.md
|
2024-04-22 14:21:25 -07:00 |
|
Ishaan Jaff
|
4fbb92e9e9
|
Merge pull request #3218 from BerriAI/ui_cleanup_text_input
[UI-Polish] Cleanup Inputting Key Name, Team Name, User Email
|
2024-04-22 14:20:29 -07:00 |
|
Ishaan Jaff
|
50bbd188fb
|
ui - show all teams on ui
|
2024-04-22 14:15:50 -07:00 |
|
Ishaan Jaff
|
cd3b2a21c1
|
ui - find all teams
|
2024-04-22 14:15:09 -07:00 |
|
Ishaan Jaff
|
877c4e27f4
|
Merge pull request #3212 from BerriAI/ui_increase_default_session_time
UI - increase default session time to 2 hours
|
2024-04-22 13:46:18 -07:00 |
|
Ishaan Jaff
|
1a945092de
|
Merge pull request #3210 from BerriAI/litellm_ui_round_up_team_spend_2_decimals
[UI] round up team spend to 2 decimals + diversify legend for team spend
|
2024-04-22 13:45:52 -07:00 |
|
Ishaan Jaff
|
753bed86e5
|
ui - clean up order
|
2024-04-22 13:44:37 -07:00 |
|
Krrish Dholakia
|
d4bca6707b
|
ci(proxy_server_config.yaml): use redis for usage-based-routing-v2
|
2024-04-22 13:34:36 -07:00 |
|
Ishaan Jaff
|
40ae951634
|
ui - cleanup input text boxes
|
2024-04-22 13:34:23 -07:00 |
|
Ishaan Jaff
|
69121360ba
|
fix text input box on ui
|
2024-04-22 13:29:48 -07:00 |
|
Ishaan Jaff
|
860f20d1ab
|
ui - cleanup litellm logo
|
2024-04-22 13:27:42 -07:00 |
|
Krrish Dholakia
|
ff30dc3cf9
|
bump: version 1.35.19 → 1.35.20
|
2024-04-22 13:02:06 -07:00 |
|
Krrish Dholakia
|
a520e1bd6f
|
fix(router.py): add random shuffle and tpm-based shuffle for async shuffle logic
|
2024-04-22 12:58:59 -07:00 |
|
David Manouchehri
|
c643e04ada
|
improve(vertex_ai.py): Add frequency_penalty and presence_penalty.
|
2024-04-22 18:02:59 +00:00 |
|
Krrish Dholakia
|
c015e5e2c6
|
bump: version 1.35.18 → 1.35.19
|
2024-04-22 10:54:52 -07:00 |
|
Krrish Dholakia
|
1e9487f639
|
refactor(main.py): trigger new build
|
2024-04-22 10:54:35 -07:00 |
|
Krrish Dholakia
|
be4a3de27c
|
fix(utils.py): support deepinfra response object
|
2024-04-22 10:51:11 -07:00 |
|
Ishaan Jaff
|
bb065f64c6
|
increase ui default session time to 2 hours
|
2024-04-22 10:00:53 -07:00 |
|
David Manouchehri
|
1a7eec5786
|
improve(vertex_ai.py): Switch to simpler dict type.
|
2024-04-22 17:00:37 +00:00 |
|
Ishaan Jaff
|
127a030a5f
|
ui - simplify team spend color scheme
|
2024-04-22 09:53:05 -07:00 |
|
Ishaan Jaff
|
f54982a560
|
fix - round spend to 2 decimals
|
2024-04-22 09:17:40 -07:00 |
|
Ishaan Jaff
|
b82dd29c99
|
Merge pull request #3209 from BerriAI/litellm_show_langfuse_link_slack_alerts
[Feat]- show langfuse trace in slack alerts
|
2024-04-22 08:55:08 -07:00 |
|
Ishaan Jaff
|
094583f18e
|
feat - show langfuse trace in alerts
|
2024-04-22 08:51:46 -07:00 |
|