Krish Dholakia
|
1c6f6592ea
|
Merge pull request #3153 from BerriAI/litellm_usage_based_routing_v2_improvements
usage based routing v2 improvements - unit testing + *NEW* async + sync 'pre_call_checks'
|
2024-04-18 22:16:16 -07:00 |
|
Krrish Dholakia
|
5bb73dc9c0
|
fix(router.py): instrument pre-call-checks for all openai endpoints
|
2024-04-18 21:54:25 -07:00 |
|
Krrish Dholakia
|
308a6e11f8
|
fix(lowest_tpm_rpm_v2.py): ensure backwards compatibility for python 3.8
|
2024-04-18 21:42:35 -07:00 |
|
Krrish Dholakia
|
376ee4e9d7
|
fix(test_lowest_tpm_rpm_routing_v2.py): unit testing for usage-based-routing-v2
|
2024-04-18 21:38:00 -07:00 |
|
Ishaan Jaff
|
94ea8c44a7
|
(ci/cd) vertexai - skip 429 errors
|
2024-04-18 21:27:22 -07:00 |
|
Ishaan Jaff
|
1a780b8bc0
|
(ci/cd) run again
|
2024-04-18 21:04:07 -07:00 |
|
Ishaan Jaff
|
f0d58d27d0
|
Merge pull request #3152 from BerriAI/litellm_show_api_base_hanging_req
[Fix] show api base hanging request alerts
|
2024-04-18 21:02:30 -07:00 |
|
Ishaan Jaff
|
532e252559
|
fix - show api_base in hanging requests
|
2024-04-18 21:01:26 -07:00 |
|
Ishaan Jaff
|
653dc44c08
|
fix - show api base on hanging requests
|
2024-04-18 20:58:02 -07:00 |
|
Ishaan Jaff
|
ecc770be00
|
fix - show api base on hanging requests
|
2024-04-18 20:57:22 -07:00 |
|
Ishaan Jaff
|
29b1745a43
|
(ci/cd) run again
|
2024-04-18 20:35:21 -07:00 |
|
Ishaan Jaff
|
d2ab6dcf7f
|
bump: version 1.35.13 → 1.35.14
|
2024-04-18 20:23:58 -07:00 |
|
Ishaan Jaff
|
44207fd404
|
fix - view spend per user
|
2024-04-18 20:23:04 -07:00 |
|
Ishaan Jaff
|
446b72b903
|
ui - new build
|
2024-04-18 20:18:50 -07:00 |
|
Ishaan Jaff
|
701eb5859b
|
Merge pull request #3151 from BerriAI/litellm_allow_set_get_alerts
[UI] View all alert types
|
2024-04-18 20:12:54 -07:00 |
|
Ishaan Jaff
|
e0edbfe8a1
|
Merge pull request #3148 from BerriAI/ui_show_keys_in_users_page
ui - show key_aliases on `Users` Tab
|
2024-04-18 20:11:47 -07:00 |
|
Ishaan Jaff
|
fa887dbff2
|
fix - GetLLMProvider excepton error raise
|
2024-04-18 20:10:37 -07:00 |
|
Ishaan Jaff
|
977b030dd9
|
ui - show all alert types when getting all callbacks
|
2024-04-18 20:08:13 -07:00 |
|
Ishaan Jaff
|
41823db230
|
ui - show all alert types
|
2024-04-18 20:04:34 -07:00 |
|
Krrish Dholakia
|
72691e05f4
|
fix(tpm_rpm_routing_v2.py): fix tpm rpm routing
|
2024-04-18 20:01:22 -07:00 |
|
Ishaan Jaff
|
ee622e248d
|
Merge pull request #3149 from BerriAI/ui_clean_up_models
[UI] - Models Page - Place litellm params in an accordion
|
2024-04-18 19:49:00 -07:00 |
|
Ishaan Jaff
|
ec352b3bbf
|
ui - place litellm params in an accordion
|
2024-04-18 19:22:34 -07:00 |
|
Krish Dholakia
|
741a18a040
|
Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking
feat(prometheus_services.py): emit proxy latency for successful llm api requests
|
2024-04-18 19:10:58 -07:00 |
|
Ishaan Jaff
|
19a3e6efaf
|
ui - show keys user owns on ui
|
2024-04-18 18:27:41 -07:00 |
|
Ishaan Jaff
|
34192e4dd4
|
Merge pull request #3146 from BerriAI/litellm_improve_users_pg
[UI]- show user_emails in Users Tab
|
2024-04-18 17:34:48 -07:00 |
|
Ishaan Jaff
|
27333d17e2
|
fix order by spend
|
2024-04-18 17:33:38 -07:00 |
|
Krrish Dholakia
|
852455e7fc
|
fix(caching.py): fix error logging
|
2024-04-18 17:21:10 -07:00 |
|
Ishaan Jaff
|
03b4652af1
|
fix return key aliases on /user/info
|
2024-04-18 17:16:52 -07:00 |
|
Krrish Dholakia
|
124aae7579
|
fix(factory.py): add mistral config to known tokenizer configs
|
2024-04-18 17:11:52 -07:00 |
|
Krrish Dholakia
|
27ea7b5fef
|
fix(caching.py): add call type to service logger object
|
2024-04-18 16:53:34 -07:00 |
|
Ishaan Jaff
|
e388da84e9
|
UI - show user_emails
|
2024-04-18 16:41:06 -07:00 |
|
Krrish Dholakia
|
51cc8dd95b
|
fix(proxy/utils.py): add prometheus failed db request tracking
|
2024-04-18 16:30:29 -07:00 |
|
Krrish Dholakia
|
cdfd873713
|
fix(proxy/utils.py): add call type and duration to proxy_logging failure calls
this is for tracking failed db requests on prometheus
|
2024-04-18 16:24:36 -07:00 |
|
Ishaan Jaff
|
bb07c5fdc5
|
Merge pull request #3112 from BerriAI/litellm_add_alert_types
[Feat] Allow user to select slack alert types to Opt In to
|
2024-04-18 16:21:33 -07:00 |
|
Ishaan Jaff
|
c9b28209e2
|
Merge pull request #3136 from Manouchehri/cohere-chat-seed-1
(feat) - Add seed to Cohere Chat.
|
2024-04-18 16:20:48 -07:00 |
|
Ishaan Jaff
|
3cbbbd5861
|
Merge pull request #3142 from BerriAI/litellm_slack_alerting_show_model_passed
[Fix] Show `model` passed on `"400: {'error': 'Invalid model name passed in mode` errors 👻
|
2024-04-18 16:18:17 -07:00 |
|
Krrish Dholakia
|
d08a36fb58
|
docs(prometheus.md): add litellm service metrics to prometheus docs
|
2024-04-18 16:06:12 -07:00 |
|
Krrish Dholakia
|
7f5bcf38b7
|
feat(prometheus_services.py): emit proxy latency for successful llm api requests
uses prometheus histogram for this
|
2024-04-18 16:04:35 -07:00 |
|
Ishaan Jaff
|
5cd0fcd44e
|
Merge pull request #3140 from BerriAI/litellm_trim_messages
[Fix] Slack Alerting - trim messages to first 100 chars
|
2024-04-18 16:01:39 -07:00 |
|
Ishaan Jaff
|
8a3937a7ae
|
test - invalid model name passed test
|
2024-04-18 15:46:49 -07:00 |
|
Ishaan Jaff
|
3125b9fbba
|
fix - show model passed in on Invalid model name passed in error
|
2024-04-18 15:43:30 -07:00 |
|
Ishaan Jaff
|
d6e3f587fe
|
fix trim messages to first 100 chars
|
2024-04-18 15:21:31 -07:00 |
|
David Manouchehri
|
e22f22e0a9
|
(feat) - Add seed to Cohere Chat.
|
2024-04-18 20:57:06 +00:00 |
|
Ishaan Jaff
|
0f941678b4
|
Merge pull request #3130 from BerriAI/litellm_show_vertex_project_exceptions
[FIX] - show vertex_project, vertex_location in Vertex AI exceptions
|
2024-04-18 13:18:20 -07:00 |
|
Ishaan Jaff
|
1ba216627a
|
fix - pass kwargs to exception_type
|
2024-04-18 12:58:30 -07:00 |
|
Krrish Dholakia
|
1b98503be3
|
docs(prometheus.md): add prometheus failed llm api requests to docs
|
2024-04-18 12:55:46 -07:00 |
|
Ishaan Jaff
|
177bc683b3
|
fix - track vertex_location and vertex_project in vertex exceptions
|
2024-04-18 12:53:33 -07:00 |
|
Krrish Dholakia
|
f6ac469573
|
fix(prometheus.py): fix metric name to be more accurate
change metric name from litellm_failed_requests_metric to litellm_llm_api_failed_requests_metric
|
2024-04-18 12:30:44 -07:00 |
|
Krrish Dholakia
|
deccde6be1
|
fix(utils.py): support prometheus failed call metrics
|
2024-04-18 12:29:15 -07:00 |
|
Ishaan Jaff
|
2a18f5b8a9
|
fix - show _vertex_project, _vertex_location in exceptions
|
2024-04-18 11:48:43 -07:00 |
|