Commit graph

6072 commits

Author SHA1 Message Date
Krish Dholakia
1c6f6592ea Merge pull request #3153 from BerriAI/litellm_usage_based_routing_v2_improvements
usage based routing v2 improvements - unit testing + *NEW* async + sync 'pre_call_checks'
2024-04-18 22:16:16 -07:00
Krrish Dholakia
5bb73dc9c0 fix(router.py): instrument pre-call-checks for all openai endpoints 2024-04-18 21:54:25 -07:00
Krrish Dholakia
308a6e11f8 fix(lowest_tpm_rpm_v2.py): ensure backwards compatibility for python 3.8 2024-04-18 21:42:35 -07:00
Krrish Dholakia
376ee4e9d7 fix(test_lowest_tpm_rpm_routing_v2.py): unit testing for usage-based-routing-v2 2024-04-18 21:38:00 -07:00
Ishaan Jaff
94ea8c44a7 (ci/cd) vertexai - skip 429 errors 2024-04-18 21:27:22 -07:00
Ishaan Jaff
1a780b8bc0 (ci/cd) run again 2024-04-18 21:04:07 -07:00
Ishaan Jaff
532e252559 fix - show api_base in hanging requests 2024-04-18 21:01:26 -07:00
Ishaan Jaff
653dc44c08 fix - show api base on hanging requests 2024-04-18 20:58:02 -07:00
Ishaan Jaff
ecc770be00 fix - show api base on hanging requests 2024-04-18 20:57:22 -07:00
Ishaan Jaff
29b1745a43 (ci/cd) run again 2024-04-18 20:35:21 -07:00
Ishaan Jaff
44207fd404 fix - view spend per user 2024-04-18 20:23:04 -07:00
Ishaan Jaff
446b72b903 ui - new build 2024-04-18 20:18:50 -07:00
Ishaan Jaff
701eb5859b Merge pull request #3151 from BerriAI/litellm_allow_set_get_alerts
[UI] View all alert types
2024-04-18 20:12:54 -07:00
Ishaan Jaff
fa887dbff2 fix - GetLLMProvider excepton error raise 2024-04-18 20:10:37 -07:00
Ishaan Jaff
977b030dd9 ui - show all alert types when getting all callbacks 2024-04-18 20:08:13 -07:00
Krrish Dholakia
72691e05f4 fix(tpm_rpm_routing_v2.py): fix tpm rpm routing 2024-04-18 20:01:22 -07:00
Krish Dholakia
741a18a040 Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking
feat(prometheus_services.py): emit proxy latency for successful llm api requests
2024-04-18 19:10:58 -07:00
Ishaan Jaff
34192e4dd4 Merge pull request #3146 from BerriAI/litellm_improve_users_pg
[UI]- show user_emails in Users Tab
2024-04-18 17:34:48 -07:00
Ishaan Jaff
27333d17e2 fix order by spend 2024-04-18 17:33:38 -07:00
Krrish Dholakia
852455e7fc fix(caching.py): fix error logging 2024-04-18 17:21:10 -07:00
Ishaan Jaff
03b4652af1 fix return key aliases on /user/info 2024-04-18 17:16:52 -07:00
Krrish Dholakia
124aae7579 fix(factory.py): add mistral config to known tokenizer configs 2024-04-18 17:11:52 -07:00
Krrish Dholakia
27ea7b5fef fix(caching.py): add call type to service logger object 2024-04-18 16:53:34 -07:00
Krrish Dholakia
51cc8dd95b fix(proxy/utils.py): add prometheus failed db request tracking 2024-04-18 16:30:29 -07:00
Krrish Dholakia
cdfd873713 fix(proxy/utils.py): add call type and duration to proxy_logging failure calls
this is for tracking failed db requests on prometheus
2024-04-18 16:24:36 -07:00
Ishaan Jaff
bb07c5fdc5 Merge pull request #3112 from BerriAI/litellm_add_alert_types
[Feat] Allow user to select slack alert types to Opt In to
2024-04-18 16:21:33 -07:00
Ishaan Jaff
c9b28209e2 Merge pull request #3136 from Manouchehri/cohere-chat-seed-1
(feat) - Add seed to Cohere Chat.
2024-04-18 16:20:48 -07:00
Ishaan Jaff
3cbbbd5861 Merge pull request #3142 from BerriAI/litellm_slack_alerting_show_model_passed
[Fix] Show `model` passed on `"400: {'error': 'Invalid model name passed in mode` errors 👻
2024-04-18 16:18:17 -07:00
Krrish Dholakia
7f5bcf38b7 feat(prometheus_services.py): emit proxy latency for successful llm api requests
uses prometheus histogram for this
2024-04-18 16:04:35 -07:00
Ishaan Jaff
5cd0fcd44e Merge pull request #3140 from BerriAI/litellm_trim_messages
[Fix] Slack Alerting -  trim messages to first 100 chars
2024-04-18 16:01:39 -07:00
Ishaan Jaff
8a3937a7ae test - invalid model name passed test 2024-04-18 15:46:49 -07:00
Ishaan Jaff
3125b9fbba fix - show model passed in on Invalid model name passed in error 2024-04-18 15:43:30 -07:00
Ishaan Jaff
d6e3f587fe fix trim messages to first 100 chars 2024-04-18 15:21:31 -07:00
David Manouchehri
e22f22e0a9 (feat) - Add seed to Cohere Chat. 2024-04-18 20:57:06 +00:00
Ishaan Jaff
0f941678b4 Merge pull request #3130 from BerriAI/litellm_show_vertex_project_exceptions
[FIX] -  show vertex_project, vertex_location in Vertex AI exceptions
2024-04-18 13:18:20 -07:00
Ishaan Jaff
1ba216627a fix - pass kwargs to exception_type 2024-04-18 12:58:30 -07:00
Ishaan Jaff
177bc683b3 fix - track vertex_location and vertex_project in vertex exceptions 2024-04-18 12:53:33 -07:00
Krrish Dholakia
f6ac469573 fix(prometheus.py): fix metric name to be more accurate
change metric name from litellm_failed_requests_metric to litellm_llm_api_failed_requests_metric
2024-04-18 12:30:44 -07:00
Krrish Dholakia
deccde6be1 fix(utils.py): support prometheus failed call metrics 2024-04-18 12:29:15 -07:00
Ishaan Jaff
2a18f5b8a9 fix - show _vertex_project, _vertex_location in exceptions 2024-04-18 11:48:43 -07:00
Ishaan Jaff
d178916048 fix - test alerting 2024-04-18 11:40:40 -07:00
Ishaan Jaff
670476353f ui - set alert_types 2024-04-18 11:13:05 -07:00
Ishaan Jaff
2580d55539 add alert_types to config.yaml 2024-04-18 08:51:07 -07:00
Krish Dholakia
fe5c63e80b Merge pull request #3105 from BerriAI/litellm_fix_hashing
fix(_types.py): hash api key in UserAPIKeyAuth
2024-04-18 08:16:24 -07:00
Krrish Dholakia
388ecadd5d refactor(main.py): trigger new build 2024-04-18 07:34:09 -07:00
Krrish Dholakia
280d9b4405 fix(utils.py): function_setup empty message fix
fixes https://github.com/BerriAI/litellm/issues/2858
2024-04-18 07:32:29 -07:00
Krrish Dholakia
64fe5b146c fix(utils.py): fix azure streaming logic 2024-04-18 07:08:36 -07:00
Ishaan Jaff
58eea0f330 feat return alert types on /config/get/callback 2024-04-17 21:02:10 -07:00
Ishaan Jaff
91109f32e3 Merge pull request #3114 from BerriAI/litellm_fix_alerting_users
[FIX} Repeat Slack Alerts triggered for "User Crossed Budget"
2024-04-17 20:38:27 -07:00
Ishaan Jaff
a97f8a40c1 fix - user based alerting 2024-04-17 20:35:29 -07:00